<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>A 65-nm Humanoid Robot System-on-Chip Using Time-Domain 3-D Footstep Planning and Mixed-Signal ZMP Gait Scheduler With Inverse Kinematics</title></titleStmt>
			<publicationStmt>
				<publisher>IEEE</publisher>
				<date>04/01/2025</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10586759</idno>
					<idno type="doi">10.1109/JSSC.2025.3541484</idno>
					<title level='j'>IEEE Journal of Solid-State Circuits</title>
<idno>0018-9200</idno>
<biblScope unit="volume">60</biblScope>
<biblScope unit="issue">4</biblScope>					

					<author>Qiankai Cao</author><author>Juin Chuen_Oh</author><author>Jie Gu</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[This work presents a footstep planning chip for humanoid robot. It integrates a time-domain graph search engine for high-level 3-D footstep planning and a mixed-signal zero moment point (ZMP) gait scheduler with neural inverse kinematics, enabling efficient low-level motion control. The key contributions of this work include a time-domain graph search engine for 3-D footstep planning, featuring 3-D search capabilities, D * replanning for real-time adjustments, redundant path blocking, and efficient result readout. In addition, it introduces an energy-efficient mixed-signal ZMP gait scheduler for maintaining robot balance, along with a time-domain neural-network-based inverse kinematics module for controlling robot joints. This work is demonstrated in situ on a fully assembled robot using the 65-nm system-on-chip (SoC), achieving 2.7× energy savings for graph search and an 18.4× improvement in energy efficiency for motion control compared with prior works.Index Terms-3-D footstep planning, humanoid robot, inverse kinematics, mixed-signal, system-on-chip (SoC), zero moment point (ZMP).]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>I. INTRODUCTION</head><p>A S AUTONOMOUS robotic systems have observed rapid growth in recent years, and humanoid robots are recently drawing significant interest. Compared with wheeled mobile robots, humanoid robots with human-like joint systems enable high degree-of-freedom (DOF) locomotion for complex tasks, e.g., search and rescue, housework, or medical treatment. However, there are significant challenges in motion control of such robots. First, it is complicated and computationally heavy for 3-D footstep planning on humanoid robot <ref type="bibr">[1]</ref>, <ref type="bibr">[2]</ref> with added dimensions of height and special movements, e.g., stepping over or stepping onto objects. Most research in this area has focused on treating footstep planning as a graph search problem, commonly solved using the A * algorithm <ref type="bibr">[3]</ref>, <ref type="bibr">[4]</ref>. The dominant strategy involves computing an extended sequence of footsteps that lead all the way to the target, which the robot follows closely using precise motion tracking. These plans typically consist of 30 or more steps and require a few seconds of computation time. At this timescale, replanning while the robot is in motion becomes impractical. However, real-world environments frequently present unexpected events that demand immediate replanning <ref type="bibr">[5]</ref>. Moving obstacles can change direction unpredictably, requiring quick responses to avoid collisions. To tackle these challenges, it is crucial to have a controller with a short and reliable response time. Second, the complex 10-20 DOFs' kinematic model for robot joint control leads to high computation workload. The kinematics problem plays a crucial role in robotic motion control. Forward kinematics refers to mapping from joint space to Cartesian task space, while inverse kinematics involves mapping from Cartesian task space to joint space <ref type="bibr">[6]</ref>. Due to the complexity of inverse kinematics <ref type="bibr">[6]</ref>, it is typically more challenging to solve compared with forward kinematics. Moreover, when a robot manipulator performs motion control, the computational demands of inverse kinematics can significantly consume CPU resources, slowing down the robot's performance. Addressing this issue is therefore essential for improving efficiency. Third, to maintain balance, special trajectory control of the robot's center of mass (CoM) through zero moment point (ZMP) needs to be judiciously performed for fall prevention. To enable certain behaviors in humanoid robots, such as walking, motion planners must simultaneously consider the dynamic effects of the resulting motion <ref type="bibr">[7]</ref>. This is because these robots rely on ground reaction forces at their supporting foot or feet, which are inherently unilateral. This constraint is effectively captured by the concept of the ZMP <ref type="bibr">[8]</ref>. The unilateral nature of the contact forces translates into the requirement that the ZMP must remain within the boundaries of the supporting polygon. When this condition is satisfied, the robot avoids rotation along the edges of the supporting polygon, ensuring stable contact as long as friction remains sufficient. This is crucial for maintaining stable control of humanoid robots.</p><p>Previous research has explored 2-D path planning, including wavefront expansion of the A * algorithm for graph searches <ref type="bibr">[3]</ref>, <ref type="bibr">[9]</ref>, the design of a path-planning processor for 2-D/3-D autonomous navigation of micro robots <ref type="bibr">[10]</ref>, and the development of an FPGA-based motion-planning accelerator for dual-arm robot manipulation systems <ref type="bibr">[11]</ref>. However, these studies do not support specialized humanoid movements, such as stepping on or over obstacles. There are also mixed-signal designs for robot control, including an oscillator-based NeuroSLAM accelerator for mobile wheeled robots <ref type="bibr">[12]</ref>, a time-domain mixed-signal neuromorphic accelerator with reinforcement learning for autonomous micro-robots <ref type="bibr">[13]</ref>, and an efficient mixed-signal accelerator developed for real-time swarm intelligence <ref type="bibr">[14]</ref>. These designs achieve remarkable energy efficiency for wheeled robots due to their mixed-signal nature. However, they cannot support more complex joint systems, such as those found in humanoid robots, which typically control many more motors than the 2-4-DoF systems used in wheeled robots. Some works are digital-based for robots but have not been demonstrated on real robots to showcase their full capabilities. For instance, a motion-control ASIC was designed primarily for industrial robot arms <ref type="bibr">[15]</ref>, a ray-casting accelerator was developed for edge robotics in augmented reality applications <ref type="bibr">[16]</ref>, and an artificial intelligence processor with PVT compensation was created for micro robots <ref type="bibr">[17]</ref>.</p><p>Despite the advancements in other works, a system-onchip (SoC) solution specifically for humanoid robots has been lacking until now. As depicted in Fig. <ref type="figure">1</ref>(a), a humanoid robot control system comprises a high-level controller and a low-level controller. The 3-D footstep planning module and ZMP-based scheduler collaborate to generate steps and the corresponding CoM trajectory toward the target. Once high-level plans are established, the detailed CoM trajectories are stored in memory and sent to the kinematic model with sequence control, after which joint control signals are converted into motor control signals. As shown in Fig. <ref type="figure">1(b)</ref>, compared with previous robot control platforms <ref type="bibr">[3]</ref>, <ref type="bibr">[10]</ref>, <ref type="bibr">[11]</ref>, <ref type="bibr">[14]</ref>, <ref type="bibr">[15]</ref>, this approach achieves a two-four orders of magnitude reduction in energy consumption for path planning, representing a significant improvement.</p><p>To address the aforementioned challenges and improve the performance of humanoid robot control, as shown in Fig. <ref type="figure">2</ref>, this work introduces a 65-nm mixed-signal SoC designed explicitly for humanoid robots, offering several key innovations: 1) a time-domain <ref type="bibr">[18]</ref>, <ref type="bibr">[21]</ref>, <ref type="bibr">[22]</ref> graph search engine for 3-D footstep planning featuring 3-D search, D * replanning <ref type="bibr">[19]</ref> for on-the-fly adjustment, blocking of redundant paths and efficient readout of search results; 2) a highly efficient mixed-signal ZMP gait scheduler, crucial for maintaining balance in humanoid robots; 3) a time-domain neural-networkbased inverse kinematic module for robot joint control; and 4) in situ demonstrations on a real assembled robot with the 65-nm SoC rendering 2.7&#215; overall energy saving for graph search and 18.4&#215; higher energy efficiency for motion control compared with prior works.</p><p>The rest of this article is organized as follows: Section II presents an overview of the SoC top-level architecture and the robot assembly. Section III delves into the time-domain graph search ASIC for 3-D footstep planning and D * replanning <ref type="bibr">[19]</ref>, including circuit details for the vertex, direction lock (DL), vertex lock and unlock module, delay cells, and scan chain tracing-back modules. Section IV covers the ZMP gait scheduler and neurokinematics for low-level motion control. Section V showcases the implementation and measurement results obtained from the test chip, along with real robot demonstrations and retrain methods with use cases. Section VI concludes this article. This article is an extended version of the conference publication in <ref type="bibr">[20]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>II. CHIP TOP-LEVEL ARCHITECTURE AND ROBOT SYSTEM</head><p>A. Chip Top-Level Architecture Fig. <ref type="figure">3</ref>(a) and (b) illustrates the top-level architecture of the chip, which includes: 1) a 40 &#215; 40 time-domain graph search engine with specialized mixed-signal circuits for high-level 3-D footstep planning; 2) a ZMP gait trajectory generator for controlling the CoM for robot balancing; 3) a hybrid time-digital domain neural network serving as an inverse kinematic estimator for joint control; and 4) a motor control module with UART to manage external motors via CAN bus. Upon completion of high-level footstep planning, the ZMP gait scheduler module passes the CoM trajectory in (X, Y, Z ) format to low-level joint control. The system then transitions to low-level control, where the neurokinematic module converts the Cartesian space of end-effectors into the 10-DoF joint space for each motor. Final motion control commands are transmitted through the motor control module using the CAN bus and UART protocol.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Robot Assembly and Special Movement</head><p>Fig. <ref type="figure">4</ref> showcases the assembled robot system <ref type="bibr">[23]</ref>, highlighting both the front and back views of its motor and control systems. The system includes a mini-FPGA that imports control signals and communicates with the test chip via a scan cable and driver board for direct motor control. The test chip, mounted on a demo board, transmits joint angle control signals through the UART interface using a CAN bus. In this work, the map input is processed by the test chip for 3-D footstep planning, followed by the generation of the ZMP trajectory pattern. The resulting data are then transmitted to the kinematic module for low-level motor control.</p><p>Compared with wheeled robots <ref type="bibr">[13]</ref>, <ref type="bibr">[14]</ref>, the motion controller for humanoid robots introduces several unique features tailored to their distinct movements. Instead of visualizing the environment as a 2-D grid map, the humanoid robot converts a 3-D floor map into a 3-D grid map that includes the height of objects. This allows the robot to decide whether to take shortcuts by stepping over obstacles that would typically block the way in a 2-D scenario. The ability to step on or over obstacles enables the robot to navigate around them and reach its target via a shorter route. In addition, the chip supports efficient path replanning on the go when the environment changes suddenly, and it blocks redundant paths to make quick decisions with reduced power consumption. These features enhance the effectiveness and energy efficiency of humanoid robot control.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>III. TIME-DOMAIN GRAPH ASIC FOR 3-D FOOTSTEP PLANNING AND D * REPLANNING</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. 3-D Footstep Planning and D * Algorithm for Replanning</head><p>Three-dimensional footstep planning <ref type="bibr">[24]</ref>, <ref type="bibr">[25]</ref>, <ref type="bibr">[26]</ref> can be modeled as a shortest path problem using the A * algorithm <ref type="bibr">[27]</ref>. In Fig. <ref type="figure">5</ref>(a), the floor map is transformed into a grid map, where white grids represent walkable areas and darker grids indicate objects or platforms of varying heights. Some objects are marked as traversable if their height is manageable for the robot to step over or on, while others are too tall and are considered obstacles, requiring the robot to find an alternate route. For instance, a rod on the map might be low enough for the robot to step over, so a 3-D footstep planning approach would allow the robot to do so. In contrast, a 2-D path-finding method would still treat the rod as an obstacle, forcing a detour and resulting in a much longer path. After the initial 3-D footstep planning is completed, replanning is triggered when a change in the environment, such as a fallen pillar, is detected, as illustrated in Fig. <ref type="figure">5(b)</ref>. The robot follows the original 3-D footstep plan, indicated by the blue dotted arrow, until it encounters the new obstacle. At this point, a replanning algorithm is used to generate an updated path, shown by the yellow dotted arrow, which takes the environmental change into account. The robot then abandons the previous route and follows the newly generated path from the updated starting point toward the target. Unlike the widely used A * algorithm, this work uses a more advanced D * replanning algorithm <ref type="bibr">[19]</ref>, allowing the robot to adjust its path dynamically while moving toward its destination. The steps of the D * algorithm are detailed in Fig. <ref type="figure">6</ref>. First, mapping information is scanned into the graph ASIC for shortest path calculation. Next, the shortest path is identified by the graph ASIC, represented by the red arrows. When an environmental change occurs, the affected nodes are detected, and a global unlock signal is sent to these nodes and their successorsdefined as all the vertices originating from the changed nodes. The connections to these successors are shown with blue arrows. Finally, a new starting point is set at the changed node, triggering another round of graph search that only updates the direction of the changed nodes and their successors. This D * approach achieves 1.8&#215; energy savings compared with the traditional A * algorithm, which requires recalculating the entire map. In this example, the changed nodes are blocked, causing the new path to detour around them, following the yellow arrows to reach the target.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Time-Domain Graph ASIC for 3-D Footstep Planning</head><p>Fig. <ref type="figure">7</ref> elaborates details of the 3-D footstep planning and the time-domain graph search engine. While 2-D occupancy grid maps typically meet the needs of wheeled robots <ref type="bibr">[13]</ref>, humanoid robots necessitate additional terrain height information to account for special movements of stepping over/onto objects in 3-D space. Different from the widely used A * algorithm, a more sophisticated D * replanning algorithm <ref type="bibr">[19]</ref> was adopted in this work, enabling the robot to adjust its path while heading to the destination. In the time-domain circuit implementation shown in Fig. <ref type="figure">7</ref>, a 40 &#215; 40 vertex array is deployed to generate locomotion trajectory. The mapping information, e.g., distance of single step and height of stairs, are mapped into a programmable 2-bit delay cell at interconnect of the vertexes. Inside each vertex, time-domain signals are passed from eight directions, including four planar directions similar as prior work <ref type="bibr">[3]</ref> and another four directions for the new dimension of height for stepping-over or steppingon movements. Each "vertex lock" circuit includes multiple NAND, NOR gates, and a DFF for catching the earliest time-domain signal and producing a "Lock" signal. A set of DL modules are used to record the direction of the first-come time-domain signal. The time-domain signals propagate as a wavefront through the whole map resulting in the shortest path being locked in the DL circuits.</p><p>Besides static planning, this work also supports D * on-thefly replanning when the environment is changed, e.g., an object moved by the robot. Finally, rather than a full memory scan outputting all the direction values as in <ref type="bibr">[3]</ref>, this design enables   only tracing back along the shortest path using the direction information. As shown in Fig. <ref type="figure">8</ref>, the shortest path is calculated by the graph search engine and is indicated by a red arrow. During the calculation, the vertices along the path are linked based on the DL results. In the waveform, "0" signifies the earliest arriving direction. If multiple "0"s appear, it means the earliest arriving signals come from multiple directions, but only one direction is selected as the result. For example, if the target vertex's DL_in shows a "0" in the "E" direction, the vertex in the "E" direction will be linked to the target vertex. The DL_out will then be scanned into the target vertex after all the DL_in bits are sequentially scanned out, following the sequence from steps 5 to 4, step 3, and so on, all the way back to the start vertex.</p><p>As shown in Fig. <ref type="figure">9</ref>, thanks to the described technique, this work achieves a 29.1&#215; speedup compared with the full memory scan approach used in previous research <ref type="bibr">[3]</ref>. The replanning process delivers an average of 1.8&#215; energy savings compared with methods that do not use replanning. Overall, the study realizes a 2.7&#215; reduction in energy consumption for path planning compared with prior work <ref type="bibr">[3]</ref>, due to the low-power techniques implemented in this study.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>C. Circuit Design Detail of Graph Vertex</head><p>As illustrated in Fig. <ref type="figure">10</ref>(a), the delay cell encodes mapping information, such as single-step distances and stair heights, into a programmable 2-bit delay cell located at the vertex interconnections. In the figure, "00" denotes a standard single step, "01" represents a step-on, "10" signifies a step-over, and "11" is used for debugging purposes. The delay of each cell reflects the time cost of its corresponding configuration. An analog signal, Vtune, globally adjusts the delay across all the cells. Each vertex receives time-domain signals from eight directions. In Fig. <ref type="figure">10(b</ref>) and (c), a set of DL modules record the direction of the first-arriving signal. For example, in the DL (W) module, if W is not the earliest signal, a "trig" pulse sets DL_W to "1" to indicate the direction. If W is the earliest signal, no "trig" pulse is generated, leaving DL_W at "0." Each "vertex lock" circuit includes multiple NAND and NOR gates, plus a D flip-flop (DFF) to capture the earliest signal and generate a "Lock" signal for further processing. When the direction "S" is the earliest arriving signal, the DFF clock signal toggles, producing the "Lock" signal simultaneously.</p><p>In addition to static planning, this work supports D * onthe-fly replanning for dynamic environments. Sometimes, the environment could change due to robot behavior, such as a robot kicking or moving an object with robot hands. As a result, the 3-D footstep planning needs to be updated based on the latest map information. If redoing the entire map in A star algorithm, it will be unnecessary, as the robot has already moved to the new start point. So, a D star algorithm enables only partially redoing 3-D footstep planning and efficiently finding the shortest path using updated map information. In hardware implementation, the unlock function is managed by the circuit shown in the bottom right in Fig. <ref type="figure">10</ref>. For the changed nodes, an unlock_global signal is issued to reset the vertex lock DFF. For their successors, the unlock signal propagates along with the DL information to identify all the nodes that need to be reset. For blocking functions, a predefined configuration will be set for those nodes that are at the corner of the map. It will stop the vertices from propagating the pulses to avoid unnecessary path planning. This approach achieves an average energy saving of 32.9% across 50 random map search tasks.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>IV. ZMP GAIT SCHEDULER AND NEUROKINEMATICS FOR LOW-LEVEL MOTION CONTROL</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Cart-Table Model and ZMP Gait Scheduler</head><p>Following our discussion on high-level 3-D footstep planning, let us move on to low-level gait scheduler and neurokinematic for motion control. Unlike robot cars or other types of multi-legged robots, a humanoid robot must consider maintaining balance. To address this, we developed a ZMP-based gait scheduler to maintain stability while walking. As illustrated in Fig. <ref type="figure">11</ref>, the dynamics of a walking robot can be described using a cart-table model. In the figure, to counteract the momentum generated by the car's gravity (CoM) "mg," the cart must move with an acceleration of &#7821;. And the torque &#964; ZMP at the supporting p x is shown as this formula. It is forced to be zero to control the robot to be stable. By considering the ZMP dynamics, we can design a gait scheduler to ensure the robot's CoM follows a specific trajectory, preventing the robot from falling. Fig. <ref type="figure">12</ref> provides details on the ZMP-based CoM control for the gait scheduler and the neurokinematic circuits for robot joint control. The ZMP is the location where the total moment of the robot at the ground is zero. For dynamic stability, the ZMP must remain within the support region of the robot. ZMP is used to establish the target trajectory for the robot's CoM. This mixed-signal ZMP phase generator achieves 3.4&#215; power savings compared with an equivalent digital solution.</p><p>A mixed-signal circuit <ref type="bibr">[35]</ref>, <ref type="bibr">[36]</ref>, <ref type="bibr">[37]</ref> featuring a VCO and multiplexers (MUXs) is used to generate a sinusoidal CoM   trajectory that ensures dynamic stability, commonly used for simple pattern generation. In practical humanoid gait planning, more flexible CoM trajectories, such as polynomial-based ones, are also widely adopted. In this work, a sinusoidal CoM trajectory (X, Y, Z ) is produced and encoded as time pulses for inverse kinematics computation. As shown in Fig. <ref type="figure">13</ref>, a ring oscillator with linear increased phases is used to convert digital sine wave-like CoM trajectory, x, y, z into time pulses. In comparison, digital solutions typically rely on LUTs, which require many flip-flops for implementation <ref type="bibr">[38]</ref>. In this work, digital information is encoded as time delays, reducing power consumption by avoiding the use of DFFs. In addition, the phases are split into ZMP_LSB and ZMP_MSB for TDMAC operation. This time-domain interface ensures compatibility with the subsequent neurokinematic module.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Neurokinematics for Inverse Kinematics</head><p>As shown in Fig. <ref type="figure">14</ref>, due to the highly complex trigonometric computation in IK, a neural network is used to approximate the calculation. A neural network consisting of a TDMAC as the hidden layer and a DMAC as the output layer is used to convert CoM trajectory (x, y, z) in Cartesian space into &#952; i in joint space, a process known as inverse kinematics. This calculation is particularly complex when the DoFs in robot systems are high. Fig. <ref type="figure">15</ref> details the hardware implementation of neurokinematics, featuring an 8-bit time-domain MAC <ref type="bibr">[18]</ref>, <ref type="bibr">[22]</ref>, <ref type="bibr">[28]</ref> developed using a bit partition technique. In this  design, a pulse from the ZMP module controls the propagation duration of the ring oscillator, while digital weights regulate the propagation speed of the ring. The result is captured by a counter, effectively performing an MAC operation in the time domain. To further enhance energy efficiency, the 8-bit weight is divided into LSB and MSB, with a pulse generator acting as a time-domain carry signal. When the LSB result counter is full, it triggers the pulse generator to create a carry pulse, which then controls the gating signal along with the ZMP input MSB.</p><p>As shown in Fig. <ref type="figure">16</ref>, the proposed neural kinematics module demonstrates a 2% loss compared with the ideal inverse kinematic model, with an additional 1% accuracy loss due to time-domain implementation. This is acceptable, as both the ZMP method can tolerate errors within the ZMP region, and the neural network is resilient to errors introduced by the proposed mixed-signal scheme. This method achieves a 7.5&#215; area saving using the time-domain MAC with bit partition technique, along with a 1.8&#215; latency reduction and a 12.1&#215; area saving for inverse kinematics compared with its digital counterpart.    derived from an ideal inverse kinematic model with functions such as arctan, arcsin, and arccos. A sine-wave-shaped CoM trajectory, generated under ZMP constraints, is used to compute joint angles (&#952; 1 foot, &#952; 2 ankle, &#952; 3 knee, &#952; 4 hip roll, and &#952; 5 hip pitch) via the ideal inverse kinematic model, shown as blue curves. This model serves as a reference for the neural network, with training aimed at minimizing the MSE between their outputs using ground-truth data as labels. Initially, the model undergoes the first round of training. However, errors due to quantization or time-domain variations arise, as shown in Fig. <ref type="figure">18</ref>(a). To address these errors, a second round of retraining is conducted, updating only the output layer, which significantly reduces the model's MSE, as depicted in Fig. <ref type="figure">18(b)</ref>. After optimization, the model is deployed on the robot to evaluate its performance. Including real CoM trajectories in training prevents unexpected Cartesian-to-joint space mappings, ensuring all possible trajectories are sampled to avoid model failure.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>C. Neural Estimator With Retrain Methodology</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>V. MEASUREMENT RESULTS AND EVALUATION FOR TEST CASE</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Chip Implementation</head><p>A mixed-signal 3-D footstep planning SoC was designed using a 65-nm CMOS process. The real-world demonstration setup, shown in Fig. <ref type="figure">19</ref>, includes the test chip and environment. Fig. <ref type="figure">20</ref> presents the chip micrograph and implementation details. The active die area measures 3.34 mm 2 (2 &#215; 1.67 mm), with a nominal supply voltage of 1.0 V, a maximum frequency of 1 MHz, and peak power consumption of 432.8 &#956;W at 1.2 V. The graph search engine features 40 &#215; 40 vertices, 1600 &#215; 8 edges, and a node delay of 1.1 ns at 1.2 V. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Measurement Results</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>C. Comparison to Prior Works</head><p>Table I illustrates a comparison table with prior works. In the comparison table, this work presents the first 3-D footstep planning SoC chips for humanoid robots. It supports both high-level footstep planning and low-level motion control. In terms of energy efficiency for control, this work   achieves 645 Hz/mW, which is 18.4&#215; higher than previous work, thanks to the mixed-signal circuit implementations. The energy efficiency for the neural network ranges from 3.2 to 6.5 TOPS/W. For path planning, this work demonstrates a more complex 3-D footstep planning with a 1.6&#215; higher search rate and an overall 2.7&#215; improvement in energy per task due to low-power features.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>D. Demonstration and Evaluation of Test Cases</head><p>Fig. <ref type="figure">23</ref> illustrates more complex graph scenarios involving blocking and replanning algorithms. In the baseline scenario, the robot can step over a narrow rod and make several turns to reach the target. In the blocking example, two corners of the graph are obstructed, preventing the time-domain signal from propagating through the blocked vertices. Consequently, the footstep planning <ref type="bibr">[29]</ref>, <ref type="bibr">[30]</ref> remains unchanged from the  baseline. In the environmentally changing scenario, the narrow rod becomes a wider obstacle, preventing the robot from stepping over it. As a result, the robot detours from its original path and finds a new route to reach the target. Fig. <ref type="figure">24</ref> shows the test board setup, which includes the test chip and LDOs to supply the chip's voltage source. An XTAL is used to generate a precise clock for producing motor control signals via the UART protocol. A level shifter converts the motor control signal from the on-chip 1.8 to 3.3 V for motor control. In the test environment, a 3 &#215; 3 m area with a 40 &#215; 40 grid map is constructed, featuring several paper boxes as obstacles <ref type="bibr">[31]</ref>, <ref type="bibr">[32]</ref>, <ref type="bibr">[33]</ref>, <ref type="bibr">[34]</ref>. A mop is placed on the floor, enabling the robot to step over the rod and find a shorter path compared with a wheeled robot. To further enhance the scalability of the proposed methods, the graph engine can TABLE I COMPARISON TABLE WITH PRIOR WORKS be adapted as a path planning engine for various types of robots, including quadrupeds and robotic cars. In addition, the low-level neurokinematic module is highly scalable and can be applied to any jointed robot, such as quadrupeds, which also require real-time inverse kinematic calculations. This flexibility makes the proposed system versatile and applicable to a wide range of robotic platforms, paving the way for broader adoption across diverse robotic applications.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>VI. CONCLUSION</head><p>This work presents a 65-nm SoC chip for humanoid robot control with in situ demonstration. It features a time-domain graph search engine for 3-D footstep planning. D * on-thefly replanning and a tracing back method along the shortest path with blocking of redundant paths are proposed. A mixedsignal ZMP-based gait scheduler is developed to maintain robot balance, and a mixed-signal neurokinematic module is developed for inverse kinematics in motion control. Overall, this approach achieves 2.7&#215; energy saving in graph search and an 18.4&#215; improvement in energy efficiency for motion control compared with previous works.</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0"><p>Authorized licensed use limited to: Northwestern University. Downloaded on May 01,2025 at 17:45:28 UTC from IEEE Xplore. Restrictions apply.</p></note>
		</body>
		</text>
</TEI>
