Implementation of Deep Deterministic Policy Gradients for Controlling Dynamic Bipedal Walking

Liu, Chujun; Lonsberry, Andrew; Nandor, Mark; Audu, Musa; Lonsberry, Alexander; Quinn, Roger

doi:10.3390/biomimetics4010028

Citation Details

Implementation of Deep Deterministic Policy Gradients for Controlling Dynamic Bipedal Walking

A control system for bipedal walking in the sagittal plane was developed in simulation. The biped model was built based on anthropometric data for a 1.8 m tall male of average build. At the core of the controller is a deep deterministic policy gradient (DDPG) neural network that was trained in GAZEBO, a physics simulator, to predict the ideal foot placement to maintain stable walking despite external disturbances. The complexity of the DDPG network was decreased through carefully selected state variables and a distributed control system. Additional controllers for the hip joints during their stance phases and the ankle joint during toe-off phase help to stabilize the biped during walking. The simulated biped can walk at a steady pace of approximately 1 m/s, and during locomotion it can maintain stability with a 30 kg·m/s impulse applied forward on the torso or a 40 kg·m/s impulse applied rearward. It also maintains stable walking with a 10 kg backpack or a 25 kg front pack. The controller was trained on a 1.8 m tall model, but also stabilizes models 1.4–2.3 m tall with no changes. more »

Award ID(s):: 1739800

PAR ID:: 10112280

Author(s) / Creator(s):: Liu, Chujun; Lonsberry, Andrew; Nandor, Mark; Audu, Musa; Lonsberry, Alexander; Quinn, Roger

Date Published:: 2019-03-01

Journal Name:: Biomimetics

Volume:: 4

Issue:: 1

ISSN:: 2313-7673

Page Range / eLocation ID:: 28

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.3390/biomimetics4010028

More Like this