Hierarchical Control and Learning of a Foraging CyberOctopus

Shih, Chia-Hsien; Naughton, Noel; Halder, Udit; Chang, Heng-Sheng; Kim, Seung_Hyun; Gillette, Rhanor; Mehta, Prashant_G; Gazzola, Mattia  (ORCID:000000032129379X)

doi:10.1002/aisy.202300088

Citation Details

Hierarchical Control and Learning of a Foraging CyberOctopus

Inspired by the unique neurophysiology of the octopus, a hierarchical framework is proposed that simplifies the coordination of multiple soft arms by decomposing control into high‐level decision‐making, low‐level motor activation, and local reflexive behaviors via sensory feedback. When evaluated in the illustrative problem of a model octopus foraging for food, this hierarchical decomposition results in significant improvements relative to end‐to‐end methods. Performance is achieved through a mixed‐modes approach, whereby qualitatively different tasks are addressed via complementary control schemes. Herein, model‐free reinforcement learning is employed for high‐level decision‐making, while model‐based energy shaping takes care of arm‐level motor execution. To render the pairing computationally tenable, a novel neural network energy shaping (NN‐ES) controller is developed, achieving accurate motions with time‐to‐solutions 200 times faster than previous attempts. The hierarchical framework is then successfully deployed in increasingly challenging foraging scenarios, including an arena littered with obstacles in 3D space, demonstrating the viability of the approach. more »

Award ID(s):: 2209322 1830881

PAR ID:: 10441484

Author(s) / Creator(s):: Shih, Chia-Hsien ; Naughton, Noel ; Halder, Udit ; Chang, Heng-Sheng ; Kim, Seung_Hyun ; Gillette, Rhanor ; Mehta, Prashant_G ; Gazzola, Mattia

Publisher / Repository:: Wiley Blackwell (John Wiley & Sons)

Date Published:: 2023-06-22

Journal Name:: Advanced Intelligent Systems

Volume:: 5

Issue:: 9

ISSN:: 2640-4567

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Journal Article:
https://doi.org/10.1002/aisy.202300088

More Like this