skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: A 3D‐Printed Self‐Learning Three‐Linked‐Sphere Robot for Autonomous Confined‐Space Navigation
Reinforcement learning control methods can impart robots with the ability to discover effective behavior, reducing their modeling and sensing requirements, and enabling their ability to adapt to environmental changes. However, it remains challenging for a robot to achieve navigation in confined and dynamic environments, which are characteristic of a broad range of biomedical applications, such as endoscopy with ingestible electronics. Herein, a compact, 3D‐printed three‐linked‐sphere robot synergistically integrated with a reinforcement learning algorithm that can perform adaptable, autonomous crawling in a confined channel is demonstrated. The scalable robot consists of three equally sized spheres that are linearly coupled, in which the extension and contraction in specific sequences dictate its navigation. The ability to achieve bidirectional locomotion across frictional surfaces in open and confined spaces without prior knowledge of the environment is also demonstrated. The synergistic integration of a highly scalable robotic apparatus and the model‐free reinforcement learning control strategy can enable autonomous navigation in a broad range of dynamic and confined environments. This capability can enable sensing, imaging, and surgical processes in previously inaccessible confined environments in the human body.  more » « less
Award ID(s):
1830958
PAR ID:
10370234
Author(s) / Creator(s):
 ;  ;  ;  ;  ;  ;  ;  ;  
Publisher / Repository:
Wiley Blackwell (John Wiley & Sons)
Date Published:
Journal Name:
Advanced Intelligent Systems
Volume:
3
Issue:
9
ISSN:
2640-4567
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. The integration of an ingestible dosage form with sensing, actuation, and drug delivery capabilities can enable a broad range of surgical‐free diagnostic and treatment strategies. However, the gastrointestinal (GI) tract is a highly constrained and complex luminal construct that fundamentally limits the size of an ingestible system. Recent advancements in mesoscale magnetic crawlers have demonstrated the ability to effectively traverse complex and confined systems by leveraging magnetic fields to induce contraction and bending‐based locomotion. However, the integration of functional components (e.g., electronics) in the proposed ingestible system remains fundamentally challenging. Herein, the creation of a centralized compartment in a magnetic robot by imparting localized flexibility (MR‐LF) is demonstrated. The centralized compartment enables MR‐LF to be readily integrated with modular functional components and payloads, such as commercial off‐the‐shelf electronics and medication, while preserving its bidirectionality in an ingestible form factor. The ability of MR‐LF to incorporate electronics, perform drug delivery, guide continuum devices such as catheters, and navigate air–water environments in confined lumens is demonstrated. The MR‐LF enables functional integration to create a highly integrated ingestible system that can ultimately address a broad range of unmet clinical needs. An interactive preprint version of the article can be found athttps://doi.org/10.22541/au.166274072.23086985/v1. 
    more » « less
  2. This paper focuses on inverse reinforcement learning (IRL) to enable safe and efficient autonomous navigation in unknown partially observable environments. The objective is to infer a cost function that explains expert-demonstrated navigation behavior while relying only on the observations and state-control trajectory used by the expert. We develop a cost function representation composed of two parts: a probabilistic occupancy encoder, with recurrent dependence on the observation sequence, and a cost encoder, defined over the occupancy features. The representation parameters are optimized by differentiating the error between demonstrated controls and a control policy computed from the cost encoder. Such differentiation is typically computed by dynamic programming through the value function over the whole state space. We observe that this is inefficient in large partially observable environments because most states are unexplored. Instead, we rely on a closed-form subgradient of the cost-to-go obtained only over a subset of promising states via an efficient motion-planning algorithm such as A* or RRT. Our experiments show that our model exceeds the accuracy of baseline IRL algorithms in robot navigation tasks, while substantially improving the efficiency of training and test-time inference. 
    more » « less
  3. The use of underwater robot systems, including Autonomous Underwater Vehicles (AUVs), has been studied as an effective way of monitoring and exploring dynamic aquatic environments. Furthermore, advances in artificial intelligence techniques and computer processing led to a significant effort towards fully autonomous navigation and energy-efficient approaches. In this work, we formulate a reinforcement learning framework for long-term navigation of underwater vehicles in dynamic environments using the techniques of tile coding and eligibility traces. Simulation results used actual oceanic data from the Regional Ocean Modeling System (ROMS) data set collected in Southern California Bight (SCB) region, California, USA 
    more » « less
  4. null (Ed.)
    This work presents the design and autonomous navigation policy of the Resilient Micro Flyer, a new type of collision-tolerant robot tailored to fly through extremely confined environments and manhole-sized tubes. The robot maintains a low weight (<500g) and implements a combined rigid-compliant design through the integration of elastic flaps around its stiff collision-tolerant frame. These passive flaps ensure compliant collisions, contact sensing and smooth navigation in contact with the environment. Focusing on resilient autonomy, capable of running on resource-constrained hardware, we demonstrate the beneficial role of compliant collisions for the reliability of the onboard visual-inertial odometry and propose a safe navigation policy that exploits both collision-avoidance using lightweight time-of-flight sensing and adaptive control in response to collisions. The robot further realizes an explicit manhole navigation mode that exploits the direct mechanical feedback provided by the flaps and a special navigation strategy to self-align inside manholes with non-straight geometry. Comprehensive experimental studies are presented to evaluate, both individually and as a whole, how resilience is achieved based on the robot design and its navigation scheme. 
    more » « less
  5. The autonomous navigation of mobile robots in unknown environments is of great interest in mobile robotics. This article discusses a new strategy to navigate to a known target location in an unknown environment using a combination of the “go-to-goal” approach and reinforcement learning with biologically realistic spiking neural networks. While the “goto-goal” approach itself might lead to a solution for most environments, the added neural reinforcement learning in this work results in a strategy that takes the robot from a starting position to a target location in a near shortest possible time. To achieve the goal, we propose a reinforcement learning approach based on spiking neural networks. The presented biologically motivated delayed reward mechanism using eligibility traces results in a greedy approach that leads the robot to the target in a close to shortest possible time. 
    more » « less