Autonomous Learning in a Pseudo-Episodic Physical Environment

Haughn, Kevin P.; Inman, Daniel J.

doi:10.1007/s10846-022-01577-5

Citation Details

Abstract: For practical reasons, reinforcement learning has proven difficult to apply to physical experiments outside of simulation. Here we derive an optional approach to model-free reinforcement learning, achieved entirely online, through careful experimental design and algorithmic decision making. We design a reinforcement learning scheme to implement traditionally episodic algorithms for an unstable 1-dimensional mechanical environment. The training scheme is completely autonomous, requiring no human to be present throughout the learning process. We show that the pseudo-episodic technique allows for additional learning updates with off-policy actor-critic and experience replay methods, and that including these additional updates between periods of traditional training episodes can improve the speed and consistency of learning. Furthermore, we validate the procedure in experimental hardware. In the physical environment, several algorithm variants learned rapidly, each surpassing the baseline maximum reward. The algorithms in this research are model-free and use only information obtained by an onboard sensor during training.

Award ID(s): 1935216

PAR ID: 10378241

Author(s) / Creator(s): Haughn, Kevin P.; Inman, Daniel J.

Date Published: 2022-02-01

Journal Name: Journal of Intelligent & Robotic Systems

Volume: 104

Issue: 2

ISSN: 0921-0296

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article:
https://doi.org/10.1007/s10846-022-01577-5
