Title: Task-Directed Exploration in Continuous POMDPs for Robotic Manipulation of Articulated Objects
Representing and reasoning about uncertainty is crucial for autonomous agents acting in partially observable environments with noisy sensors. Partially observable Markov decision processes (POMDPs) serve as a general framework for representing problems in which uncertainty is an important factor. Online sample-based POMDP methods have emerged as efficient approaches to solving large POMDPs and have been shown to extend to continuous domains. However, these solutions struggle to find long-horizon plans in problems with significant uncertainty. Exploration heuristics can help guide planning, but many real-world settings contain significant task-irrelevant uncertainty that might distract from the task objective. In this paper, we propose STRUG, an online POMDP solver capable of handling domains that require long-horizon planning with significant task-relevant and task-irrelevant uncertainty. We demonstrate our solution on several temporally extended versions of toy POMDP problems as well as robotic manipulation of articulated objects using a neural perception frontend to construct a distribution of possible models. Our results show that STRUG outperforms the current sample-based online POMDP solvers on several tasks.
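The paper itself is not reproduced on this page, but the class of planner it builds on, online sample-based POMDP solvers that plan over a particle belief, can be illustrated with a minimal sketch. Everything below (the Tiger-style toy problem and the one-step lookahead with random rollouts) is an assumption for illustration only and is not STRUG's algorithm; real solvers such as POMCP or POMCPOW grow a full search tree and reuse it across steps.

```python
import random
from collections import defaultdict

# Hypothetical Tiger-style toy POMDP: the hidden state is which door hides
# the tiger; "listen" yields a noisy observation, opening a door ends the
# episode with a reward.  Illustration only, not the paper's domain.
STATES = ["tiger-left", "tiger-right"]
ACTIONS = ["listen", "open-left", "open-right"]
GAMMA = 0.95

def step(state, action, rng):
    """Generative model G(s, a) -> (s', o, r, done)."""
    if action == "listen":
        heard_correctly = rng.random() < 0.85
        obs = state if heard_correctly else STATES[1 - STATES.index(state)]
        return state, obs, -1.0, False
    opened = "tiger-left" if action == "open-left" else "tiger-right"
    reward = -100.0 if opened == state else 10.0
    return state, "none", reward, True

def rollout(state, depth, rng):
    """Random-policy rollout used to estimate values beyond the first step."""
    total, discount = 0.0, 1.0
    for _ in range(depth):
        state, _, r, done = step(state, rng.choice(ACTIONS), rng)
        total += discount * r
        discount *= GAMMA
        if done:
            break
    return total

def plan(belief_particles, n_sims=2000, depth=10, seed=0):
    """One-step lookahead with Monte Carlo rollouts over a particle belief.
    Full solvers (POMCP, POMCPOW, ...) grow a search tree instead."""
    rng = random.Random(seed)
    value, count = defaultdict(float), defaultdict(int)
    for _ in range(n_sims):
        s = rng.choice(belief_particles)          # sample a state from the belief
        a = rng.choice(ACTIONS)
        s2, _, r, done = step(s, a, rng)
        ret = r if done else r + GAMMA * rollout(s2, depth - 1, rng)
        count[a] += 1
        value[a] += (ret - value[a]) / count[a]   # incremental mean
    return max(ACTIONS, key=lambda a: value[a])

belief = [random.choice(STATES) for _ in range(100)]  # uniform particle belief
print(plan(belief))  # under this much uncertainty, "listen" typically wins
```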
Award ID(s): 2214177
NSF-PAR ID: 10444338
Author(s) / Creator(s):
Date Published:
Journal Name: IEEE International Conference on Robotics and Automation
ISSN: 1049-3492
Format(s): Medium: X
Sponsoring Org: National Science Foundation
More Like this
  1. Abstract— A core capability of robots is to reason about multiple objects under uncertainty. Partially Observable Markov Decision Processes (POMDPs) provide a means of reasoning under uncertainty for sequential decision making, but are computationally intractable in large domains. In this paper, we propose Object-Oriented POMDPs (OO-POMDPs), which represent the state and observation spaces in terms of classes and objects. The structure afforded by OO-POMDPs supports a factorization of the agent’s belief into independent object distributions, which enables the size of the belief to scale linearly rather than exponentially in the number of objects. We formulate a novel Multi-Object Search (MOS) task as an OO-POMDP for mobile robotics domains in which the agent must find the locations of multiple objects. Our solution exploits the structure of OO-POMDPs by featuring human language to selectively update the belief at task onset. Using this structure, we develop a new algorithm for efficiently solving OO-POMDPs: Object-Oriented Partially Observable Monte-Carlo Planning (OO-POMCP). We show that OO-POMCP with grounded language commands is sufficient for solving challenging MOS tasks both in simulation and on a physical mobile robot.
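The central structural claim in abstract 1, that factoring the belief into independent per-object distributions makes belief size grow linearly rather than exponentially in the number of objects, can be sketched in a few lines. The grid world, detection probabilities, and class below are hypothetical illustrations, not the authors' OO-POMCP implementation.

```python
# Hypothetical multi-object search belief: one independent location
# distribution per object (the OO-POMDP factorization), instead of a single
# joint distribution over all object locations.
GRID = [(x, y) for x in range(10) for y in range(10)]   # 100 candidate cells

class FactoredBelief:
    """Per-object distributions: storage grows as n_objects * |GRID| rather
    than |GRID| ** n_objects for the equivalent joint belief."""

    def __init__(self, object_ids):
        uniform = {cell: 1.0 / len(GRID) for cell in GRID}
        self.per_object = {obj: dict(uniform) for obj in object_ids}

    def update(self, obj, looked_at, detected, p_detect=0.9, p_false=0.05):
        """Bayesian update for one object after looking at a single cell;
        the other objects' distributions are untouched (independence)."""
        b = self.per_object[obj]
        for cell in b:
            p_obs = p_detect if cell == looked_at else p_false
            b[cell] *= p_obs if detected else (1.0 - p_obs)
        norm = sum(b.values())
        for cell in b:
            b[cell] /= norm

belief = FactoredBelief(["mug", "book", "keys"])
belief.update("mug", looked_at=(3, 4), detected=False)   # negative sighting
print(len(GRID) * 3, "factored entries vs", len(GRID) ** 3, "joint entries")
```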
  2. This work presents novel techniques for tightly integrated online information fusion and planning in human-autonomy teams operating in partially known environments. Motivated by dynamic target search problems, we present a new map-based sketch interface for online soft-hard data fusion. This interface lets human collaborators efficiently update map information and continuously build their own highly flexible ad hoc dictionaries for making language-based semantic observations, which can be actively exploited by autonomous agents in optimal search and information gathering problems. We formally link these capabilities to POMDP algorithms for optimal planning under uncertainty, and develop a new Dynamically Observable Monte Carlo Planning (DOMCP) algorithm as an efficient means for updating online sampling-based planning policies for POMDPs with non-static observation models. DOMCP is validated on a small-scale robot localization problem, and then demonstrated with our new user interface on a simulated dynamic target search scenario in a partially known outdoor environment.
  3. This paper presents a hybrid online Partially Observable Markov Decision Process (POMDP) planning system that addresses the problem of autonomous navigation in the presence of multi-modal uncertainty introduced by other agents in the environment. As a particular example, we consider the problem of autonomous navigation in dense crowds of pedestrians and among obstacles. Popular approaches to this problem first generate a path using a complete planner (e.g., Hybrid A*) with ad hoc assumptions about uncertainty, then use online tree-based POMDP solvers to reason about uncertainty with control over a limited aspect of the problem (i.e., speed along the path). We present a more capable and responsive real-time approach that enables the POMDP planner to control more degrees of freedom (e.g., both speed AND heading) to achieve more flexible and efficient solutions. This modification greatly extends the region of the state space that the POMDP planner must reason over, significantly increasing the importance of finding effective roll-out policies within the limited computational budget that real-time control affords. Our key insight is to use multi-query motion planning techniques (e.g., Probabilistic Roadmaps or the Fast Marching Method) as priors for rapidly generating efficient roll-out policies for every state that the POMDP planning tree might reach during its limited-horizon search. Our proposed approach generates trajectories that are safe and significantly more efficient than the previous approach, even in densely crowded dynamic environments with long planning horizons.
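The key insight in abstract 3, reusing a multi-query motion planner as a prior for roll-out policies, can be approximated very roughly with a precomputed cost-to-go field that a greedy roll-out policy descends. The grid world, 4-connected dynamics, and Dijkstra field below are assumptions for illustration; the paper itself uses Probabilistic Roadmaps or the Fast Marching Method on the real navigation problem.

```python
import heapq

FREE, OBSTACLE = 0, 1

def cost_to_go(grid, goal):
    """Dijkstra distance field over a 4-connected grid, computed once and
    reused for every roll-out (the 'multi-query' idea)."""
    rows, cols = len(grid), len(grid[0])
    dist, heap = {goal: 0.0}, [(0.0, goal)]
    while heap:
        d, (r, c) = heapq.heappop(heap)
        if d > dist.get((r, c), float("inf")):
            continue
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < rows and 0 <= nc < cols and grid[nr][nc] == FREE:
                nd = d + 1.0
                if nd < dist.get((nr, nc), float("inf")):
                    dist[(nr, nc)] = nd
                    heapq.heappush(heap, (nd, (nr, nc)))
    return dist

def rollout_policy(dist, state):
    """Greedy descent of the precomputed cost-to-go: a cheap but informed
    default action for any state the search tree happens to reach."""
    r, c = state
    moves = {"up": (r - 1, c), "down": (r + 1, c),
             "left": (r, c - 1), "right": (r, c + 1)}
    return min(moves, key=lambda a: dist.get(moves[a], float("inf")))

grid = [[FREE] * 6 for _ in range(6)]
grid[2][1:5] = [OBSTACLE] * 4                   # a wall the policy must skirt
field = cost_to_go(grid, goal=(5, 5))
print(rollout_policy(field, (0, 3)))            # "down", then around the wall
```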
  4. Abstract— To be responsive to dynamically changing real-world environments, an intelligent agent needs to perform complex sequential decision-making tasks that are often guided by commonsense knowledge. Previous work on this line of research led to the framework called interleaved commonsense reasoning and probabilistic planning (icorpp), which used P-log for representing commonsense knowledge and Markov Decision Processes (MDPs) or Partially Observable MDPs (POMDPs) for planning under uncertainty. A main limitation of icorpp is that its implementation requires non-trivial engineering effort to bridge the commonsense reasoning and probabilistic planning formalisms. In this paper, we present a unified framework to integrate icorpp’s reasoning and planning components. In particular, we extend the probabilistic action language pBC+ to express utility, belief states, and observations as in POMDP models. Inheriting the advantages of action languages, the new action language provides an elaboration-tolerant representation of POMDPs that reflects commonsense knowledge. This idea led to the design of the system pbcplus2pomdp, which compiles a pBC+ action description into a POMDP model that can be directly processed by off-the-shelf POMDP solvers to compute an optimal policy of the pBC+ action description. Our experiments show that it retains the advantages of icorpp while avoiding the manual effort of bridging the commonsense reasoner and the probabilistic planner.
  5. Partially observable Markov decision processes (POMDPs) provide a flexible representation for real-world decision and control problems. However, POMDPs are notoriously difficult to solve, especially when the state and observation spaces are continuous or hybrid, which is often the case for physical systems. While recent online sampling-based POMDP algorithms that plan with observation likelihood weighting have shown practical effectiveness, a general theory characterizing the approximation error of the particle filtering techniques that these algorithms use has not previously been proposed. Our main contribution is bounding the error between any POMDP and its corresponding finite sample particle belief MDP (PB-MDP) approximation. This fundamental bridge between PB-MDPs and POMDPs allows us to adapt any sampling-based MDP algorithm to a POMDP by solving the corresponding particle belief MDP, thereby extending the convergence guarantees of the MDP algorithm to the POMDP. Practically, this is implemented by using the particle filter belief transition model as the generative model for the MDP solver. While this requires access to the observation density model from the POMDP, it only increases the transition sampling complexity of the MDP solver by a factor of O(C), where C is the number of particles. Thus, when combined with sparse sampling MDP algorithms, this approach can yield algorithms for POMDPs that have no direct theoretical dependence on the size of the state and observation spaces. In addition to our theoretical contribution, we perform five numerical experiments on benchmark POMDPs to demonstrate that a simple MDP algorithm adapted using PB-MDP approximation, Sparse-PFT, achieves performance competitive with other leading continuous observation POMDP solvers.

     
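The particle belief MDP (PB-MDP) construction described in abstract 5 can be sketched as a belief-level generative model: each transition propagates all C particles through the POMDP simulator, samples one observation from a propagated particle, and then weights and resamples the particles against it, which is where the O(C) factor in transition sampling cost comes from. The function signature and helper names below are illustrative assumptions, not the authors' Sparse-PFT code.

```python
import random

def pbmdp_step(particles, action, simulate, obs_density, rng):
    """One transition of a (hypothetical sketch of a) particle belief MDP.

    simulate(s, a, rng) -> (s', o, r) is the POMDP generative model and
    obs_density(o, s', a) -> float is the observation likelihood; both are
    assumed to be supplied by the problem definition."""
    C = len(particles)
    # Propagate every particle through the generative model: the O(C) factor.
    propagated, rewards = [], []
    for s in particles:
        s2, o, r = simulate(s, action, rng)
        propagated.append((s2, o))
        rewards.append(r)
    # Sample the belief-level "observation" from one propagated particle.
    _, obs = propagated[rng.randrange(C)]
    # Weight each propagated particle by its likelihood of producing obs.
    weights = [obs_density(obs, s2, action) for s2, _ in propagated]
    if sum(weights) == 0.0:                       # degenerate fallback
        weights = [1.0] * C
    # Belief-level reward: mean reward over the (unweighted) input particles.
    belief_reward = sum(rewards) / C
    # Resample to form the next particle belief of constant size C.
    next_particles = rng.choices([s2 for s2, _ in propagated],
                                 weights=weights, k=C)
    return next_particles, belief_reward
```

Under these assumptions, any sampling-based MDP solver can treat pbmdp_step as its transition model, which is the sense in which the PB-MDP bridge lets off-the-shelf MDP algorithms plan in the POMDP.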