NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Language-Conditioned Observation Models for Visual Object Search

https://doi.org/10.1109/IROS55552.2023.10341492

Nguyen, Thao; Hrosinkov, Vladislav; Rosen, Eric; Tellex, Stefanie (October 2023, IEEE)
RLang: A Declarative Language for Describing Partial World Knowledge to Reinforcement Learning Agents

Rodriguez-Sanchez, Rafael; Spiegel, Benjamin; Wang, Jennifer; Patel, Roma; Tellex, Stefanie; Konidaris, George (July 2023, Proceedings of the 40th International Conference on Machine Learning)

We introduce RLang, a domain-specific language (DSL) for communicating domain knowledge to an RL agent. Unlike existing RL DSLs that ground to single elements of a decision-making formalism (e.g., the reward function or policy), RLang can specify information about every element of a Markov decision process. We define precise syntax and grounding semantics for RLang, and provide a parser that grounds RLang programs to an algorithm-agnostic partial world model and policy that can be exploited by an RL agent. We provide a series of example RLang programs demonstrating how different RL methods can exploit the resulting knowledge, encompassing model-free and model-based tabular algorithms, policy gradient and value-based methods, hierarchical approaches, and deep methods.
more » « less
Full Text Available
Skill Generalization with Verbs

Ma, Rachel; Lam, Lyndon; Spiegel, Benjamin; Ganeshan, Aditya; Patel, Roma; Abbatematteo, Ben; Paulius, David Paulius; Tellex, Stefanie; Konidaris, George (October 2023, Proceedings of the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems)

It is imperative that robots can understand natural language commands issued by humans. Such commands typically contain verbs that signify what action should be performed on a given object and that are applicable to many objects. We propose a method for generalizing manipulation skills to novel objects using verbs. Our method learns a probabilistic classifier that determines whether a given object trajectory can be described by a specific verb. We show that this classifier accurately generalizes to novel object categories with an average accuracy of 76.69% across 13 object categories and 14 verbs. We then perform policy search over the object kinematics to find an object trajectory that maximizes classifier prediction for a given verb. Our method allows a robot to generate a trajectory for a novel object based on a verb, which can then be used as input to a motion planner. We show that our model can generate trajectories that are usable for executing five verb commands applied to novel instances of two different object categories on a real robot.
more » « less
Full Text Available
Multi-Resolution POMDP Planning for Multi-Object Search in 3D

https://doi.org/10.1109/IROS51168.2021.9636737

Zheng, Kaiyu; Sung, Yoonchang; Konidaris, George; Tellex, Stefanie (September 2021, IROS)

Full Text Available
Spatial Language Understanding for Object Search in Partially Observed City-scale Environments

https://doi.org/10.1109/RO-MAN50785.2021.9515426

Zheng, Kaiyu; Bayazit, Deniz; Mathew, Rebecca; Pavlick, Ellie; Tellex, Stefanie (August 2021, ICRA)

Full Text Available
Robots That Use Language

https://doi.org/10.1146/annurev-control-101119-071628

Tellex, Stefanie; Gopalan, Nakul; Kress-Gazit, Hadas; Matuszek, Cynthia (May 2020, Annual Review of Control, Robotics, and Autonomous Systems)

This article surveys the use of natural language in robotics from a robotics point of view. To use human language, robots must map words to aspects of the physical world, mediated by the robot's sensors and actuators. This problem differs from other natural language processing domains due to the need to ground the language to noisy percepts and physical actions. Here, we describe central aspects of language use by robots, including understanding natural language requests, using language to drive learning about the physical world, and engaging in collaborative dialogue with a human partner. We describe common approaches, roughly divided into learning methods, logic-based methods, and methods that focus on questions of human–robot interaction. Finally, we describe several application domains for language-using robots.
more » « less
Full Text Available
Multi-Object Search using Object-Oriented POMDPs

https://doi.org/10.1109/ICRA.2019.8793888

Wandzel, Arthur; Oh, Yoonseon; Fishman, Michael; Kumar, Nishanth; Wong, Lawson L.S.; Tellex, Stefanie (May 2019, IEEE International Conference on Robotics and Automation (ICRA))

Full Text Available
Spoken language interaction with robots: Recommendations for future research

https://doi.org/10.1016/j.csl.2021.101255

Marge, Matthew; Espy-Wilson, Carol; Ward, Nigel G.; Alwan, Abeer; Artzi, Yoav; Bansal, Mohit; Blankenship, Gil; Chai, Joyce; Daumé, Hal; Dey, Debadeepta; et al (January 2022, Computer Speech & Language)
null (Ed.)
Full Text Available
Planning with Abstract Markov Decision Processes

Gopalan, Nakul; desJardins, Marie; Littman, Michael L.; MacGlashan, J.; Squire, S.; Tellex, Stefanie; Winder, John; Wong, Lawson L. (July 2017, 27th International Conference on Automated Planning and Scheduling)

Robots acting in human-scale environments must plan under uncertainty in large state–action spaces and face constantly changing reward functions as requirements and goals change. Planning under uncertainty in large state–action spaces requires hierarchical abstraction for efficient computation. We introduce a new hierarchical planning framework called Abstract Markov Decision Processes (AMDPs) that can plan in a fraction of the time needed for complex decision making in ordinary MDPs. AMDPs provide abstract states, actions, and transition dynamics in multiple layers above a base-level “flat” MDP. AMDPs decompose problems into a series of subtasks with both local reward and local transition functions used to create policies for subtasks. The resulting hierarchical planning method is independently optimal at each level of abstraction, and is recursively optimal when the local reward and transition functions are correct. We present empirical results showing significantly improved planning speed, while maintaining solution quality, in the Taxi domain and in a mobile-manipulation robotics problem. Furthermore, our approach allows specification of a decision-making model for a mobile-manipulation problem on a Turtlebot, spanning from low-level control actions operating on continuous variables all the way up through high-level object manipulation tasks.
more » « less
Full Text Available
Planning with Abstract Markov Decision Processes

Gopalan, Nakul; desJardins, Marie; Littman, Michael L.; MacGlashan, James; Squire, Shawn; Tellex, Stefanie; Winder, John; Wong, Lawson L.S. (January 2017, ICAPS)

Robots acting in human-scale environments must plan under uncertainty in large state–action spaces and face constantly changing reward functions as requirements and goals change. Planning under uncertainty in large state–action spaces requires hierarchical abstraction for efficient computation. We introduce a new hierarchical planning framework called Abstract Markov Decision Processes (AMDPs) that can plan in a fraction of the time needed for complex decision making in ordinary MDPs. AMDPs provide abstract states, actions, and transition dynamics in multiple layers above a base-level “flat” MDP. AMDPs decompose problems into a series of subtasks with both local reward and local transition functions used to create policies for subtasks. The resulting hierarchical planning method is independently optimal at each level of abstraction, and is recursively optimal when the local reward and transition functions are correct. We present empirical results showing significantly improved planning speed, while maintaining solution quality, in the Taxi domain and in a mobile-manipulation robotics problem. Furthermore, our approach allows specification of a decision-making model for a mobile-manipulation problem on a Turtlebot, spanning from low-level control actions operating on continuous variables all the way up through high-level object manipulation tasks.
more » « less
Full Text Available

Search for: All records