Search for: All records

Award ID contains: 1637614

« Prev Next »

Total Resources

10

Resource Type
Conference Paper

8

Conference Proceeding

0

Dataset

0

Journal Article

2

Workshop Report

0

Availability
Full Text / Resource Available

10

Citation Only

0

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Robots That Use Language

https://doi.org/10.1146/annurev-control-101119-071628

Tellex, Stefanie ; Gopalan, Nakul ; Kress-Gazit, Hadas ; Matuszek, Cynthia ( May 2020 , Annual Review of Control, Robotics, and Autonomous Systems)

This article surveys the use of natural language in robotics from a robotics point of view. To use human language, robots must map words to aspects of the physical world, mediated by the robot's sensors and actuators. This problem differs from other natural language processing domains due to the need to ground the language to noisy percepts and physical actions. Here, we describe central aspects of language use by robots, including understanding natural language requests, using language to drive learning about the physical world, and engaging in collaborative dialogue with a human partner. We describe common approaches, roughly divided into learning methods, logic-based methods, and methods that focus on questions of human–robot interaction. Finally, we describe several application domains for language-using robots.
more » « less
Full Text Available
The Expected-Length Model of Options

David Abel*, John Winder* ( January 2019 , IJCAI)

Effective options can make reinforcement learning easier by enhancing an agent’s ability to both ex- plore in a targeted manner and plan further into the future. However, learning an appropriate model of an option’s dynamics in hard, requiring estimat- ing a highly parameterized probability distribution. This paper introduces and motivates the Expected- Length Model (ELM) for options, an alternate model for transition dynamics. We prove ELM is a (biased) estimator of the traditional Multi- Time Model (MTM), but provide a non-vacuous bound on their deviation. We further prove that, in stochastic shortest path problems, ELM induces a value function that is sufficiently similar to the one induced by MTM, and is thus capable of support- ing near-optimal behavior. We explore the practical utility of this option model experimentally, finding consistent support for the thesis that ELM is a suit- able replacement for MTM. In some cases, we find ELM leads to more sample efficient learning, espe- cially when options are arranged in a hierarchy.
more » « less
Full Text Available
The value of abstraction

https://doi.org/10.1016/j.cobeha.2019.05.001

Ho, Mark K ( January 2019 , Current opinion in behavioral sciences)

Full Text Available
Planning with State Abstractions for Non-Markovian Task Specifications

Yoonseon Oh, Roma Patel ( January 2019 , Robotics science and systems)

Abstract—Often times, we specify tasks for a robot using tem- poral language that can also span different levels of abstraction. The example command “go to the kitchen before going to the second floor” contains spatial abstraction, given that “floor” consists of individual rooms that can also be referred to in isolation (“kitchen”, for example). There is also a temporal ordering of events, defined by the word “before”. Previous works have used Linear Temporal Logic (LTL) to interpret temporal language (such as “before”), and Abstract Markov Decision Processes (AMDPs) to interpret hierarchical abstractions (such as “kitchen” and “second floor”), separately. To handle both types of commands at once, we introduce the Abstract Product Markov Decision Process (AP-MDP), a novel approach capable of representing non-Markovian reward functions at different levels of abstractions. The AP-MDP framework translates LTL into its corresponding automata, creates a product Markov Decision Process (MDP) of the LTL specification and the environment MDP, and decomposes the problem into subproblems to enable efficient planning with abstractions. AP-MDP performs faster than a non-hierarchical method of solving LTL problems in over 95% of tasks, and this number only increases as the size of the en- vironment domain increases. We also present a neural sequence- to-sequence model trained to translate language commands into LTL expression, and a new corpus of non-Markovian language commands spanning different levels of abstraction. We test our framework with the collected language commands on a drone, demonstrating that our approach enables a robot to efficiently solve temporal commands at different levels of abstraction.
more » « less
Full Text Available
Multi-Object Search using Object-Oriented POMDPs

Arthur Wandzel, Yoonseon Oh ( January 2019 , IEEE International Conference on Robotics and Automation)

Abstract— A core capability of robots is to reason about mul- tiple objects under uncertainty. Partially Observable Markov Decision Processes (POMDPs) provide a means of reasoning under uncertainty for sequential decision making, but are computationally intractable in large domains. In this paper, we propose Object-Oriented POMDPs (OO-POMDPs), which represent the state and observation spaces in terms of classes and objects. The structure afforded by OO-POMDPs support a factorization of the agent’s belief into independent object distributions, which enables the size of the belief to scale linearly versus exponentially in the number of objects. We formulate a novel Multi-Object Search (MOS) task as an OO-POMDP for mobile robotics domains in which the agent must find the locations of multiple objects. Our solution exploits the structure of OO-POMDPs by featuring human language to selectively update the belief at task onset. Using this structure, we develop a new algorithm for efficiently solving OO-POMDPs: Object- Oriented Partially Observable Monte-Carlo Planning (OO- POMCP). We show that OO-POMCP with grounded language commands is sufficient for solving challenging MOS tasks both in simulation and on a physical mobile robot.
more » « less
Full Text Available
State Abstractions for Lifelong Reinforcement Learning

Abel, D. ; Arumugam, D. ; Lehnert, L. ; Littman, M. ( January 2018 , Proceedings of the 35th International Conference on Machine Learning)

Full Text Available
Policy and Value Transfer in Lifelong Reinforcement Learning

Abel, D. ; Jinnai, Y. ; Guo, Y. ; Konidaris, G. ; Littman, M. ( January 2018 , Proceedings of the 35th International Conference on Machine Learning)

Full Text Available
Learning Approximate Stochastic Transition Models

Song, Y. ; Grimm, C. ; Wang, X. ; Littman, M. ( January 2017 , arXiv preprint arXiv:1710.09718)

Full Text Available
Planning with Abstract Markov Decision Processes

Gopalan, Nakul ; desJardins, Marie ; Littman, Michael L. ; MacGlashan, James ; Squire, Shawn ; Tellex, Stefanie ; Winder, John ; Wong, Lawson L.S. ( January 2017 , ICAPS)

Robots acting in human-scale environments must plan under uncertainty in large state–action spaces and face constantly changing reward functions as requirements and goals change. Planning under uncertainty in large state–action spaces requires hierarchical abstraction for efficient computation. We introduce a new hierarchical planning framework called Abstract Markov Decision Processes (AMDPs) that can plan in a fraction of the time needed for complex decision making in ordinary MDPs. AMDPs provide abstract states, actions, and transition dynamics in multiple layers above a base-level “flat” MDP. AMDPs decompose problems into a series of subtasks with both local reward and local transition functions used to create policies for subtasks. The resulting hierarchical planning method is independently optimal at each level of abstraction, and is recursively optimal when the local reward and transition functions are correct. We present empirical results showing significantly improved planning speed, while maintaining solution quality, in the Taxi domain and in a mobile-manipulation robotics problem. Furthermore, our approach allows specification of a decision-making model for a mobile-manipulation problem on a Turtlebot, spanning from low-level control actions operating on continuous variables all the way up through high-level object manipulation tasks.
more » « less
Full Text Available
Near Optimal Behavior via Approximate State Abstraction

Abel, David ; Hershkowitz, D. Ellis ; Littman, Michael L. ( January 2016 , ICML)

The combinatorial explosion that plagues planning and reinforcement learning (RL) algorithms can be moderated using state abstraction. Prohibitively large task representations can be condensed such that essential information is preserved, and consequently, solutions are tractably computable. However, exact abstractions, which treat only fully-identical situations as equivalent, fail to present opportunities for abstraction in environments where no two situations are exactly alike. In this work, we investigate approximate state abstractions, which treat nearly-identical situations as equivalent. We present theoretical guarantees of the quality of behaviors derived from four types of approximate abstractions. Additionally, we empirically demonstrate that approximate abstractions lead to reduction in task complexity and bounded loss of optimality of behavior in a variety of environments.
more » « less
Full Text Available