NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Reward Finetuning for Faster and More Accurate Unsupervised Object Discovery

Luo, Katie; Liu, Zhenzhen; Chen, Xiangyu; You, Yurong; Bainam, Sagie; Phoo, Cheng P; Campbell, Mark; Sun, Wen; Hariharan, Bharath; Weinberger, Kilian Q (December 2023, Advances in Neural Information Processing Systems 36 (NeurIPS 2023))

Recent advances in machine learning have shown that Reinforcement Learning from Human Feedback (RLHF) can improve machine learning models and align them with human preferences. Although very successful for Large Language Models (LLMs), these advancements have not had a comparable impact in research for autonomous vehicles—where alignment with human expectations can be imperative. In this paper, we propose to adapt similar RL-based methods to unsupervised object discovery, i.e. learning to detect objects from LiDAR points without any training labels. Instead of labels, we use simple heuristics to mimic human feedback. More explicitly, we combine multiple heuristics into a simple reward function that positively correlates its score with bounding box accuracy, i.e., boxes containing objects are scored higher than those without. We start from the detector’s own predictions to explore the space and reinforce boxes with high rewards through gradient updates. Empirically, we demonstrate that our approach is not only more accurate, but also orders of magnitudes faster to train compared to prior works on object discovery. Code is available at https://github.com/katieluo88/DRIFT.
more » « less
Full Text Available
Planning Paths through Occlusions in Urban Environments

Han, Yutao; Xia, Youya; Qi, Guo-Jun; Campbell, Mark (December 2022, Conference on Robot Learning (CoRL))
Learning to Assess Danger from Movies for Cooperative Escape Planning in Hazardous Environments

https://doi.org/10.1109/IROS47612.2022.9982279

Shree, Vikram; Allen, Sarah; Asfora, Beatriz; Banfi, Jacopo; Campbell, Mark (October 2022, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS))

Full Text Available
Is it Worth to Reason about Uncertainty in Occupancy Grid Maps during Path Planning?

https://doi.org/10.1109/ICRA46639.2022.9812431

Banfi, Jacopo; Woo, Lindsey; Campbell, Mark (May 2022, International Conference on Robotics and Automation (ICRA))

Full Text Available
Accelerated consensus in multi-agent networks via memory of local averages

https://doi.org/10.1109/CDC45484.2021.9683055

Bhaskar, Aditya; Rangarajan, Shriya; Shree, Vikram; Campbell, Mark; Parise, Francesca (December 2021, IEEE Conference on Decision and Control (CDC))

Full Text Available
Exploiting Natural Language for Efficient Risk-Aware Multi-Robot SaR Planning

https://doi.org/10.1109/LRA.2021.3062798

Shree, Vikram; Asfora, Beatriz; Zheng, Rachel; Hong, Samantha; Banfi, Jacopo; Campbell, Mark (April 2021, IEEE Robotics and Automation Letters)
null (Ed.)
Full Text Available
Mixed-Integer Linear Programming Models for Multi-Robot Non-Adversarial Search

https://doi.org/10.1109/LRA.2020.3017473

Asfora, Beatriz A.; Banfi, Jacopo; Campbell, Mark (October 2020, IEEE Robotics and Automation Letters)
null (Ed.)
Full Text Available
Planning High-Level Paths in Hostile, Dynamic, and Uncertain Environments

https://doi.org/10.1613/jair.1.12077

Banfi, Jacopo; Shree, Vikram; Campbell, Mark (September 2020, Journal of Artificial Intelligence Research)
null (Ed.)
This paper introduces and studies a graph-based variant of the path planning problem arising in hostile environments. We consider a setting where an agent (e.g. a robot) must reach a given destination while avoiding being intercepted by probabilistic entities which exist in the graph with a given probability and move according to a probabilistic motion pattern known a priori. Given a goal vertex and a deadline to reach it, the agent must compute the path to the goal that maximizes its chances of survival. We study the computational complexity of the problem, and present two algorithms for computing high quality solutions in the general case: an exact algorithm based on Mixed-Integer Nonlinear Programming, working well in instances of moderate size, and a pseudo-polynomial time heuristic algorithm allowing to solve large scale problems in reasonable time. We also consider the two limit cases where the agent can survive with probability 0 or 1, and provide specialized algorithms to detect these kinds of situations more efficiently.
more » « less
Full Text Available
Interactive Natural Language-Based Person Search

https://doi.org/10.1109/LRA.2020.2969921

Shree, Vikram; Chao, Wei-Lun; Campbell, Mark (April 2020, IEEE Robotics and Automation Letters)
null (Ed.)
Full Text Available
An Empirical Study of Person Re-Identification with Attributes

https://doi.org/10.1109/RO-MAN46459.2019.8956459

Shree, Vikram; Chao, Wei-Lun; Campbell, Mark (October 2019, IEEE International Conference on Robot and Human Interactive Communication (RO-MAN))
null (Ed.)
Full Text Available

Search for: All records