NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

BUMBLE: Unifying Reasoning and Acting with Vision-Language Models for Building-wide Mobile Manipulation

Shah, Rutav; Yu, Albert; Zhu, Yifeng; Zhu, Yuke; Martín-Martín, Roberto (January 2025, IEEE)

To operate at a building scale, service robots must perform very long-horizon mobile manipulation tasks by navigating to different rooms, accessing different floors, and interacting with a wide and unseen range of everyday objects. We refer to these tasks as Building-wide Mobile Manipulation. To tackle these inherently long-horizon tasks, we propose BUMBLE, a unified VLM-based framework integrating open-world RGBD perception, a wide spectrum of gross-to-fine motor skills, and dual-layered memory. Our extensive evaluation (90+ hours) indicates that BUMBLE outperforms multiple baselines in long-horizon building-wide tasks that require sequencing up to 12 ground truth skills spanning 15 minutes per trial. BUMBLE achieves 47.1% success rate averaged over 70 trials in different buildings, tasks, and scene layouts from different starting rooms and floors. Our user study demonstrates 22% higher satisfaction with our method than state-of-the-art mobile manipulation methods. Finally, we demonstrate the potential of using increasingly capable foundation models to push performance further.
more » « less
Free, publicly-accessible full text available January 31, 2026
LOTUS: Continual Imitation Learning for Robot Manipulation Through Unsupervised Skill Discovery

https://doi.org/10.1109/ICRA57147.2024.10611129

Wan, Weikang; Zhu, Yifeng; Shah, Rutav; Zhu, Yuke (May 2024, IEEE)

Full Text Available
What can we learn when fitting a simple telegraph model to a complex gene expression model?

https://doi.org/10.1371/journal.pcbi.1012118

Jiao, Feng; Li, Jing; Liu, Ting; Zhu, Yifeng; Che, Wenhao; Bleris, Leonidas; Jia, Chen (May 2024, PLOS Computational Biology)
Finley, Stacey D (Ed.)
In experiments, the distributions of mRNA or protein numbers in single cells are often fitted to the random telegraph model which includes synthesis and decay of mRNA or protein, and switching of the gene between active and inactive states. While commonly used, this model does not describe how fluctuations are influenced by crucial biological mechanisms such as feedback regulation, non-exponential gene inactivation durations, and multiple gene activation pathways. Here we investigate the dynamical properties of four relatively complex gene expression models by fitting their steady-state mRNA or protein number distributions to the simple telegraph model. We show that despite the underlying complex biological mechanisms, the telegraph model with three effective parameters can accurately capture the steady-state gene product distributions, as well as the conditional distributions in the active gene state, of the complex models. Some effective parameters are reliable and can reflect realistic dynamic behaviors of the complex models, while others may deviate significantly from their real values in the complex models. The effective parameters can also be applied to characterize the capability for a complex model to exhibit multimodality. Using additional information such as single-cell data at multiple time points, we provide an effective method of distinguishing the complex models from the telegraph model. Furthermore, using measurements under varying experimental conditions, we show that fitting the mRNA or protein number distributions to the telegraph model may even reveal the underlying gene regulation mechanisms of the complex models. The effectiveness of these methods is confirmed by analysis of single-cell data forE. coliand mammalian cells. All these results are robust with respect to cooperative transcriptional regulation and extrinsic noise. In particular, we find that faster relaxation speed to the steady state results in more precise parameter inference under large extrinsic noise.
more » « less
Full Text Available
Learning Generalizable Manipulation Policies with Object-Centric 3D Representations

Zhu, Yifeng; Jiang, Zhenyu; Stone, Peter; Zhu, Yuke (November 2023, Conference on Robot Learning)

We introduce GROOT, an imitation learning method for learning robust policies with object-centric and 3D priors. GROOT builds policies that generalize beyond their initial training conditions for vision-based manipulation. It constructs object-centric 3D representations that are robust toward background changes and camera views and reason over these representations using a transformer-based policy. Furthermore, we introduce a segmentation correspondence model that allows policies to generalize to new objects at test time. Through comprehensive experiments, we validate the robustness of GROOT policies against perceptual variations in simulated and real-world environments. GROOT's performance excels in generalization over background changes, camera viewpoint shifts, and the presence of new object instances, whereas both state-of-the-art end-to-end learning methods and object proposal-based approaches fall short. We also extensively evaluate GROOT policies on real robots, where we demonstrate the efficacy under very wild changes in setup.
more » « less
Full Text Available
PMEH: A Parallel and Write-Optimized Extendible Hashing for Persistent Memory

https://doi.org/10.1109/TCAD.2023.3271579

Hu, Jing; Chen, Jianxi; Zhu, Yifeng; Yang, Qing; Peng, Zhouxuan; Yu, Ya (November 2023, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems)

Full Text Available
Characterization of I/O Behaviors in Cloud Storage Workloads

https://doi.org/10.1109/TC.2023.3263726

Zou, Qiang; Zhu, Yifeng; Chen, Jianxi; Deng, Yuhui; Qin, Xiao (October 2023, IEEE Transactions on Computers)

Full Text Available
Cocktail: Mixing Data With Different Characteristics to Reduce Read Reclaims for nand Flash Memory

https://doi.org/10.1109/TCAD.2022.3214679

Zhang, Genxiong; Deng, Yuhui; Zhou, Yi; Pang, Shujie; Yue, Jianhui; Zhu, Yifeng (July 2023, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems)

Full Text Available
Learning to Walk by Steering: Perceptive Quadrupedal Locomotion in Dynamic Environments

https://doi.org/10.1109/ICRA48891.2023.10161302

Seo, Mingyo; Gupta, Ryan; Zhu, Yifeng; Skoutnev, Alexy; Sentis, Luis; Zhu, Yuke (May 2023, 2023 IEEE International Conference on Robotics and Automation (ICRA))

Full Text Available
Bottom-Up Skill Discovery From Unsegmented Demonstrations for Long-Horizon Robot Manipulation

https://doi.org/10.1109/LRA.2022.3146589

Zhu, Yifeng; Stone, Peter; Zhu, Yuke (April 2022, IEEE Robotics and Automation Letters)

Full Text Available

Search for: All records