NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Fatigue‐Resistant Mechanoresponsive Color‐Changing Hydrogels for Vision‐Based Tactile Robots

https://doi.org/10.1002/adma.202407925

Liu, Jiabin; Li, Wei; She, Yu; Blanchard, Sean; Lin, Shaoting (September 2024, Advanced Materials)

Abstract Mechanoresponsive color‐changing materials that can reversibly and resiliently change color in response to mechanical deformation are highly desirable for diverse modern technologies in optics, sensors, and robots; however, such materials are rarely achieved. Here, a fatigue‐resistant mechanoresponsive color‐changing hydrogel (FMCH) is reported that exhibits reversible, resilient, and predictable color changes under mechanical stress. At its undeformed state, the FMCH remains dark under a circular polariscope; upon uniaxial stretching of up to six times its initial length, it gradually shifts its color from black, to gray, yellow, and purple. Unlike traditional mechanoresponsive color‐changing materials, FMCH maintains its performance across various strain rates for up to 10 000 cycles. Moreover, FMCH demonstrates superior mechanical properties with fracture toughness of 3000 J m⁻², stretchability of 6, and fatigue threshold up to 400 J m⁻². These exceptional mechanical and optical features are attributed to FMCH's substantial molecular entanglements and desirable hygroscopic salts, which synergistically enhance its mechanical toughness while preserving its color‐changing performance. One application of this FMCH as a tactile sensoris then demonstrated for vision‐based tactile robots, enabling them to discern material stiffness, object shape, spatial location, and applied pressure by translating stress distribution on the contact surface into discernible images.
more » « less
UniT: Data Efficient Tactile Representation With Generalization to Unseen Objects

https://doi.org/10.1109/LRA.2025.3559835

Xu, Zhengtong; Uppuluri, Raghava; Zhang, Xinwei; Fitch, Cael; Crandall, Philip Glen; Shou, Wan; Wang, Dongyi; She, Yu (June 2025, IEEE Robotics and Automation Letters)

UniT is an approach to tactile representation learn¬ing, using VQGAN to learn a compact latent space and serve as the tactile representation. It uses tactile images obtained from a single simple object to train the representation with generalizability. This tactile representation can be zero-shot transferred to various downstream tasks, including perception tasks and manipulation policy learning. Our benchmarkings on in-hand 3D pose and 6D pose estimation tasks and a tactile classifcation task show that UniT outperforms existing visual and tactile representation learning methods. Additionally, UniT’s effectiveness in policy learning is demonstrated across three real-world tasks involving diverse manipulated objects and complex robot-object-environment interactions. Through extensive experi¬mentation, UniT is shown to be a simple-to-train, plug-and-play, yet widely effective method for tactile representation learning. For more details, please refer to our open-source repository https://github.com/ZhengtongXu/UniT and the project website https://zhengtongxu.github.io/unit-website/.
more » « less
Free, publicly-accessible full text available June 1, 2026
In‐Hand Singulation, Scooping, and Cable Untangling with a 5‐Dof Tactile‐Reactive Gripper

https://doi.org/10.1002/adrr.202500020

Zhou, Yuhao; Zhou, Pokuang; Wang, Shaoxiong; She, Yu (May 2025, Advanced Robotics Research)

Manipulation tasks often require a high degree of dexterity, typically necessitating grippers with multiple degrees of freedom (DOF). While a robotic hand equipped with multiple fingers can execute precise and intricate manipulation tasks, the inherent redundancy stemming from its high‐DOF often adds complexity that may not be required. In this paper, we introduce the design of a tactile sensor‐equipped gripper with two fingers and five‐DOF. We present a novel design integrating a GelSight tactile sensor, enhancing sensing capabilities and enabling finer control during specific manipulation tasks. To evaluate the gripper's performance, we conduct experiments involving three challenging tasks: 1) retrieving, singularizing, and classification of various objects buried within granular media, 2) executing scooping manipulations of a 3D‐printed credit card in confined environments to achieve precise insertion, and 3) sensing entangled cable states with only tactile perception and executing manipulations to achieve two‐cable untangling. Our results demonstrate the versatility of the proposed gripper across these tasks, with a high success rate of 84% for singulation task, a 100% success rate for scooping and inserting credit cards, and successful cable untangling. Videos are available athttps://yuhochau.github.io/5_dof_gripper/.
more » « less
Free, publicly-accessible full text available May 12, 2026
VILP: Imitation Learning With Latent Video Planning

https://doi.org/10.1109/LRA.2025.3542317

Xu, Zhengtong; Qiu, Qiang; She, Yu (April 2025, IEEE Robotics and Automation Letters)

In the era of generative AI, integrating video generation models into robotics opens new possibilities for the general-purpose robot agent. This paper introduces imitation learning with latent video planning (VILP). We propose a latent video diffusion model to generate predictive robot videos that adhere to temporal consistency to a good degree. Our method is able to generate highly time-aligned videos from multiple views, which is crucial for robot policy learning. Our video generation model is highly time-effcient. For example, it can generate videos from two distinct perspectives, each consisting of six frames with a resolution of 96x160 pixels, at a rate of 5 Hz. In the experiments, we demonstrate that VILP outperforms the existing video generation robot policy across several metrics: training costs, inference speed, temporal consistency of generated videos, and the performance of the policy. We also compared our method with other imitation learning methods. Our fndings indicate that VILP can rely less on extensive high-quality task-specifc robot action data while still maintaining robust performance. In addition, VILP possesses robust capabilities in representing multi-modal action distributions. Our paper provides a practical example of how to effectively integrate video generation models into robot policies, potentially offering insights for related felds and directions. For more details, please refer to our open-source repository https://github.com/ZhengtongXu/VILP.
more » « less
Free, publicly-accessible full text available April 1, 2026
LeTO: Learning Constrained Visuomotor Policy With Differentiable Trajectory Optimization

https://doi.org/10.1109/TASE.2024.3486542

Xu, Zhengtong; She, Yu (March 2025, IEEE Transactions on Automation Science and Engineering)

This paper introduces LeTO, a method for learning constrained visuomotor policy with differentiable trajectory optimization. Our approach integrates a differentiable optimization layer into the neural network. By formulating the optimization layer as a trajectory optimization problem, we enable the model to end-to-end generate actions in a safe and constraint-controlled fashion without extra modules. Our method allows for the introduction of constraint information during the training process, thereby balancing the training objectives of satisfying constraints, smoothing the trajectories, and minimizing errors with demonstrations. This “gray box” method marries optimization-based safety and interpretability with powerful representational abilities of neural networks. We quantitatively evaluate LeTO in simulation and in the real robot. The results demonstrate that LeTO performs well in both simulated and real-world tasks. In addition, it is capable of generating trajectories that are less uncertain, higher quality, and smoother compared to existing imitation learning methods. Therefore, it is shown that LeTO provides a practical example of how to achieve the integration of neural networks with trajectory optimization. We release our code at https://github.com/ZhengtongXu/LeTO.
more » « less
Free, publicly-accessible full text available March 26, 2026
VibTac: A High-Resolution High-Bandwidth Tactile Sensing Finger for Multi-Modal Perception in Robotic Manipulation

https://doi.org/10.1109/TOH.2025.3561049

Athar, Sheeraz; Zhang, Xinwei; Ueda, Jun; Zhao, Ye; She, Yu (January 2025, IEEE Transactions on Haptics)

Tactile sensing is pivotal for enhancing robot manipulation abilities by providing crucial feedback for localized information. However, existing sensors often lack the necessary resolution and bandwidth required for intricate tasks. To address this gap, we introduce VibTac, a novel multi-modal tactile sensing finger designed to offer high-resolution and high-bandwidth tactile sensing simultaneously. VibTac seamlessly integrates vision-based and vibration-based tactile sensing modes to achieve high-resolution and high-bandwidth tactile sensing respectively, leveraging a streamlined human-inspired design for versatility in tasks. This paper outlines the key design elements of VibTac and its fabrication methods, highlighting the significance of the Elastomer Gel Pad (EGP) in its sensing mechanism. The sensor’s multi-modal performance is validated through 3D reconstruction and spectral analysis to discern tactile stimuli effectively. In experimental trials, VibTac demonstrates its efficacy by achieving over 90% accuracy in insertion tasks involving objects emitting distinct sounds, such as ethernet connectors. Leveraging vision-based tactile sensing for object localization and employing a deep learning model for “click” sound classification, VibTac showcases its robustness in real-world scenarios. Video of the sensor working can be accessed at https://youtu.be/kmKIUlXGroo.
more » « less
Free, publicly-accessible full text available January 1, 2026
LeTac-MPC: Learning Model Predictive Control for Tactile-Reactive Grasping

https://doi.org/10.1109/TRO.2024.3463470

Xu, Zhengtong; She, Yu (September 2024, IEEE Transactions on Robotics)

Grasping is a crucial task in robotics, necessitating tactile feedback and reactive grasping adjustments for robust grasping of objects under various conditions and with differing physical properties. In this paper, we introduce LeTac-MPC, a learning-based model predictive control (MPC) for tactile-reactive grasping. Our approach enables the gripper to grasp objects with different physical properties on dynamic and force-interactive tasks. We utilize a vision-based tactile sensor, GelSight [1], which is capable of perceiving high-resolution tactile feedback that contains information on the physical properties and states of the grasped object. LeTac-MPC incorporates a differentiable MPC layer designed to model the embeddings extracted by a neural network (NN) from tactile feedback. This design facilitates convergent and robust grasping control at a frequency of 25 Hz. We propose a fully automated data collection pipeline and collect a dataset only using standardized blocks with different physical properties. However, our trained controller can generalize to daily objects with different sizes, shapes, materials, and textures. The experimental results demonstrate the effectiveness and robustness of the proposed approach. We compare LeTac-MPC with two purely model-based tactile-reactive controllers (MPC and PD) and open-loop grasping. Our results show that LeTac-MPC has optimal performance in dynamic and force-interactive tasks and optimal generalizability. We release our code and dataset at https://github.com/ZhengtongXu/LeTac-MPC.
more » « less
Full Text Available

Search for: All records