Search for: All records

Creators/Authors contains: "Guibas, Leonidas J."


Total Resources: 27

Resource Type
Conference Paper: 24
Conference Proceeding: 0
Dataset: 0
Journal Article: 3
Workshop Report: 0

Availability
Full Text / Resource Available: 26
Citation Only: 1

ACID: Action-Conditional Implicit Visual Dynamics for Deformable Object Manipulation

Shen, Bokui ; Jiang, Zhenyu ; Choy, Christopher ; Savarese, Silvio ; Guibas, Leonidas J. ; Anandkumar, Anima ; Zhu, Yuke ( June 2022 , Robotics: Science and Systems XVIII)

Full Text Available
Action-conditional implicit visual dynamics for deformable object manipulation

https://doi.org/10.1177/02783649231191222

Shen, Bokui ; Jiang, Zhenyu ; Choy, Christopher ; Savarese, Silvio ; Guibas, Leonidas J. ; Anandkumar, Anima ; Zhu, Yuke ( July 2023 , The International Journal of Robotics Research)

Manipulating volumetric deformable objects in the real world, like plush toys and pizza dough, brings substantial challenges due to infinite shape variations, non-rigid motions, and partial observability. We introduce ACID, an action-conditional visual dynamics model for volumetric deformable objects based on structured implicit neural representations. ACID integrates two new techniques: implicit representations for action-conditional dynamics and geodesics-based contrastive learning. To represent deformable dynamics from partial RGB-D observations, we learn implicit representations of occupancy and flow-based forward dynamics. To accurately identify state change under large non-rigid deformations, we learn a correspondence embedding field through a novel geodesics-based contrastive loss. To evaluate our approach, we develop a simulation framework for manipulating complex deformable shapes in realistic scenes and a benchmark containing over 17,000 action trajectories with six types of plush toys and 78 variants. Our model achieves the best performance in geometry, correspondence, and dynamics predictions over existing approaches. The ACID dynamics models are successfully employed for goal-conditioned deformable manipulation tasks, resulting in a 30% increase in task success rate over the strongest baseline. Furthermore, we apply the simulation-trained ACID model directly to real-world objects and show success in manipulating them into target configurations. https://b0ku1.github.io/acid/

HuMoR: 3D Human Motion Model for Robust Pose Estimation

https://doi.org/10.1109/ICCV48922.2021.01129

Rempe, Davis ; Birdal, Tolga ; Hertzmann, Aaron ; Yang, Jimei ; Sridhar, Srinath ; Guibas, Leonidas J. ( October 2021 , 2021 IEEE/CVF International Conference on Computer Vision (ICCV))

We introduce HuMoR: a 3D Human Motion Model for Robust Estimation of temporal pose and shape. Though substantial progress has been made in estimating 3D human motion and shape from dynamic observations, recovering plausible pose sequences in the presence of noise and occlusions remains a challenge. For this purpose, we propose an expressive generative model in the form of a conditional variational autoencoder, which learns a distribution of the change in pose at each step of a motion sequence. Furthermore, we introduce a flexible optimization-based approach that leverages HuMoR as a motion prior to robustly estimate plausible pose and shape from ambiguous observations. Through extensive evaluations, we demonstrate that our model generalizes to diverse motions and body shapes after training on a large motion capture dataset, and enables motion reconstruction from multiple input modalities including 3D keypoints and RGB(-D) videos. See the project page at geometry.stanford.edu/projects/humor.
Full Text Available
CAPTRA: CAtegory-level Pose Tracking for Rigid and Articulated Objects from Point Clouds

https://doi.org/10.1109/ICCV48922.2021.01296

Weng, Yijia ; Wang, He ; Zhou, Qiang ; Qin, Yuzhe ; Duan, Yueqi ; Fan, Qingnan ; Chen, Baoquan ; Su, Hao ; Guibas, Leonidas J. ( October 2021 , 2021 IEEE/CVF International Conference on Computer Vision (ICCV))

In this work, we tackle the problem of category-level online pose tracking of objects from point cloud sequences. For the first time, we propose a unified framework that can handle 9DoF pose tracking for novel rigid object instances as well as per-part pose tracking for articulated objects from known categories. Here the 9DoF pose, comprising 6D pose and 3D size, is equivalent to a 3D amodal bounding box representation with free 6D pose. Given the depth point cloud at the current frame and the estimated pose from the last frame, our novel end-to-end pipeline learns to accurately update the pose. Our pipeline is composed of three modules: 1) a pose canonicalization module that normalizes the pose of the input depth point cloud; 2) RotationNet, a module that directly regresses small interframe delta rotations; and 3) CoordinateNet, a module that predicts the normalized coordinates and segmentation, enabling analytical computation of the 3D size and translation. Leveraging the small pose regime in the pose-canonicalized point clouds, our method integrates the best of both worlds by combining dense coordinate prediction and direct rotation regression, thus yielding an end-to-end differentiable pipeline optimized for 9DoF pose accuracy (without using non-differentiable RANSAC). Our extensive experiments demonstrate that our method achieves new state-of-the-art performance on category-level rigid object pose (NOCS-REAL275 [29]) and articulated object pose benchmarks (SAPIEN [34], BMVC [18]) while running at the fastest speed, about 12 FPS.
Full Text Available
3DIoUMatch: Leveraging IoU Prediction for Semi-Supervised 3D Object Detection

Wang, He ; Cong, Yezhen ; Litany, Or ; Gao, Yue ; Guibas, Leonidas J ( January 2021 , IEEE Conference on Computer Vision and Pattern Recognition)
Full Text Available
Rethinking sampling in 3D Point Cloud Generative Adversarial Networks

Wang, He ; Jiang, Zetian ; Yi, Li ; Mo, Kaichun ; Su, Hao ; Guibas, Leonidas J ( January 2021 , IEEE Conference on Computer Vision and Pattern Recognition Workshop)
Full Text Available
Joint Learning of 3D Shape Retrieval and Deformation

Uy, Mikaela Angelina ; Kim, Vladimir G ; Sung, Minhyuk ; Aigerman, Noam ; Chaudhuri, Siddhartha ; Guibas, Leonidas J ( January 2021 , IEEE Conference on Computer Vision and Pattern Recognition)
Full Text Available
MultiBodySync: Multi-Body Segmentation and Motion Estimation via 3D Scan Synchronization

Huang, Jiahui ; Wang, He ; Birdal, Tolga ; Sung, Minhyuk ; Arrigoni, Federica ; Hu, Shi-Min ; Guibas, Leonidas J ( January 2021 , IEEE Conference on Computer Vision and Pattern Recognition)
Full Text Available
Robust Neural Routing Through Space Partitions for Camera Relocalization in Dynamic Indoor Environments

Dong, Siyan ; Fan, Qingnan ; Wang, He ; Shi, Ji ; Yi, Li ; Funkhouser, Thomas ; Chen, Baoquan ; Guibas, Leonidas J ( January 2021 , IEEE Conference on Computer Vision and Pattern Recognition)
Full Text Available
