NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

GenAnalysis: Joint Shape Analysis by Learning Man-Made Shape Generators with Deformation Regularizations.

Yang, Yuezhi; Yang, Haitao; Nakayama, George; Huang, Xiangru; Guibas, Leonidas; Huang, Qixing (August 2025, ACM transactions on graphics)

Free, publicly-accessible full text available August 1, 2026
An Efficient Global-to-Local Rotation Optimization Approach via Spherical Harmonics

He, Zihang; Yang, Yuezhi; Deng, Congyue; Lu, Jiaxin; Guibas, Leonidas; Huang, Qixing (July 2025, Computer Graphics Forum)

Free, publicly-accessible full text available July 1, 2026
Diffusion Self-Distillation for Zero-Shot Customized Image Generation

Cai, Shengqu; Chan, Eric Ryan; Zhang, Yunzhi; Guibas, Leonidas; Wu, Jiajun; Wetzstein, Gordon (June 2025, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))

Free, publicly-accessible full text available June 1, 2026
Neural Attention Field: Emerging Point Relevance in 3D Scenes for One-Shot Dexterous Grasping

Wang, Qianxu; Deng, Congyue; Lum, Tyler; Chen, Yuanpei; Yang, Yaodong; Bohg, Jeannette; Zhu, Yixin; Guibas, Leonidas (November 2024, Proceedings of Machine Learning Research)

Full Text Available
NAP: Neural 3D Articulated Object Prior

Lei, Jiahui; Deng, Congyue; Shen, Bokui; Guibas, Leonidas; Daniilidis, Kostas (September 2023, Neurips - Openreview)

Full Text Available
Banana: Banach Fixed-Point Network for Pointcloud Segmentation with Inter-Part Equivariance

Deng, Congyue; Lei, Jiahui; Shen, Bokui; Daniilidis, Kostas; Guibas, Leonidas (September 2023, Neurips - Openreview)

Full Text Available
ACID: Action-Conditional Implicit Visual Dynamics for Deformable Object Manipulation

Shen, Bokui; Jiang, Zhenyu; Choy, Christopher; Savarese, Silvio; Guibas, Leonidas J.; Anandkumar, Anima; Zhu, Yuke (June 2022, Robotics: Science and Systems XVIII)

Full Text Available
Where2Act: From Pixels to Actions for Articulated 3D Objects

https://doi.org/10.1109/ICCV48922.2021.00674

Mo, Kaichun; Guibas, Leonidas; Mukadam, Mustafa; Gupta, Abhinav; Tulsiani, Shubham (October 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV))

One of the fundamental goals of visual perception is to allow agents to meaningfully interact with their environment. In this paper, we take a step towards that long-term goal – we extract highly localized actionable information related to elementary actions such as pushing or pulling for articulated objects with movable parts. For example, given a drawer, our network predicts that applying a pulling force on the handle opens the drawer. We propose, discuss, and evaluate novel network architectures that given image and depth data, predict the set of actions possible at each pixel, and the regions over articulated parts that are likely to move under the force. We propose a learning-from-interaction framework with an online data sampling strategy that allows us to train the network in simulation (SAPIEN) and generalizes across categories. Check the website for code and data release.
more » « less
Full Text Available
HuMoR: 3D Human Motion Model for Robust Pose Estimation

https://doi.org/10.1109/ICCV48922.2021.01129

Rempe, Davis; Birdal, Tolga; Hertzmann, Aaron; Yang, Jimei; Sridhar, Srinath; Guibas, Leonidas J. (October 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV))

We introduce HuMoR: a 3D Human Motion Model for Robust Estimation of temporal pose and shape. Though substantial progress has been made in estimating 3D human motion and shape from dynamic observations, recovering plausible pose sequences in the presence of noise and occlusions remains a challenge. For this purpose, we propose an expressive generative model in the form of a conditional variational autoencoder, which learns a distribution of the change in pose at each step of a motion sequence. Furthermore, we introduce a flexible optimization-based approach that leverages HuMoR as a motion prior to robustly estimate plausible pose and shape from ambiguous observations. Through extensive evaluations, we demonstrate that our model generalizes to diverse motions and body shapes after training on a large motion capture dataset, and enables motion reconstruction from multiple input modalities including 3D keypoints and RGB(-D) videos. See the project page at geometry.stanford.edu/projects/humor.
more » « less
Full Text Available
A Functional Approach to Rotation Equivariant Non-Linearities for Tensor Field Networks

Poulenard, Adrien; Guibas, Leonidas (January 2021, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition)
null (Ed.)
Learning pose invariant representation is a fundamental problem in shape analysis. Most existing deep learning algorithms for 3D shape analysis are not robust to rotations and are often trained on synthetic datasets consisting of pre-aligned shapes, yielding poor generalization to unseen poses. This observation motivates a growing interest in rotation invariant and equivariant methods. The field of rotation equivariant deep learning is developing in recent years thanks to a well established theory of Lie group representations and convolutions. A fundamental problem in equivariant deep learning is to design activation functions which are both informative and preserve equivariance. The recently introduced Tensor Field Network (TFN) framework provides a rotation equivariant network design for point cloud analysis. TFN features undergo a rotation in feature space given a rotation of the input pointcloud. TFN and similar designs consider nonlinearities which operate only over rotation invariant features such as the norm of equivariant features to preserve equivariance, making them unable to capture the directional information. In a recent work entitled "Gauge Equivariant Mesh CNNs: Anisotropic Convolutions on Geometric Graphs" Hann et al. interpret 2D rotation equivariant features as Fourier coefficients of functions on the circle. In this work we transpose the idea of Hann et al. to 3D by interpreting TFN features as spherical harmonics coefficients of functions on the sphere. We introduce a new equivariant nonlinearity and pooling for TFN. We show improvments over the original TFN design and other equivariant nonlinearities in classification and segmentation tasks. Furthermore our method is competitive with state of the art rotation invariant methods in some instances.
more » « less
Full Text Available

« Prev Next »

Search for: All records