Search for: All records

Creators/Authors contains: "Fan, Qingnan"

  1. In this work, we tackle the problem of category-level online pose tracking of objects from point cloud sequences. For the first time, we propose a unified framework that can handle 9DoF pose tracking for novel rigid object instances as well as per-part pose tracking for articulated objects from known categories. Here the 9DoF pose, comprising the 6D pose and 3D size, is equivalent to a 3D amodal bounding box representation with a free 6D pose. Given the depth point cloud at the current frame and the estimated pose from the last frame, our novel end-to-end pipeline learns to accurately update the pose. Our pipeline is composed of three modules: 1) a pose canonicalization module that normalizes the pose of the input depth point cloud; 2) RotationNet, a module that directly regresses small inter-frame delta rotations; and 3) CoordinateNet, a module that predicts the normalized coordinates and segmentation, enabling analytical computation of the 3D size and translation. Leveraging the small-pose regime of the pose-canonicalized point clouds, our method combines the best of both worlds, dense coordinate prediction and direct rotation regression, yielding an end-to-end differentiable pipeline optimized for 9DoF pose accuracy (without using non-differentiable RANSAC). Our extensive experiments demonstrate that our method achieves new state-of-the-art performance on category-level rigid object pose (NOCS-REAL275 [29]) and articulated object pose benchmarks (SAPIEN [34], BMVC [18]) while running at ∼12 FPS, the fastest among the compared methods.
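
Below is a minimal, hypothetical PyTorch-style sketch of a single pose-canonicalized tracking step in the spirit of entry 1. The module names RotationNet and CoordinateNet follow the abstract, but the PointNet-style encoder, layer sizes, axis-angle parameterization, and the simple mask-weighted solve for size and translation are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn

class PointEncoder(nn.Module):
    """Shared per-point MLP plus max-pooling (a PointNet-style feature extractor)."""
    def __init__(self, feat_dim=128):
        super().__init__()
        self.mlp = nn.Sequential(nn.Linear(3, 64), nn.ReLU(),
                                 nn.Linear(64, feat_dim), nn.ReLU())

    def forward(self, pts):                      # pts: (B, N, 3)
        f = self.mlp(pts)                        # per-point features (B, N, F)
        return f, f.max(dim=1).values            # per-point and global features

class RotationNet(nn.Module):
    """Directly regresses the small inter-frame delta rotation (axis-angle)."""
    def __init__(self, feat_dim=128):
        super().__init__()
        self.enc = PointEncoder(feat_dim)
        self.head = nn.Linear(feat_dim, 3)

    def forward(self, canon_pts):
        _, g = self.enc(canon_pts)
        return self.head(g)                      # (B, 3) axis-angle vector

class CoordinateNet(nn.Module):
    """Predicts normalized (canonical) coordinates and a foreground mask."""
    def __init__(self, feat_dim=128):
        super().__init__()
        self.enc = PointEncoder(feat_dim)
        self.coord_head = nn.Linear(feat_dim, 3)
        self.seg_head = nn.Linear(feat_dim, 1)

    def forward(self, canon_pts):
        f, _ = self.enc(canon_pts)
        return self.coord_head(f), torch.sigmoid(self.seg_head(f))

def axis_angle_to_matrix(v):
    """Rodrigues' formula for a batch of axis-angle vectors (B, 3)."""
    theta = v.norm(dim=-1, keepdim=True).clamp(min=1e-8)
    k = v / theta
    K = torch.zeros(v.shape[0], 3, 3, device=v.device)
    K[:, 0, 1], K[:, 0, 2] = -k[:, 2], k[:, 1]
    K[:, 1, 0], K[:, 1, 2] = k[:, 2], -k[:, 0]
    K[:, 2, 0], K[:, 2, 1] = -k[:, 1], k[:, 0]
    I = torch.eye(3, device=v.device).expand_as(K)
    s, c = torch.sin(theta)[..., None], torch.cos(theta)[..., None]
    return I + s * K + (1.0 - c) * (K @ K)

def track_step(pts, prev_R, prev_t, prev_s, rot_net, coord_net):
    """One tracking update: canonicalize with last frame's 9DoF pose, then refine."""
    # 1) Pose canonicalization: undo the previous rotation, translation and size.
    canon = ((pts - prev_t[:, None, :]) @ prev_R) / prev_s[:, None, None]
    # 2) Direct regression of the small residual rotation in the canonical frame.
    delta_R = axis_angle_to_matrix(rot_net(canon))
    new_R = prev_R @ delta_R
    # 3) Dense normalized coordinates + mask; size and translation solved in
    #    closed form from mask-weighted statistics (a crude stand-in for the
    #    paper's analytical computation).
    nocs, mask = coord_net(canon)
    w = mask / mask.sum(dim=1, keepdim=True).clamp(min=1e-8)
    mu_canon = (w * canon).sum(dim=1, keepdim=True)
    mu_nocs = (w * nocs).sum(dim=1, keepdim=True)
    obs_spread = (w * (canon - mu_canon).norm(dim=-1, keepdim=True)).sum(dim=1)
    can_spread = (w * (nocs - mu_nocs).norm(dim=-1, keepdim=True)).sum(dim=1)
    new_s = prev_s * (obs_spread / can_spread.clamp(min=1e-8)).squeeze(-1)
    mu_obs = (w * pts).sum(dim=1)                # weighted centroid in world frame
    new_t = mu_obs - new_s[:, None] * (new_R @ mu_nocs.squeeze(1)[..., None]).squeeze(-1)
    return new_R, new_t, new_s

# Illustrative shapes only: two point clouds of 1024 points each.
rot_net, coord_net = RotationNet(), CoordinateNet()
pts = torch.randn(2, 1024, 3)
R0 = torch.eye(3).expand(2, 3, 3)
t0, s0 = torch.zeros(2, 3), torch.ones(2)
R1, t1, s1 = track_step(pts, R0, t0, s0, rot_net, coord_net)
```

At test time the refined 9DoF estimate from one frame would seed the canonicalization of the next frame, which is what keeps the networks operating in the small-pose regime the abstract describes.
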
  2. Localizing the camera in a known indoor environment is a key building block for scene mapping, robot navigation, AR, etc. Recent advances estimate the camera pose via optimization over 2D-3D or 3D-3D correspondences established between coordinates in 2D/3D camera space and 3D world space. Such a mapping is estimated with either a convolutional neural network or a decision tree using only static input image sequences, which makes these approaches vulnerable to dynamic indoor environments that are common yet challenging in the real world. To address these issues, we propose a novel outlier-aware neural tree that bridges two worlds, deep learning and decision tree approaches. It builds on three important blocks: (a) a hierarchical space partition over the indoor scene to construct the decision tree; (b) a neural routing function, implemented as a deep classification network, employed for better 3D scene understanding; and (c) an outlier rejection module used to filter out dynamic points during the hierarchical routing process. Our proposed algorithm is evaluated on the RIO-10 benchmark developed for camera relocalization in dynamic indoor environments. It achieves robust neural routing through space partitions and outperforms state-of-the-art approaches by around 30% in camera pose accuracy, while running comparably fast at evaluation time.
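
Below is a minimal, hypothetical sketch of the routing idea described in entry 2: a tree over a hierarchical space partition whose internal nodes are small neural classifiers with an extra "outlier" class. The branching factor, depth, feature size, MLP router, and learnable leaf coordinates are illustrative assumptions, not the paper's actual architecture.

```python
import torch
import torch.nn as nn

class NeuralRouter(nn.Module):
    """Routes a feature to one of `n_children` sub-regions or flags it as an outlier."""
    def __init__(self, feat_dim=64, n_children=4):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(feat_dim, 64), nn.ReLU(),
                                 nn.Linear(64, n_children + 1))  # last class = outlier

    def forward(self, feat):                       # feat: (1, feat_dim)
        return self.net(feat).argmax(dim=-1)       # child index, or n_children = reject

class OutlierAwareTree(nn.Module):
    """A small tree over a hierarchical space partition of the indoor scene.
    Internal nodes route features toward finer regions; each leaf stores a
    representative 3D world coordinate (a learnable placeholder here)."""
    def __init__(self, depth=3, n_children=4, feat_dim=64):
        super().__init__()
        self.depth, self.n_children = depth, n_children
        n_internal = sum(n_children ** d for d in range(depth))
        self.routers = nn.ModuleList([NeuralRouter(feat_dim, n_children)
                                      for _ in range(n_internal)])
        self.leaf_coords = nn.Parameter(torch.randn(n_children ** depth, 3))

    @torch.no_grad()                               # inference-time routing only
    def forward(self, feats):                      # feats: (B, feat_dim)
        coords, valid = [], []
        for f in feats:                            # route each feature independently
            node, ok = 0, True                     # index within the current level
            for d in range(self.depth):
                level_offset = sum(self.n_children ** k for k in range(d))
                choice = int(self.routers[level_offset + node](f[None]))
                if choice == self.n_children:      # dynamic/outlier point: stop early
                    ok = False
                    break
                node = node * self.n_children + choice
            coords.append(self.leaf_coords[node] if ok else torch.zeros(3))
            valid.append(ok)
        return torch.stack(coords), torch.tensor(valid)

# Illustrative use: 8 per-pixel/patch features of dimension 64.
tree = OutlierAwareTree()
coords, valid = tree(torch.randn(8, 64))           # (8, 3) world coords, (8,) keep-mask
```

In a full relocalization system, the surviving 3D predictions would be paired with their 2D pixel locations and passed to a standard pose solver (for example, PnP inside RANSAC), with the rejected dynamic points excluded from the optimization.
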