This content will become publicly available on February 26, 2026

Title: RGB2Point: 3D Point Cloud Generation from Single RGB Images
We introduce RGB2Point, a Transformer-based approach that generates a 3D point cloud from a single unposed RGB image. RGB2Point takes an input image of an object and generates a dense 3D point cloud. In contrast to prior works based on CNN layers and diffusion-denoising approaches, we use pre-trained Transformer layers that are fast and generate high-quality point clouds with consistent quality across the available categories. Our generated point clouds demonstrate high quality on a real-world dataset, as evidenced by improved Chamfer distance (51.15%) and Earth Mover’s distance (36.17%) metrics compared to the current state-of-the-art. Additionally, our approach shows better quality on a synthetic dataset, achieving better Chamfer distance (39.26%), Earth Mover’s distance (26.95%), and F-score (47.16%). Moreover, our method produces 63.1% more consistent high-quality results across various object categories compared to prior works. Furthermore, RGB2Point is computationally efficient, requiring only 2.3 GB of VRAM to reconstruct a 3D point cloud from a single RGB image, and our implementation generates results 15,133× faster than a SOTA diffusion-based model.
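For context on the evaluation metric cited above, the sketch below is a minimal NumPy implementation of the symmetric Chamfer distance in its common average-of-nearest-neighbors form. It is illustrative only and is not drawn from the RGB2Point code; the paper may report a squared or differently normalized variant, and the function name here is hypothetical.

```python
import numpy as np

def chamfer_distance(p: np.ndarray, q: np.ndarray) -> float:
    """Symmetric Chamfer distance between point sets p (N, 3) and q (M, 3).

    Uses the common average-of-nearest-neighbor-distances form; individual
    papers sometimes report a squared or sum-based variant instead.
    """
    # Pairwise Euclidean distances, shape (N, M).
    diff = p[:, None, :] - q[None, :, :]
    dists = np.linalg.norm(diff, axis=-1)

    # For each point, the distance to its nearest neighbor in the other set.
    p_to_q = dists.min(axis=1).mean()
    q_to_p = dists.min(axis=0).mean()
    return p_to_q + q_to_p

# Example: compare two random point clouds of different sizes.
rng = np.random.default_rng(0)
print(chamfer_distance(rng.random((1024, 3)), rng.random((2048, 3))))
```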
Award ID(s):
2417510; 2412928
PAR ID:
10587438
Author(s) / Creator(s):
;
Publisher / Repository:
IEEE
Date Published:
ISBN:
979-8-3315-1083-1
Page Range / eLocation ID:
2952 to 2962
Format(s):
Medium: X
Location:
Tucson, AZ, USA
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract: The manipulation of 3D objects is becoming crucial for many applications, such as health, industry, or entertainment, to name a few. However, these 3D objects require substantial energy and different types of resources. With the goal of obtaining a simplified representation of a 3D object that can be easily managed, for example, for transmission, the authors of some recent works associate low-density point clouds with a 3D object, simplifying the original 3D object. More precisely, given a 3D object in a polyhedral format, some authors associate a chain code and then use a context-free grammar to obtain key points that give rise to several point clouds with different densities. In this work, we complete the cycle by developing a polyhedral reconstruction from an associated low-density point cloud and the chain code. The polyhedral reconstruction is crucial for handling 3D objects because it allows us to visualize them after they have been efficiently compressed and transmitted. We apply our algorithms to well-known 3D objects in the literature. We use the Hausdorff and Chamfer distances to compare our results with state-of-the-art proposals. We show how our proposed polyhedral reconstruction based on a helical chain code reconstructs a medical image represented or transmitted by slices into a 3D object in a polyhedral format, thus helping to ease the management of 3D medical objects. The polyhedron that we propose provides better compression when compared with the original set of slices of a 3D medical object.
  2.
    We investigate the problem of learning to generate 3D parametric surface representations for novel object instances, as seen from one or more views. Previous work on learning shape reconstruction from multiple views uses discrete representations such as point clouds or voxels, while continuous surface generation approaches lack multi-view consistency. We address these issues by designing neural networks capable of generating high-quality parametric 3D surfaces which are also consistent between views. Furthermore, the generated 3D surfaces preserve accurate image pixel to 3D surface point correspondences, allowing us to lift texture information to reconstruct shapes with rich geometry and appearance. Our method is supervised and trained on a public dataset of shapes from common object categories. Quantitative results indicate that our method significantly outperforms previous work, while qualitative results demonstrate the high quality of our reconstructions. 
  3. 3D object detection (OD) is a crucial element in scene understanding. However, most existing 3D OD models have been tailored to work with light detection and ranging (LiDAR) and RGB-D point cloud data, leaving their performance on commonly available visual-inertial simultaneous localization and mapping (VI-SLAM) point clouds unexamined. In this paper, we create and release two datasets: VIP500, 4772 VI-SLAM point clouds covering 500 different object and environment configurations, and VIP500-D, an accompanying set of 20 RGB-D point clouds for the object classes and shapes in VIP500. We then use these datasets to quantify the differences between VI-SLAM point clouds and dense RGB-D point clouds, as well as the discrepancies between VI-SLAM point clouds generated with different object and environment characteristics. Finally, we evaluate the performance of three leading OD models on the diverse data in our VIP500 dataset, revealing the promise of OD models trained on VI-SLAM data; we examine the extent to which both object and environment characteristics impact performance, along with the underlying causes. 
  4. The success of 6-DoF grasp learning with point cloud input is tempered by the computational costs that result from the point clouds' unordered nature and by the pre-processing needed to reduce the point cloud to a manageable size. These properties lead to failure on small objects with low point cloud cardinality. Instead of point clouds, this manuscript explores grasp generation directly from RGB-D image input. The approach, called Keypoint-GraspNet (KGN), operates in perception space by detecting projected gripper keypoints in the image, then recovering their SE(3) poses with a PnP algorithm (see the PnP sketch after this list). Training of the network involves a synthetic dataset derived from primitive-shape objects with known continuous grasp families. Trained with only single-object synthetic data, Keypoint-GraspNet achieves superior results on our single-object dataset, comparable performance with state-of-the-art baselines on a multi-object test set, and outperforms the most competitive baseline on small objects. Keypoint-GraspNet is more than 3x faster than the tested point cloud methods. Robot experiments show a high success rate, demonstrating KGN's practical potential.
  5. Abstract. We present 4Diff, a 3D-aware diffusion model addressing the exo-to-ego viewpoint translation task—generating first-person (egocentric) view images from the corresponding third-person (exocentric) images. Building on the diffusion model’s ability to generate photorealistic images, we propose a transformer-based diffusion model that incorporates geometry priors through two mechanisms: (i) egocentric point cloud rasterization and (ii) 3D-aware rotary cross-attention. Egocentric point cloud rasterization converts the input exocentric image into an egocentric layout, which is subsequently used by a diffusion image transformer. As a component of the diffusion transformer’s denoiser block, the 3D-aware rotary cross-attention further incorporates 3D information and semantic features from the source exocentric view. Our 4Diff achieves state-of-the-art results on the challenging and diverse Ego-Exo4D multiview dataset and exhibits robust generalization to novel environments not encountered during training. Our code, processed data, and pretrained models are publicly available at https://klauscc.github.io/4diff. 
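As referenced in item 4 above, recovering an SE(3) pose from detected 2D keypoints is typically done with a Perspective-n-Point (PnP) solver. The sketch below uses OpenCV's general-purpose solvePnP on hypothetical gripper keypoints and camera intrinsics; it illustrates only the standard PnP step under those assumptions and is not the Keypoint-GraspNet implementation.

```python
import numpy as np
import cv2

# Hypothetical 3D gripper keypoints in the gripper's canonical frame (meters).
object_points = np.array([
    [0.00, 0.00, 0.00],   # gripper base
    [0.04, 0.00, 0.00],   # left fingertip
    [-0.04, 0.00, 0.00],  # right fingertip
    [0.00, 0.00, 0.06],   # approach-direction marker
], dtype=np.float64)

# Detected 2D keypoints in the image (pixels), e.g. from a keypoint network.
image_points = np.array([
    [320.0, 240.0],
    [360.0, 238.0],
    [281.0, 243.0],
    [322.0, 180.0],
], dtype=np.float64)

# Hypothetical pinhole camera intrinsics; no lens distortion assumed.
K = np.array([[600.0, 0.0, 320.0],
              [0.0, 600.0, 240.0],
              [0.0, 0.0, 1.0]])
dist_coeffs = np.zeros(5)

# Solve for the rotation (as a Rodrigues vector) and translation that map
# the gripper frame into the camera frame.
ok, rvec, tvec = cv2.solvePnP(object_points, image_points, K, dist_coeffs)
if ok:
    R, _ = cv2.Rodrigues(rvec)           # 3x3 rotation matrix
    T = np.eye(4)
    T[:3, :3], T[:3, 3] = R, tvec.ravel()
    print("Estimated SE(3) gripper pose in the camera frame:\n", T)
```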