NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Time of the Flight of the Gaussians: Optimizing Depth Indirectly in Dynamic Radiance Fields

Li, Runfeng; Okunev, Mikhail; Guo, Zixuan; Duong, Anh Ha; Richardt, Christian; O’Toole, Matthew; Tompkin, James (June 2025, IEEE/CVF Computer Vision and Pattern Recognition)

We present a method to reconstruct dynamic scenes from monocular continuous-wave time-of-flight (C-ToF) cameras using raw sensor samples that achieves similar or better accuracy than neural volumetric approaches and is 100×faster. Quickly achieving high-fidelity dynamic 3D reconstruction from a single viewpoint is a significant challenge in computer vision. In C-ToF radiance field reconstruction, the property of interest—depth—is not directly measured, causing an additional challenge. This problem has a large and underappreciated impact upon the optimization when using a fast primitive-based scene representation like 3D Gaussian splatting, which is commonly used with multi-view data to produce satisfactory results and is brittle in its optimization otherwise. We incorporate two heuristics into the optimization to improve the accuracy of scene geometry represented by Gaussians. Experimental results show that our approach produces accurate reconstructions under constrained C-ToF sensing conditions, including for fast motions like swinging baseball bats. https://visual.cs.brown.edu/gftorf
more » « less
Free, publicly-accessible full text available June 11, 2026
Zero-Shot Monocular Scene Flow Estimation in the Wild

Liang, Yiqing; Badki, Abhishek; Su, Hang; Tompkin, James; Gallo, Orazio (June 2025, IEEE/CVF Computer Vision and Pattern Recognition)

Large models have shown generalization across datasets for many low-level vision tasks, like depth estimation, but no such general models exist for scene flow. Even though scene flow prediction has wide potential, its practical use is limited because of the lack of generalization of current predictive models. We identify three key challenges and propose solutions for each. First, we create a method that jointly estimates geometry and motion for accurate prediction. Second, we alleviate scene flow data scarcity with a data recipe that affords us 1M annotated training samples across diverse synthetic scenes. Third, we evaluate different parameterizations for scene flow prediction and adopt a natural and effective parameterization. Our model outperforms existing methods as well as baselines built on large-scale models in terms of 3D end-point error, and shows zero-shot generalization to the casually captured videos from DAVIS and the robotic manipulation scenes from RoboTAP. Overall, our approach makes scene flow prediction more practical in-the-wild. Website: https://research.nvidia.com/labs/lpr/zero msf/
more » « less
Free, publicly-accessible full text available June 11, 2026
Local Gaussian Density Mixtures for Unstructured Lumigraph Rendering

https://doi.org/10.1145/3680528.3687659

Wu, Xiuchao; Xu, Jiamin; Wang, Chi; Peng, Yifan; Huang, Qixing; Tompkin, James; Xu, Weiwei (December 2024, ACM)

Full Text Available
Flowed Time of Flight Radiance Fields

Okunev, Mikhail; Mapeke, Marc; Attal, Benjamin; Richardt, Christian; O'Toole, Matthew; Tompkin, James (October 2024, European Conference on Computer Vision)

Full Text Available
Are Multi-view Edges Incomplete for Depth Estimation?

https://doi.org/10.1007/s11263-023-01890-y

Khan, Numair; Kim, Min H; Tompkin, James (July 2024, International Journal of Computer Vision)

Full Text Available
OmniSDF: Scene Reconstruction using Omnidirectional Signed Distance Functions and Adaptive Binoctrees

Kim, Hakyeong; Meuleman, Andreas; Jang, Hyeonjoong; Tompkin, James; Kim, Min H (June 2024, Computer Vision and Pattern Recognition)

Full Text Available
ScaNeRF: Scalable Bundle-Adjusting Neural Radiance Fields for Large-Scale Scene Rendering

https://doi.org/10.1145/3618369

Wu, Xiuchao; Xu, Jiamin; Zhang, Xin; Bao, Hujun; Huang, Qixing; Shen, Yujun; Tompkin, James; Xu, Weiwei (December 2023, ACM Transactions on Graphics)

High-quality large-scale scene rendering requires a scalable representation and accurate camera poses. This research combines tile-based hybrid neural fields with parallel distributive optimization to improve bundle-adjusting neural radiance fields. The proposed method scales with a divide-and-conquer strategy. We partition scenes into tiles, each with a multi-resolution hash feature grid and shallow chained diffuse and specular multilayer perceptrons (MLPs). Tiles unify foreground and background via a spatial contraction function that allows both distant objects in outdoor scenes and planar reflections as virtual images outside the tile. Decomposing appearance with the specular MLP allows a specular-aware warping loss to provide a second optimization path for camera poses. We apply the alternating direction method of multipliers (ADMM) to achieve consensus among camera poses while maintaining parallel tile optimization. Experimental results show that our method outperforms state-of-the-art neural scene rendering method quality by 5%--10% in PSNR, maintaining sharp distant objects and view-dependent reflections across six indoor and outdoor scenes.
more » « less
Full Text Available
Neural Fields for Structured Lighting

https://doi.org/10.1109/ICCV51070.2023.00325

Shandilya, Aarrushi; Attal, Benjamin; Richardt, Christian; Tompkin, James; O’Toole, Matthew (October 2023, IEEE)
Differentiable Appearance Acquisition from a Flash/No-flash RGB-D Pair

https://doi.org/10.1109/iccp54855.2022.9887646

Ku, Hyun Jin; Hat, Hyunho; Lee, Joo Ho; Kang, Dahyun; Tompkin, James; Kim, Min H. (August 2022, International Conference on Computational Photography)

Reconstructing 3D objects in natural environments requires solving the ill-posed problem of geometry, spatially-varying material, and lighting estimation. As such, many approaches impractically constrain to a dark environment, use controlled lighting rigs, or use few handheld captures but suffer reduced quality. We develop a method that uses just two smartphone exposures captured in ambient lighting to reconstruct appearance more accurately and practically than baseline methods. Our insight is that we can use a flash/no-flash RGB-D pair to pose an inverse rendering problem using point lighting. This allows efficient differentiable rendering to optimize depth and normals from a good initialization and so also the simultaneous optimization of diffuse environment illumination and SVBRDF material. We find that this reduces diffuse albedo error by 25%, specular error by 46%, and normal error by 30% against single and paired-image baselines that use learning-based techniques. Given that our approach is practical for everyday solid objects, we enable photorealistic relighting for mobile photography and easier content creation for augmented reality.
more » « less
Full Text Available

Search for: All records