Search for: All records

Creators/Authors contains: "Xu, Zexiang"

Note: Clicking a Digital Object Identifier (DOI) link takes you to an external site maintained by the publisher. Some full-text articles may not yet be available free of charge during the embargo period.

  1. We propose the Large View Synthesis Model (LVSM), a novel transformer-based approach for scalable and generalizable novel view synthesis from sparse-view inputs. We introduce two architectures: (1) an encoder-decoder LVSM, which encodes input image tokens into a fixed number of 1D latent tokens, functioning as a fully learned scene representation, and decodes novel-view images from them; and (2) a decoder-only LVSM, which directly maps input images to novel-view outputs, completely eliminating intermediate scene representations. Both models bypass the 3D inductive biases used in previous methods—from 3D representations (e.g., NeRF, 3DGS) to network designs (e.g., epipolar projections, plane sweeps)—addressing novel view synthesis with a fully data-driven approach. While the encoder-decoder model offers faster inference due to its independent latent representation, the decoder-only LVSM achieves superior quality, scalability, and zero-shot generalization, outperforming previous state-of-the-art methods by 1.5 to 3.5 dB PSNR. Comprehensive evaluations across multiple datasets demonstrate that both LVSM variants achieve state-of-the-art novel view synthesis quality. Notably, our models surpass all previous methods even with reduced computational resources (1-2 GPUs). A minimal sketch of the decoder-only design follows this entry.
    Free, publicly-accessible full text available April 24, 2026
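
     The decoder-only idea can be summarized in a short PyTorch sketch: posed input views are patchified into tokens, target-view rays (e.g., Plücker coordinates) are embedded as query tokens, and a plain self-attention transformer maps everything to output pixels with no epipolar or plane-sweep structure. All module names, layer sizes, and shapes below are assumptions for illustration, not the paper's actual implementation.

```python
import torch
import torch.nn as nn

class DecoderOnlyLVSM(nn.Module):
    """Sketch of a decoder-only view-synthesis transformer (assumed shapes)."""
    def __init__(self, patch_dim=768, dim=768, depth=12, heads=12):
        super().__init__()
        self.embed_input = nn.Linear(patch_dim, dim)   # input-view patch tokens
        self.embed_target = nn.Linear(6, dim)          # Plücker rays of target pixels
        layer = nn.TransformerEncoderLayer(dim, heads, 4 * dim, batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, depth)
        self.to_rgb = nn.Linear(dim, patch_dim)        # token -> output RGB patch

    def forward(self, input_patches, target_rays):
        # input_patches: (B, N_in, patch_dim); target_rays: (B, N_out, 6)
        x_in = self.embed_input(input_patches)
        x_out = self.embed_target(target_rays)
        x = torch.cat([x_in, x_out], dim=1)            # full self-attention, no 3D bias
        x = self.blocks(x)
        return self.to_rgb(x[:, x_in.shape[1]:])       # keep only the target tokens
```

     Note how the only view-synthesis-specific ingredients are the ray embeddings; everything else is a generic token-to-token transformer, which is what makes the approach data-driven and scalable.
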
  2. Free, publicly-accessible full text available June 1, 2026
  3. Free, publicly-accessible full text available December 10, 2025
  4. Neural material reflectance representations address some limitations of traditional analytic BRDFs with parameter textures; they can theoretically represent any material data, whether a complex synthetic microgeometry with displacements, shadows and interreflections, or real measured reflectance. However, they still approximate the material on an infinite plane, which prevents them from correctly handling silhouette and parallax effects for viewing directions close to grazing. The goal of this paper is to design a neural material representation capable of correctly handling such silhouette effects. We extend the neural network query to take surface curvature information as input, while the query output is extended to return a transparency value in addition to reflectance. We train the new neural representation on synthetic data that contains queries spanning a variety of surface curvatures. We show an ability to accurately represent complex silhouette behavior that would traditionally require more expensive and less flexible techniques, such as on-the-fly geometry displacement or ray marching. 
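
     A rough sketch of the extended query described above, assuming a simple MLP over incoming/outgoing directions, surface UV, and a scalar curvature, returning RGB reflectance plus a transparency value for silhouette handling (all names and layer sizes are assumptions, not the paper's architecture):

```python
import torch
import torch.nn as nn

class SilhouetteNeuralBRDF(nn.Module):
    """Neural reflectance query extended with curvature input and alpha output."""
    def __init__(self, hidden=256):
        super().__init__()
        # Inputs: wi (3) + wo (3) + uv (2) + curvature (1) = 9 dims
        self.net = nn.Sequential(
            nn.Linear(9, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 4),                # RGB reflectance + transparency
        )

    def forward(self, wi, wo, uv, curvature):
        x = torch.cat([wi, wo, uv, curvature], dim=-1)
        out = self.net(x)
        rgb = torch.relu(out[..., :3])           # non-negative reflectance
        alpha = torch.sigmoid(out[..., 3:])      # transparency in [0, 1]
        return rgb, alpha
```

     The transparency output is what lets the renderer treat grazing-angle queries as partially see-through, capturing silhouette and parallax effects that a planar reflectance model alone cannot.
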
  5. Precomputed Radiance Transfer (PRT) remains an attractive solution for real-time rendering of complex light transport effects such as glossy global illumination. After precomputation, we can relight the scene with new environment maps while changing viewpoint in real time. However, practical PRT methods are usually limited to low-frequency spherical harmonic lighting. All-frequency techniques using wavelets are promising but have so far had little practical impact; the curse of dimensionality and much higher data requirements have typically limited them to relighting with a fixed view, or to direct lighting only via triple product integrals. In this paper, we demonstrate a hybrid neural-wavelet PRT solution to high-frequency indirect illumination, including glossy reflection, for relighting with changing view. Specifically, we seek to represent the light transport function in the Haar wavelet basis. For global illumination, we learn the wavelet transport using a small multi-layer perceptron (MLP) applied to a feature field as a function of spatial location and wavelet index, with reflected direction and material parameters as additional MLP inputs. We optimize the feature field (compactly represented by a tensor decomposition) and MLP parameters from multiple images of the scene under different lighting and viewing conditions. We demonstrate real-time (512 × 512 at 24 FPS, 800 × 600 at 13 FPS) precomputed rendering of challenging scenes involving view-dependent reflections and even caustics.
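
     The relighting step can be sketched as follows: an MLP, fed by per-point features plus a wavelet-index embedding, the reflected direction, and material parameters, predicts the Haar transport coefficients, and outgoing radiance is their dot product with the environment map's wavelet coefficients. This is a simplified sketch with assumed names and sizes; the paper's feature field comes from a tensor decomposition, which is not reproduced here.

```python
import torch
import torch.nn as nn

class WaveletTransportMLP(nn.Module):
    """Predicts Haar wavelet transport coefficients T_k at a shading point."""
    def __init__(self, feat_dim=32, k_dim=16, hidden=64, num_wavelets=4096):
        super().__init__()
        self.wavelet_embed = nn.Embedding(num_wavelets, k_dim)  # one code per Haar index
        self.mlp = nn.Sequential(
            nn.Linear(feat_dim + k_dim + 3 + 2, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),                                # transport coefficient
        )

    def forward(self, feat, wavelet_idx, refl_dir, material):
        # feat: (feat_dim,) feature-field sample; refl_dir: (3,); material: (2,)
        k = self.wavelet_embed(wavelet_idx)                      # (K, k_dim)
        n = k.shape[0]
        x = torch.cat([feat.expand(n, -1), k,
                       refl_dir.expand(n, -1), material.expand(n, -1)], dim=-1)
        return self.mlp(x).squeeze(-1)                           # (K,) coefficients

def relight(model, feat, refl_dir, material, light_coeffs, topk_idx):
    # Outgoing radiance = dot(T, L) over the retained Haar indices.
    t = model(feat, topk_idx, refl_dir, material)
    return (t * light_coeffs[topk_idx]).sum()
```

     Because the Haar coefficients of natural environment maps are sparse, only a small set of retained indices (topk_idx above) needs to be evaluated per shading point, which is what makes real-time rates feasible.
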
  6. Path guiding is a promising technique to reduce the variance of path tracing. Although existing online path guiding algorithms can eventually learn good sampling distributions given enough time and samples, the speed of learning becomes a major bottleneck. In this paper, we accelerate the learning of sampling distributions by training a lightweight neural network offline to reconstruct them from sparse samples. Uniquely, we design our neural network to operate convolutions directly on a sparse quadtree, regressing a high-quality hierarchical sampling distribution. Our approach reconstructs reasonably accurate sampling distributions faster than online learning, allowing for efficient path guiding and rendering. In contrast to recent offline neural path guiding techniques that reconstruct low-resolution 2D images for sampling, our hierarchical framework enables more fine-grained directional sampling with less memory usage, effectively advancing the practicality and efficiency of neural path guiding. In addition, we take advantage of hybrid bidirectional samples, including both path samples and photons, which we have found more robust across different light transport scenarios than using only one type of sample as in previous work. Experiments on diverse test scenes demonstrate that our approach often improves rendering results, with better visual quality and lower errors, while providing a proper balance of speed, memory cost, and robustness.
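
     Once the network has regressed per-node quadrant probabilities on the quadtree, drawing a guided direction is a simple hierarchical descent, as in this sketch (hypothetical data structure; the sparse-convolution reconstruction itself is not shown):

```python
import random

class QuadtreeNode:
    def __init__(self, probs, children=None):
        self.probs = probs                    # probabilities of the 4 quadrants, summing to 1
        self.children = children or [None] * 4

def sample_direction(root):
    """Descend the quadtree, picking quadrants in proportion to their learned
    probabilities; return a point in the unit square (a parameterization of
    directions) together with its pdf."""
    node, u, v, size, pdf = root, 0.0, 0.0, 1.0, 1.0
    while node is not None:
        q = random.choices(range(4), weights=node.probs)[0]
        pdf *= node.probs[q] * 4.0            # each quadrant is 1/4 of the parent's area
        size *= 0.5
        u += (q % 2) * size
        v += (q // 2) * size
        node = node.children[q]
    # jitter uniformly within the leaf cell
    return (u + random.random() * size, v + random.random() * size), pdf
```

     The hierarchy is what allows fine angular resolution only where the distribution has detail, which is the memory advantage over a fixed-resolution 2D image of the same fidelity.
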