NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A manifold two-sample test study: integral probability metric with neural networks

https://doi.org/10.1093/imaiai/iaad018

Wang, Jie; Chen, Minshuo; Zhao, Tuo; Liao, Wenjing; Xie, Yao (June 2023, Information and Inference: A Journal of the IMA)

Abstract Two-sample tests are important areas aiming to determine whether two collections of observations follow the same distribution or not. We propose two-sample tests based on integral probability metric (IPM) for high-dimensional samples supported on a low-dimensional manifold. We characterize the properties of proposed tests with respect to the number of samples $$n$$ and the structure of the manifold with intrinsic dimension $$d$$. When an atlas is given, we propose a two-step test to identify the difference between general distributions, which achieves the type-II risk in the order of $$n^{-1/\max \{d,2\}}$$. When an atlas is not given, we propose Hölder IPM test that applies for data distributions with $$(s,\beta )$$-Hölder densities, which achieves the type-II risk in the order of $$n^{-(s+\beta )/d}$$. To mitigate the heavy computation burden of evaluating the Hölder IPM, we approximate the Hölder function class using neural networks. Based on the approximation theory of neural networks, we show that the neural network IPM test has the type-II risk in the order of $$n^{-(s+\beta )/d}$$, which is in the same order of the type-II risk as the Hölder IPM test. Our proposed tests are adaptive to low-dimensional geometric structure because their performance crucially depends on the intrinsic dimension instead of the data dimension.
more » « less
Generalization error guaranteed auto-encoder-based nonlinear model reduction for operator learning

https://doi.org/10.1016/j.acha.2024.101717

Liu, Hao; Dahal, Biraj; Lai, Rongjie; Liao, Wenjing (January 2025, Applied and Computational Harmonic Analysis)
Fornasier, Massimo (Ed.)
Free, publicly-accessible full text available January 1, 2026
Deep Nonparametric Estimation of Operators between Infinite Dimensional Spaces

Liu, Hao; Yang, Haizhao; Chen, Minshuo; Zhao, Tuo; Liao, Wenjing (January 2024, Journal of machine learning research)

Learning operators between infinitely dimensional spaces is an important learning task arising in machine learning, imaging science, mathematical modeling and simulations, etc. This paper studies the nonparametric estimation of Lipschitz operators using deep neural networks. Non-asymptotic upper bounds are derived for the generalization error of the empirical risk minimizer over a properly chosen network class. Under the assumption that the target operator exhibits a low dimensional structure, our error bounds decay as the training sample size increases, with an attractive fast rate depending on the intrinsic dimension in our estimation. Our assumptions cover most scenarios in real applications and our results give rise to fast rates by exploiting low dimensional structures of data in operator estimation. We also investigate the influence of network structures (e.g., network width, depth, and sparsity) on the generalization error of the neural network estimator and propose a general suggestion on the choice of network structures to maximize the learning efficiency quantitatively.
more » « less
Full Text Available
Deep Nonparametric Estimation of Operators between Infinite Dimensional Spaces

Liu, Hao; Yang, Haizhao; Chen, Minshuo; Zhao, Tuo; Liao, Wenjing (January 2024, Journal of Machine Learning Research)
Raginsky, Maxim (Ed.)
Learning operators between infinitely dimensional spaces is an important learning task arising in machine learning, imaging science, mathematical modeling and simulations, etc. This paper studies the nonparametric estimation of Lipschitz operators using deep neural networks. Non-asymptotic upper bounds are derived for the generalization error of the empirical risk minimizer over a properly chosen network class. Under the assumption that the target operator exhibits a low dimensional structure, our error bounds decay as the training sample size increases, with an attractive fast rate depending on the intrinsic dimension in our estimation. Our assumptions cover most scenarios in real applications and our results give rise to fast rates by exploiting low dimensional structures of data in operator estimation. We also investigate the influence of network structures (e.g., network width, depth, and sparsity) on the generalization error of the neural network estimator and propose a general suggestion on the choice of network structures to maximize the learning efficiency quantitatively.
more » « less
Full Text Available
Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data

Chen, Minshuo; Huang, Kaixuan; Zhao, Tuo; Wang, Mengdi (July 2023, Proceedings of Machine Learning Research)

Diffusion models achieve state-of-the-art performance in various generation tasks. However, their theoretical foundations fall far behind. This paper studies score approximation, estimation, and distribution recovery of diffusion models, when data are supported on an unknown low-dimensional linear subspace. Our result provides sample complexity bounds for distribution estimation using diffusion models. We show that with a properly chosen neural network architecture, the score function can be both accurately approximated and efficiently estimated. Further, the generated distribution based on the estimated score function captures the data geometric structures and converges to a close vicinity of the data distribution. The convergence rate depends on subspace dimension, implying that diffusion models can circumvent the curse of data ambient dimensionality.
more » « less
Full Text Available
Sample complexity of nonparametric off-policy evaluation on low-dimensional manifolds using deep networks

Ji, Xiang; Chen, Minshuo; Wang, Mengdi; Zhao, Tuo (May 2023, International Conference on Learning Representations (ICLR))

Full Text Available
On Deep Generative Models for Approximation and Estimation of Distributions on Manifolds

Dahal, Biraj; Havrilla, Alexander; Chen, Minshuo; Zhao, Tuo; Liao, Wenjing (December 2022, Advances in Neural Information Processing Systems)

Deep generative models have experienced great empirical successes in distribution learning. Many existing experiments have demonstrated that deep generative networks can efficiently generate high-dimensional complex data from a low-dimensional easy-to-sample distribution. However, this phenomenon can not be justified by existing theories. The widely held manifold hypothesis speculates that real-world data sets, such as natural images and signals, exhibit low-dimensional geometric structures. In this paper, we take such low-dimensional data structures into consideration by assuming that data distributions are supported on a low-dimensional manifold. We prove approximation and estimation theories of deep generative networks for estimating distributions on a low-dimensional manifold under the Wasserstein-1 loss. We show that the Wasserstein-1 loss converges to zero at a fast rate depending on the intrinsic dimension instead of the ambient data dimension. Our theory leverages the low-dimensional geometric structures in data sets and justifies the practical power of deep generative models. We require no smoothness assumptions on the data distribution which is desirable in practice.
more » « less
Full Text Available
Effective Minkowski dimension of deep nonparametric regression: function approximation and statistical theories

Zhang, Zixuan; Chen, Minshuo; Wang, Mengdi; Liao, Wenjing; Zhao, Tuo (July 2022, International Conference on Machine Learning)

Full Text Available
Nonparametric regression on low-dimensional manifolds using deep ReLU networks

https://doi.org/doi.org/10.1093/imaiai/iaac001

Chen, Minshuo; Jiang, Haomin; Liao, Wenjing; Zhao, Tuo (March 2022, Information and inference)

Full Text Available
Multiscale regression on unknown manifolds

https://doi.org/10.3934/mine.2022028

Liao, Wenjing; Maggioni, Mauro; Vigogna, Stefano (January 2022, Mathematics in Engineering)
null (Ed.)
We consider the regression problem of estimating functions on $$ \mathbb{R}^D $$ but supported on a $ d $-dimensional manifold $$ \mathcal{M} ~~\subset \mathbb{R}^D $$ with $$ d \ll D $$. Drawing ideas from multi-resolution analysis and nonlinear approximation, we construct low-dimensional coordinates on $$ \mathcal{M} $$ at multiple scales, and perform multiscale regression by local polynomial fitting. We propose a data-driven wavelet thresholding scheme that automatically adapts to the unknown regularity of the function, allowing for efficient estimation of functions exhibiting nonuniform regularity at different locations and scales. We analyze the generalization error of our method by proving finite sample bounds in high probability on rich classes of priors. Our estimator attains optimal learning rates (up to logarithmic factors) as if the function was defined on a known Euclidean domain of dimension $ d $, instead of an unknown manifold embedded in $$ \mathbb{R}^D $$. The implemented algorithm has quasilinear complexity in the sample size, with constants linear in $ D $ and exponential in $ d $. Our work therefore establishes a new framework for regression on low-dimensional sets embedded in high dimensions, with fast implementation and strong theoretical guarantees.
more » « less
Full Text Available

« Prev Next »

Search for: All records