NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Extended fiducial inference for individual treatment effects via deep neural networks

https://doi.org/10.1007/s11222-025-10624-8

Kim, Sehwan; Liang, Faming (May 2025, Statistics and Computing)

Abstract Individual treatment effect estimation has gained significant attention in recent data science literature. This work introduces the Double Neural Network (Double-NN) method to address this problem within the framework of extended fiducial inference (EFI). In the proposed method, deep neural networks are used to model the treatment and control effect functions, while an additional neural network is employed to estimate their parameters. The universal approximation capability of deep neural networks ensures the broad applicability of this method. Numerical results highlight the superior performance of the proposed Double-NN method compared to the conformal quantile regression (CQR) method in individual treatment effect estimation. From the perspective of statistical inference, this work advances the theory and methodology for statistical inference of large models. Specifically, it is theoretically proven that the proposed method permits the model size to increase with the sample sizenat a rate of$$O(n^{\zeta })$$ $O (n^{ζ})$ for some$$0 \le \zeta <1$$ $0 \leq ζ < 1$ , while still maintaining proper quantification of uncertainty in the model parameters. This result marks a significant improvement compared to the range$$0\le \zeta < \frac{1}{2}$$ $0 \leq ζ < \frac{1}{2}$ required by the classical central limit theorem. Furthermore, this work provides a rigorous framework for quantifying the uncertainty of deep neural networks under the neural scaling law, representing a substantial contribution to the statistical understanding of large-scale neural network models.
more » « less
A New Paradigm for Generative Adversarial Networks Based on Randomized Decision Rules

https://doi.org/10.5705/ss.202022.0404

Kim, Sehwan; Song, Qifan; Liang, Faming (January 2025, Statistica Sinica)

Full Text Available
Deep network embedding with dimension selection

https://doi.org/10.1016/j.neunet.2024.106512

Dong, Tianning; Sun, Yan; Liang, Faming (November 2024, Neural Networks)

Full Text Available
Extended fiducial inference: toward an automated process of statistical inference

https://doi.org/10.1093/jrsssb/qkae082

Liang, Faming; Kim, Sehwan; Sun, Yan (August 2024, Journal of the Royal Statistical Society Series B: Statistical Methodology)

Abstract While fiducial inference was widely considered a big blunder by R.A. Fisher, the goal he initially set—‘inferring the uncertainty of model parameters on the basis of observations’—has been continually pursued by many statisticians. To this end, we develop a new statistical inference method called extended Fiducial inference (EFI). The new method achieves the goal of fiducial inference by leveraging advanced statistical computing techniques while remaining scalable for big data. Extended Fiducial inference involves jointly imputing random errors realized in observations using stochastic gradient Markov chain Monte Carlo and estimating the inverse function using a sparse deep neural network (DNN). The consistency of the sparse DNN estimator ensures that the uncertainty embedded in observations is properly propagated to model parameters through the estimated inverse function, thereby validating downstream statistical inference. Compared to frequentist and Bayesian methods, EFI offers significant advantages in parameter estimation and hypothesis testing. Specifically, EFI provides higher fidelity in parameter estimation, especially when outliers are present in the observations; and eliminates the need for theoretical reference distributions in hypothesis testing, thereby automating the statistical inference process. Extended Fiducial inference also provides an innovative framework for semisupervised learning.
more » « less
Time‐varying dynamic Bayesian network learning for an fMRI study of emotion processing

https://doi.org/10.1002/sim.10096

Sun, Lizhe; Zhang, Aiying; Liang, Faming (June 2024, Statistics in Medicine)

This article presents a novel method for learning time‐varying dynamic Bayesian networks. The proposed method breaks down the dynamic Bayesian network learning problem into a sequence of regression inference problems and tackles each problem using the Markov neighborhood regression technique. Notably, the method demonstrates scalability concerning data dimensionality, accommodates time‐varying network structure, and naturally handles multi‐subject data. The proposed method exhibits consistency and offers superior performance compared to existing methods in terms of estimation accuracy and computational efficiency, as supported by extensive numerical experiments. To showcase its effectiveness, we apply the proposed method to an fMRI study investigating the effective connectivity among various regions of interest (ROIs) during an emotion‐processing task. Our findings reveal the pivotal role of the subcortical‐cerebellum in emotion processing.
more » « less
Full Text Available
Fast Value Tracking for Deep Reinforcement Learning

Shih, Frank; Liang, Faming (May 2024, The Twelfth International Conference on Learning Representations)

Reinforcement learning (RL) tackles sequential decision-making problems by creating agents that interacts with their environment. However, existing algorithms often view these problem as static, focusing on point estimates for model parameters to maximize expected rewards, neglecting the stochastic dynamics of agent-environment interactions and the critical role of uncertainty quantification. Our research leverages the Kalman filtering paradigm to introduce a novel and scalable sampling algorithm called Langevinized Kalman Temporal-Difference (LKTD) for deep reinforcement learning. This algorithm, grounded in Stochastic Gradient Markov Chain Monte Carlo (SGMCMC), efficiently draws samples from the posterior distribution of deep neural network parameters. Under mild conditions, we prove that the posterior samples generated by the LKTD algorithm converge to a stationary distribution. This convergence not only enables us to quantify uncertainties associated with the value function and model parameters but also allows us to monitor these uncertainties during policy updates throughout the training phase. The LKTD algorithm paves the way for more robust and adaptable reinforcement learning approaches.
more » « less
Causal-StoNet: Causal Inference for High-Dimensional Complex Data

Fang, Yaxin; Liang, Faming (May 2024, The Twelfth International Conference on Learning Representations (ICLR 2024))

With the advancement of data science, the collection of increasingly complex datasets has become commonplace. In such datasets, the data dimension can be extremely high, and the underlying data generation process can be unknown and highly nonlinear. As a result, the task of making causal inference with high-dimensional complex data has become a fundamental problem in many disciplines, such as medicine, econometrics, and social science. However, the existing methods for causal inference are frequently developed under the assumption that the data dimension is low or that the underlying data generation process is linear or approximately linear. To address these challenges, this paper proposes a novel causal inference approach for dealing with high-dimensional complex data. The proposed approach is based on deep learning techniques, including sparse deep learning theory and stochastic neural networks, that have been developed in recent literature. By using these techniques, the proposed approach can address both the high dimensionality and unknown data generation process in a coherent way. Furthermore, the proposed approach can also be used when missing values are present in the datasets. Extensive numerical studies indicate that the proposed approach outperforms existing ones.
more » « less
Sparse Deep Learning for Time Series Data: Theory and Applications

Zhang, Mingxuan; Sun, Yan; Liang, Faming (March 2024, Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023)
A double regression method for graphical modeling of high-dimensional nonlinear and non-Gaussian data

https://doi.org/10.4310/22-sii756

Liang, Siqi; Liang, Faming (January 2024, Statistics and Its Interface)

Full Text Available
Magnitude Pruning of Large Pretrained Transformer Models with a Mixture Gaussian Prior

https://doi.org/10.6339/24-JDS1156

Zhang, Mingxuan; Sun, Yan; Liang, Faming (January 2024, Journal of Data Science)

Large pretrained transformer models have revolutionized modern AI applications with their state-of-the-art performance in natural language processing (NLP). However, their substantial parameter count poses challenges for real-world deployment. To address this, researchers often reduce model size by pruning parameters based on their magnitude or sensitivity. Previous research has demonstrated the limitations of magnitude pruning, especially in the context of transfer learning for modern NLP tasks. In this paper, we introduce a new magnitude-based pruning algorithm called mixture Gaussian prior pruning (MGPP), which employs a mixture Gaussian prior for regularization. MGPP prunes non-expressive weights under the guidance of the mixture Gaussian prior, aiming to retain the model’s expressive capability. Extensive evaluations across various NLP tasks, including natural language understanding, question answering, and natural language generation, demonstrate the superiority of MGPP over existing pruning methods, particularly in high sparsity settings. Additionally, we provide a theoretical justification for the consistency of the sparse transformer, shedding light on the effectiveness of the proposed pruning method.
more » « less
Full Text Available

« Prev Next »

Search for: All records