NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Causal inference with invalid instruments: post-selection problems and a solution using searching and sampling

https://doi.org/10.1093/jrsssb/qkad049

Guo, Zijian (May 2023, Journal of the Royal Statistical Society Series B: Statistical Methodology)

Abstract Instrumental variable methods are among the most commonly used causal inference approaches to deal with unmeasured confounders in observational studies. The presence of invalid instruments is the primary concern for practical applications, and a fast-growing area of research is inference for the causal effect with possibly invalid instruments. This paper illustrates that the existing confidence intervals may undercover when the valid and invalid instruments are hard to separate in a data-dependent way. To address this, we construct uniformly valid confidence intervals that are robust to the mistakes in separating valid and invalid instruments. We propose to search for a range of treatment effect values that lead to sufficiently many valid instruments. We further devise a novel sampling method, which, together with searching, leads to a more precise confidence interval. Our proposed searching and sampling confidence intervals are uniformly valid and achieve the parametric length under the finite-sample majority and plurality rules. We apply our proposal to examine the effect of education on earnings. The proposed method is implemented in the R package RobustIV available from CRAN.
more » « less
Full Text Available
Leveraging the Fisher Randomization Test using Confidence Distributions: Inference, Combination and Fusion Learning

https://doi.org/10.1111/rssb.12429

Luo, Xiaokang; Dasgupta, Tirthankar; Xie, Minge; Liu, Regina Y. (August 2021, Journal of the Royal Statistical Society Series B: Statistical Methodology)

Abstract The flexibility and wide applicability of the Fisher randomization test (FRT) make it an attractive tool for assessment of causal effects of interventions from modern-day randomized experiments that are increasing in size and complexity. This paper provides a theoretical inferential framework for FRT by establishing its connection with confidence distributions. Such a connection leads to development’s of (i) an unambiguous procedure for inversion of FRTs to generate confidence intervals with guaranteed coverage, (ii) new insights on the effect of size of the Monte Carlo sample on the estimation of a p-value curve and (iii) generic and specific methods to combine FRTs from multiple independent experiments with theoretical guarantees. Our developments pertain to finite sample settings but have direct extensions to large samples. Simulations and a case example demonstrate the benefit of these new developments.
more » « less
A bias correction method in meta‐analysis of randomized clinical trials with no adjustments for zero‐inflated outcomes

https://doi.org/10.1002/sim.9161

Zhou, Zhengyang; Xie, Minge; Huh, David; Mun, Eun‐Young (September 2021, Statistics in Medicine)

Summary Many clinical endpoint measures, such as the number of standard drinks consumed per week or the number of days that patients stayed in the hospital, are count data with excessive zeros. However, the zero‐inflated nature of such outcomes is sometimes ignored in analyses of clinical trials. This leads to biased estimates of study‐level intervention effect and, consequently, a biased estimate of the overall intervention effect in a meta‐analysis. The current study proposes a novel statistical approach, the Zero‐inflation Bias Correction (ZIBC) method, that can account for the bias introduced when using the Poisson regression model, despite a high rate of inflated zeros in the outcome distribution of a randomized clinical trial. This correction method only requires summary information from individual studies to correct intervention effect estimates as if they were appropriately estimated using the zero‐inflated Poisson regression model, thus it is attractive for meta‐analysis when individual participant‐level data are not available in some studies. Simulation studies and real data analyses showed that the ZIBC method performed well in correcting zero‐inflation bias in most situations.
more » « less
Multivariate survival analysis in big data: A divide‐and‐combine approach

https://doi.org/10.1111/biom.13469

Wang, Wei; Lu, Shou‐En; Cheng, Jerry Q.; Xie, Minge; Kostis, John B. (April 2021, Biometrics)

Abstract Multivariate failure time data are frequently analyzed using the marginal proportional hazards models and the frailty models. When the sample size is extraordinarily large, using either approach could face computational challenges. In this paper, we focus on the marginal model approach and propose a divide‐and‐combine method to analyze large‐scale multivariate failure time data. Our method is motivated by the Myocardial Infarction Data Acquisition System (MIDAS), a New Jersey statewide database that includes 73,725,160 admissions to nonfederal hospitals and emergency rooms (ERs) from 1995 to 2017. We propose to randomly divide the full data into multiple subsets and propose a weighted method to combine these estimators obtained from individual subsets using three weights. Under mild conditions, we show that the combined estimator is asymptotically equivalent to the estimator obtained from the full data as if the data were analyzed all at once. In addition, to screen out risk factors with weak signals, we propose to perform the regularized estimation on the combined estimator using its combined confidence distribution. Theoretical properties, such as consistency, oracle properties, and asymptotic equivalence between the divide‐and‐combine approach and the full data approach are studied. Performance of the proposed method is investigated using simulation studies. Our method is applied to the MIDAS data to identify risk factors related to multivariate cardiovascular‐related health outcomes.
more » « less
A Discussion of “A Note on Universal Inference” by Tse and Davison

https://doi.org/10.1002/sta4.571

Xie, Minge (December 2023, Stat)

Full Text Available
Approximate Confidence Distribution Computing

https://doi.org/10.51387/23-NEJSDS38

Thornton, Suzanne; Li, Wentao; Xie, Minge (January 2023, The New England Journal of Statistics in Data Science)

Approximate confidence distribution computing (ACDC) offers a new take on the rapidly developing field of likelihood-free inference from within a frequentist framework. The appeal of this computational method for statistical inference hinges upon the concept of a confidence distribution, a special type of estimator which is defined with respect to the repeated sampling principle. An ACDC method provides frequentist validation for computational inference in problems with unknown or intractable likelihoods. The main theoretical contribution of this work is the identification of a matching condition necessary for frequentist validity of inference from this method. In addition to providing an example of how a modern understanding of confidence distribution theory can be used to connect Bayesian and frequentist inferential paradigms, we present a case to expand the current scope of so-called approximate Bayesian inference to include non-Bayesian inference by targeting a confidence distribution rather than a posterior. The main practical contribution of this work is the development of a data-driven approach to drive ACDC in both Bayesian or frequentist contexts. The ACDC algorithm is data-driven by the selection of a data-dependent proposal function, the structure of which is quite general and adaptable to many settings. We explore three numerical examples that both verify the theoretical arguments in the development of ACDC and suggest instances in which ACDC outperform approximate Bayesian computing methods computationally.
more » « less
Full Text Available
Doubly debiased lasso: High-dimensional inference under hidden confounding

https://doi.org/10.1214/21-AOS2152

Guo, Zijian; Ćevid, Domagoj; Bühlmann, Peter (June 2022, The Annals of Statistics)

Full Text Available
Clustering of Trajectories using Non-Parametric Conformal DBSCAN Algorithm

https://doi.org/10.1109/IPSN54338.2022.00043

Wang, Haotian; Gao, Jie; Xie, Minge (May 2022, 2022 21st ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN))

Full Text Available
Homeostasis phenomenon in conformal prediction and predictive distribution functions

https://doi.org/10.1016/j.ijar.2021.09.001

Xie, Min-ge; Zheng, Zheshi (February 2022, International Journal of Approximate Reasoning)

Full Text Available
Individualized Group Learning

https://doi.org/10.1080/01621459.2021.1947306

Cai, Chencheng; Chen, Rong; Xie, Min-ge (August 2021, Journal of the American Statistical Association)

Full Text Available

« Prev Next »

Search for: All records