NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A minimum Wasserstein distance approach to Fisher's combination of independent, discrete p ‐values

https://doi.org/10.1111/sjos.12787

Contador, Gonzalo; Wu, Zheyang (April 2025, Scandinavian Journal of Statistics)

ABSTRACT This article introduces a comprehensive framework to adjust a discrete test statistic for improving its hypothesis testing procedure. The adjustment minimizes the Wasserstein distance to a null‐approximating continuous distribution, tackling some fundamental challenges inherent in combining statistical significances derived from discrete distributions. The related theory justifies Lancaster's mid‐p and mean‐value chi‐squared statistics for Fisher's combination as special cases. To counter the conservative nature of Lancaster's testing procedures, we propose an updated null‐approximating distribution. It is achieved by further minimizing the Wasserstein distance to the adjusted statistics within an appropriate distribution family. Specifically, in the context of Fisher's combination, we propose an optimal gamma distribution as a substitute for the traditionally used chi‐squared distribution. This new approach yields an asymptotically consistent test that significantly improves Type I error control and enhances statistical power.
more » « less
Signal-noise ratio of genetic associations and statistical power of SNP-set tests

https://doi.org/10.1214/22-AOAS1725

Zhang, Hong; Liu, Ming; Jin, Jiashun; Wu, Zheyang (September 2023, The Annals of Applied Statistics)

Full Text Available
Efficient Generation of Pretraining Samples for Developing a Deep Learning Brain Injury Model via Transfer Learning

https://doi.org/10.1007/s10439-023-03354-3

Lin, Nan; Wu, Shaoju; Wu, Zheyang; Ji, Songbai (August 2023, Annals of Biomedical Engineering)

Full Text Available
Signal-noise ratio of genetic associations and statistical power of SNP-set tests

Zhang, Hong; Liu, Ming; Jin, Jiashun; Wu, Zheyang (January 2023, Annals of applied statistics)

The SNP-set analysis is a powerful tool for dissecting the genetics of complex human diseases. There are three fundamental genetic association approaches to SNR-set analysis: the marginal model fitting approach, the joint model fitting approach, and the decorrelation approach. A problem of primary interest is how these approaches compare with each other. To address this problem, we develop a theoretical platform to compare the signal-to-noise ratio (SNR) of these approaches under the generalized linear model. We elaborate on how causal genetic effects give rise to statistically detectable association signals, and show that when causal effects spread over blocks of strong linkage disequilibrium (LD), the SNR of the marginal model fitting is usually higher than that of the decorrelation approach, which in turn is higher than that of the unbiased joint model fitting approach. We also scrutinize dense effects and LDs by a bivariate model and extensive simulations using the 1000 Genome Project data. Last, we compare the statistical power of two generic types of SNP-set tests (summation-based and supremum-based) by simulations and an osteoporosis study using large data from UK Biobank. Our results help develop powerful tools for SNP-set analysis and understand the signal detection problem in the presence of colored noise.
more » « less
Full Text Available
Approximating subject-specific brain injury models via scaling based on head–brain morphological relationships

https://doi.org/10.1007/s10237-022-01638-6

Wu, Shaoju; Zhao, Wei; Wu, Zheyang; McAllister, Thomas; Hu, Jingwen; Ji, Songbai (February 2023, Biomechanics and Modeling in Mechanobiology)

Full Text Available
Simultaneous detection of novel genes and SNPs by adaptive p-value combination

https://doi.org/10.3389/fgene.2022.1009428

Chen, Xiaohui; Zhang, Hong; Liu, Ming; Deng, Hong-Wen; Wu, Zheyang (November 2022, Frontiers in Genetics)

Combining SNP p -values from GWAS summary data is a promising strategy for detecting novel genetic factors. Existing statistical methods for the p -value-based SNP-set testing confront two challenges. First, the statistical power of different methods depends on unknown patterns of genetic effects that could drastically vary over different SNP sets. Second, they do not identify which SNPs primarily contribute to the global association of the whole set. We propose a new signal-adaptive analysis pipeline to address these challenges using the omnibus thresholding Fisher’s method (oTFisher). The oTFisher remains robustly powerful over various patterns of genetic effects. Its adaptive thresholding can be applied to estimate important SNPs contributing to the overall significance of the given SNP set. We develop efficient calculation algorithms to control the type I error rate, which accounts for the linkage disequilibrium among SNPs. Extensive simulations show that the oTFisher has robustly high power and provides a higher balanced accuracy in screening SNPs than the traditional Bonferroni and FDR procedures. We applied the oTFisher to study the genetic association of genes and haplotype blocks of the bone density-related traits using the summary data of the Genetic Factors for Osteoporosis Consortium. The oTFisher identified more novel and literature-reported genetic factors than existing p -value combination methods. Relevant computation has been implemented into the R package TFisher to support similar data analysis.
more » « less
Full Text Available
The General Goodness-of-fit Tests for Correlated Data

https://doi.org/https://doi.org/10.1016/j.csda.2021.107379

Zhang, Hong; Wu, Zheyang (April 2022, Computational statistics data analysis)

Full Text Available
A Fast and Accurate Approximation to the Distributions of Quadratic Forms of Gaussian Variables

https://doi.org/10.1080/10618600.2021.2000423

Zhang, Hong; Shen, Judong; Wu, Zheyang (January 2022, Journal of Computational and Graphical Statistics)

Full Text Available
The Generalized Fisher's Combination and Accurate P -Value Calculation under Dependence

https://doi.org/10.1111/biom.13634

Zhang, Hong; Wu, Zheyang (February 2022, Biometrics)

Abstract Combining dependent tests of significance has broad applications but the related p-value calculation is challenging. For Fisher's combination test, current p-value calculation methods (eg, Brown's approximation) tend to inflate the type I error rate when the desired significance level is substantially less than 0.05. The problem could lead to significant false discoveries in big data analyses. This paper provides two main contributions. First, it presents a general family of Fisher type statistics, referred to as the GFisher, which covers many classic statistics, such as Fisher's combination, Good's statistic, Lancaster's statistic, weighted Z-score combination, and so forth. The GFisher allows a flexible weighting scheme, as well as an omnibus procedure that automatically adapts proper weights and the statistic-defining parameters to a given data. Second, the paper presents several new p-value calculation methods based on two novel ideas: moment-ratio matching and joint-distribution surrogating. Systematic simulations show that the new calculation methods are more accurate under multivariate Gaussian, and more robust under the generalized linear model and the multivariate t-distribution. The applications of the GFisher and the new p-value calculation methods are demonstrated by a gene-based single nucleotide polymorphism (SNP)-set association study. Relevant computation has been implemented to an R package GFisher available on the Comprehensive R Archive Network.
more » « less
Distributions and Power of Optimal Signal-Detection Statistics in Finite Case

https://doi.org/10.1109/TSP.2020.2967179

Zhang, Hong; Jin, Jiashun; Wu, Zheyang (January 2020, IEEE Transactions on Signal Processing)
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records