NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A data-adaptive Bayesian regression approach for polygenic risk prediction

https://doi.org/10.1093/bioinformatics/btac024

Song, Shuang; Hou, Lin; Liu, Jun S.; Schwartz, ed., Russell (January 2022, Bioinformatics)

Abstract MotivationPolygenic risk score (PRS) has been widely exploited for genetic risk prediction due to its accuracy and conceptual simplicity. We introduce a unified Bayesian regression framework, NeuPred, for PRS construction, which accommodates varying genetic architectures and improves overall prediction accuracy for complex diseases by allowing for a wide class of prior choices. To take full advantage of the framework, we propose a summary-statistics-based cross-validation strategy to automatically select suitable chromosome-level priors, which demonstrates a striking variability of the prior preference of each chromosome, for the same complex disease, and further significantly improves the prediction accuracy. ResultsSimulation studies and real data applications with seven disease datasets from the Wellcome Trust Case Control Consortium cohort and eight groups of large-scale genome-wide association studies demonstrate that NeuPred achieves substantial and consistent improvements in terms of predictive r2 over existing methods. In addition, NeuPred has similar or advantageous computational efficiency compared with the state-of-the-art Bayesian methods. Availability and implementationThe R package implementing NeuPred is available at https://github.com/shuangsong0110/NeuPred. Supplementary informationSupplementary data are available at Bioinformatics online.
more » « less
PhyloAcc-GT: A Bayesian Method for Inferring Patterns of Substitution Rate Shifts on Targeted Lineages Accounting for Gene Tree Discordance

https://doi.org/10.1093/molbev/msad195

Yan, Han; Hu, Zhirui; Thomas, Gregg W; Edwards, Scott V; Sackton, Timothy B; Liu, Jun S (September 2023, Molecular Biology and Evolution)
Nielsen, Rasmus (Ed.)
Abstract An important goal of evolutionary genomics is to identify genomic regions whose substitution rates differ among lineages. For example, genomic regions experiencing accelerated molecular evolution in some lineages may provide insight into links between genotype and phenotype. Several comparative genomics methods have been developed to identify genomic accelerations between species, including a Bayesian method called PhyloAcc, which models shifts in substitution rate in multiple target lineages on a phylogeny. However, few methods consider the possibility of discordance between the trees of individual loci and the species tree due to incomplete lineage sorting, which might cause false positives. Here, we present PhyloAcc-GT, which extends PhyloAcc by modeling gene tree heterogeneity. Given a species tree, we adopt the multispecies coalescent model as the prior distribution of gene trees, use Markov chain Monte Carlo (MCMC) for inference, and design novel MCMC moves to sample gene trees efficiently. Through extensive simulations, we show that PhyloAcc-GT outperforms PhyloAcc and other methods in identifying target lineage-specific accelerations and detecting complex patterns of rate shifts, and is robust to specification of population size parameters. PhyloAcc-GT is usually more conservative than PhyloAcc in calling convergent rate shifts because it identifies more accelerations on ancestral than on terminal branches. We apply PhyloAcc-GT to two examples of convergent evolution: flightlessness in ratites and marine mammal adaptations, and show that PhyloAcc-GT is a robust tool to identify shifts in substitution rate associated with specific target lineages while accounting for incomplete lineage sorting.
more » « less
Full Text Available
Optimal Classification for Functional Data

https://doi.org/10.5705/ss.202022.0057

Wang, Shuoyang; Shang, Zuofeng; Cao, Guanqun; Liu, Jun S. (January 2024, Statistica Sinica)

Full Text Available
Differentiable Particle Filters with Smoothly Jittered Resampling

https://doi.org/10.5705/ss.202022.0256

Li, Yichao; Wang, Wenshuo; Deng, Ke; Liu, Jun S. (January 2024, Statistica Sinica)

Full Text Available
Generative Multi-purpose Sampler for Weighted M-estimation

https://doi.org/10.1080/10618600.2023.2292668

Shin, Minsuk; Wang, Shijie; Liu, Jun S. (December 2023, Journal of Computational and Graphical Statistics)

Full Text Available
Varying Coefficient Model via Adaptive Spline Fitting

https://doi.org/10.1080/10618600.2023.2267616

Wang, Xufei; Jiang, Bo; Liu, Jun S. (October 2023, Journal of Computational and Graphical Statistics)

Full Text Available
Convergence rate of multiple-try Metropolis independent sampler

https://doi.org/10.1007/s11222-023-10241-3

Yang, Xiaodong; Liu, Jun S. (August 2023, Statistics and Computing)

Abstract The multiple-try Metropolis method is an interesting extension of the classical Metropolis–Hastings algorithm. However, theoretical understanding about its usefulness and convergence behavior is still lacking. We here derive the exact convergence rate for the multiple-try Metropolis Independent sampler (MTM-IS) via an explicit eigen analysis. As a by-product, we prove that an naive application of the MTM-IS is less efficient than using the simpler approach of “thinned” independent Metropolis–Hastings method at the same computational cost. We further explore more variants and find it possible to design more efficient algorithms by applying MTM to part of the target distribution or creating correlated multiple trials.
more » « less
Full Text Available
Rejoinder: A Scale-free Approach for False Discovery Rate Control in Generalized Linear Models

https://doi.org/10.1080/01621459.2023.2245686

Dai, Chenguang; Lin, Buyu; Xing, Xin; Liu, Jun S. (July 2023, Journal of the American Statistical Association)

Full Text Available
A Scale-Free Approach for False Discovery Rate Control in Generalized Linear Models

https://doi.org/10.1080/01621459.2023.2165930

Dai, Chenguang; Lin, Buyu; Xing, Xin; Liu, Jun S. (July 2023, Journal of the American Statistical Association)

Full Text Available
Bayesian bi-clustering methods with applications in computational biology

https://doi.org/10.1214/22-AOAS1622

Yan, Han; Wu, Jiexing; Li, Yang; Liu, Jun S. (December 2022, The Annals of Applied Statistics)

Full Text Available

« Prev Next »

Search for: All records