skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on January 8, 2026

Title: Functional Varying-Index Coefficients Model for Dynamic Synergistic Gene–Environment Interactions
Human complex diseases are affected by both genetic and environmental factors. When multiple environmental risk factors are present, the interaction effect between a gene and the environmental mixture can be larger than the addition of individual interactions, resulting in the so-called synergistic gene–environment (GxE) interactions. Existing literature has shown the power of synergistic gene-environment interaction analysis with cross-sectional traits. In this work, we propose a functional varying index coefficient model for longitudinal traits together with multiple longitudinal environmental risk factors and assess how the genetic effects on a longitudinal disease trait are nonlinearly modified by a mixture of environmental influences. We derive an estimation procedure for the nonparametric functional varying index coefficients under the quadratic inference function and penalized spline framework. We evaluate some theoretical properties such as estimation consistency and asymptotic normality of the estimates. We further propose a hypothesis testing procedure to assess the significance of the synergistic GxE effect. The performance of the estimation and testing procedure is evaluated through Monte Carlo simulation studies. Finally, the utility of the method is illustrated by a real dataset from a pain sensitivity study in which SNP effects are nonlinearly modulated by a mixture of drug dosages and other environmental variables to affect patients’ blood pressure and heart rate.  more » « less
Award ID(s):
2212928
PAR ID:
10580374
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
Springer
Date Published:
Journal Name:
Statistics in Biosciences
ISSN:
1867-1764
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Roozbeh, Mahdi (Ed.)
    The genetic basis of complex traits involves the function of many genes with small effects as well as complex gene-gene and gene-environment interactions. As one of the major players in complex diseases, the role of gene-environment interactions has been increasingly recognized. Motivated by epidemiology studies to evaluate the joint effect of environmental mixtures, we developed a functional varying-index coefficient model (FVICM) to assess the combined effect of environmental mixtures and their interactions with genes, under a longitudinal design with quantitative traits. Built upon the previous work, we extend the FVICM model to accommodate binary longitudinal traits through the development of a generalized functional varying-index coefficient model (gFVICM). This model examines how the genetic effects on a disease trait are nonlinearly influenced by a combination of environmental factors. We derive an estimation procedure for the varying-index coefficient functions using quadratic inference functions combined with penalized splines. A hypothesis testing procedure is proposed to evaluate the significance of the nonparametric index functions. Extensive Monte Carlo simulations are conducted to evaluate the performance of the method under finite samples. The utility of the method is further demonstrated through a case study with a pain sensitivity dataset. SNPs were found to have their effects on blood pressure nonlinearly influenced by a combination of environmental factors. 
    more » « less
  2. The explosion of biobank data offers unprecedented opportunities for gene-environment interaction (GxE) studies of complex diseases because of the large sample sizes and the rich collection in genetic and non-genetic information. However, the extremely large sample size also introduces new computational challenges in G×E assessment, especially for set-based G×E variance component (VC) tests, which are a widely used strategy to boost overall G×E signals and to evaluate the joint G×E effect of multiple variants from a biologically meaningful unit (e.g., gene). In this work, we focus on continuous traits and present SEAGLE, a S calable E xact A l G orithm for L arge-scale set-based G× E tests, to permit G×E VC tests for biobank-scale data. SEAGLE employs modern matrix computations to calculate the test statistic and p -value of the GxE VC test in a computationally efficient fashion, without imposing additional assumptions or relying on approximations. SEAGLE can easily accommodate sample sizes in the order of 10 5 , is implementable on standard laptops, and does not require specialized computing equipment. We demonstrate the performance of SEAGLE using extensive simulations. We illustrate its utility by conducting genome-wide gene-based G×E analysis on the Taiwan Biobank data to explore the interaction of gene and physical activity status on body mass index. 
    more » « less
  3. Schwartz, Russell (Ed.)
    Abstract Motivation While gene–environment (GxE) interactions contribute importantly to many different phenotypes, detecting such interactions requires well-powered studies and has proven difficult. To address this, we combine two approaches to improve GxE power: simultaneously evaluating multiple phenotypes and using a two-step analysis approach. Previous work shows that the power to identify a main genetic effect can be improved by simultaneously analyzing multiple related phenotypes. For a univariate phenotype, two-step methods produce higher power for detecting a GxE interaction compared to single step analysis. Therefore, we propose a two-step approach to test for an overall GxE effect for multiple phenotypes. Results Using simulations we demonstrate that, when more than one phenotype has GxE effect (i.e. GxE pleiotropy), our approach offers substantial gain in power (18–43%) to detect an aggregate-level GxE effect for a multivariate phenotype compared to an analogous two-step method to identify GxE effect for a univariate phenotype. We applied the proposed approach to simultaneously analyze three lipids, LDL, HDL and Triglyceride with the frequency of alcohol consumption as environmental factor in the UK Biobank. The method identified two loci with an overall GxE effect on the vector of lipids, one of which was missed by the competing approaches. Availability and implementation We provide an R package MPGE implementing the proposed approach which is available from CRAN: https://cran.r-project.org/web/packages/MPGE/index.html Supplementary information Supplementary data are available at Bioinformatics online. 
    more » « less
  4. Wittkopp, Patricia (Ed.)
    Abstract The relationship between genotype and phenotype is often mediated by the environment. Moreover, gene-by-environment (GxE) interactions can contribute to variation in phenotypes and fitness. In the last 500 yr, house mice have invaded the Americas. Despite their short residence time, there is evidence of rapid climate adaptation, including shifts in body size and aspects of metabolism with latitude. Previous selection scans have identified candidate genes for metabolic adaptation. However, environmental variation in diet as well as GxE interactions likely impact body mass variation in wild populations. Here, we investigated the role of the environment and GxE interactions in shaping adaptive phenotypic variation. Using new locally adapted inbred strains from North and South America, we evaluated response to a high-fat diet, finding that sex, strain, diet, and the interaction between strain and diet contributed significantly to variation in body size. We also found that the transcriptional response to diet is largely strain-specific, indicating that GxE interactions affecting gene expression are pervasive. Next, we used crosses between strains from contrasting climates to characterize gene expression regulatory divergence on a standard diet and on a high-fat diet. We found that gene regulatory divergence is often condition-specific, particularly for trans-acting changes. Finally, we found evidence for lineage-specific selection on cis-regulatory variation involved in diverse processes, including lipid metabolism. Overlap with scans for selection identified candidate genes for environmental adaptation with diet-specific effects. Together, our results underscore the importance of environmental variation and GxE interactions in shaping adaptive variation in complex traits. 
    more » « less
  5. Beyond the main genetic and environmental effects, gene–environment (G–E) interactions have been demonstrated to significantly contribute to the development and progression of complex diseases. Published analyses of G–E interactions have primarily used a supervised framework to model both low-dimensional environmental factors and high-dimensional genetic factors in relation to disease outcomes. In this article, we aim to provide a selective review of methodological developments in G–E interaction analysis from a statistical perspective. The three main families of techniques are hypothesis testing, variable selection, and dimension reduction, which lead to three general frameworks: testing-based, estimation-based, and prediction-based. Linear- and nonlinear-effects analysis, fixed- and random-effects analysis, marginal and joint analysis, and Bayesian and frequentist analysis are reviewed to facilitate the conduct of interaction analysis in a wide range of situations with various assumptions and objectives. Statistical properties, computations, applications, and future directions are also discussed. 
    more » « less