Title: Identification of significant gene expression changes in multiple perturbation experiments using knockoffs
Abstract

Large-scale multiple perturbation experiments can yield a more detailed understanding of the molecular pathways that respond to genetic and environmental changes. A key question in these studies is which gene expression changes are important for the response to the perturbation. This problem is challenging because (i) the functional form of the nonlinear relationship between gene expression and the perturbation is unknown and (ii) identifying the most important genes is a high-dimensional variable selection problem. To address these challenges, we present a method based on the model-X knockoffs framework and deep neural networks to identify significant gene expression changes in multiple perturbation experiments. This approach makes no assumptions about the functional form of the dependence between the responses and the perturbations, and it provides finite-sample false discovery rate control for the selected set of important gene expression responses. We apply this approach to data sets from the Library of Integrated Network-Based Cellular Signatures, a National Institutes of Health Common Fund program that catalogs how human cells globally respond to chemical, genetic and disease perturbations. We identified important genes whose expression is directly modulated in response to perturbation with anthracycline, vorinostat, trichostatin-a, geldanamycin and sirolimus. We compare the sets of important genes that respond to these small molecules to identify co-responsive pathways. Identifying which genes respond to specific perturbation stressors can provide a better understanding of the underlying mechanisms of disease and advance the identification of new drug targets.
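The false discovery rate control mentioned in the abstract comes from the selection step of the knockoff filter. As a rough illustration only, the sketch below applies the standard knockoff+ threshold to a vector of importance statistics `W`, where a large positive `W[j]` suggests that gene `j` matters more than its knockoff copy. How the statistics are computed (in the paper, from a deep neural network) is not shown; the function names and the toy values of `W` are assumptions made for this example and are not taken from the paper.

```python
import numpy as np

def knockoff_plus_threshold(W, q):
    """Return the knockoff+ threshold for statistics W at target FDR level q."""
    W = np.asarray(W, dtype=float)
    # Candidate thresholds are the distinct nonzero magnitudes of W, ascending.
    candidates = np.unique(np.abs(W[W != 0]))
    for t in candidates:
        # Estimated false discovery proportion at threshold t:
        # (1 + #{j: W_j <= -t}) / max(1, #{j: W_j >= t}).
        fdp_hat = (1 + np.sum(W <= -t)) / max(1, np.sum(W >= t))
        if fdp_hat <= q:
            return float(t)
    # No threshold meets the target level, so nothing is selected.
    return float("inf")

def knockoff_select(W, q):
    """Return indices of features with W_j at or above the knockoff+ threshold."""
    W = np.asarray(W, dtype=float)
    tau = knockoff_plus_threshold(W, q)
    return np.flatnonzero(W >= tau)

# Hypothetical importance statistics for ten genes (illustrative values only).
W_example = [5.0, 4.0, 3.0, 2.5, 2.0, -1.0, 0.5, -0.2, 1.5, 6.0]
selected = knockoff_select(W_example, q=0.2)
print(selected)
```

The `1 +` in the numerator is what distinguishes knockoff+ from the plain knockoff threshold; it makes the procedure more conservative, and it is the variant for which the finite-sample guarantee holds.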

 
NSF-PAR ID:
10400977
Publisher / Repository:
Oxford University Press
Journal Name:
Briefings in Bioinformatics
Volume:
24
Issue:
2
ISSN:
1467-5463
Sponsoring Org:
National Science Foundation
More Like this
  1. Bomblies, K (Ed.)
    The gene balance hypothesis proposes that selection acts on the dosage (i.e. copy number) of genes within dosage-sensitive portions of networks, pathways, and protein complexes to maintain balanced stoichiometry of interacting proteins, because perturbations to stoichiometric balance can result in reduced fitness. This selection has been called dosage balance selection. Dosage balance selection is also hypothesized to constrain expression responses to dosage changes, making dosage-sensitive genes (those encoding members of interacting proteins) experience more similar expression changes. In allopolyploids, where whole-genome duplication involves hybridization of diverged lineages, organisms often experience homoeologous exchanges that recombine, duplicate, and delete homoeologous regions of the genome and alter the expression of homoeologous gene pairs. Although the gene balance hypothesis makes predictions about the expression response to homoeologous exchanges, they have not been empirically tested. We used genomic and transcriptomic data from 6 resynthesized, isogenic Brassica napus lines over 10 generations to identify homoeologous exchanges, analyzed expression responses, and tested for patterns of genomic imbalance. Groups of dosage-sensitive genes had less variable expression responses to homoeologous exchanges than dosage-insensitive genes, a sign that their relative dosage is constrained. This difference was absent for homoeologous pairs whose expression was biased toward the B. napus A subgenome. Finally, the expression response to homoeologous exchanges was more variable than the response to whole-genome duplication, suggesting homoeologous exchanges create genomic imbalance. These findings expand our knowledge of the impact of dosage balance selection on genome evolution and potentially connect patterns in polyploid genomes over time, from homoeolog expression bias to duplicate gene retention.

     
  2. Abstract

    In the past century, recently emerged infectious diseases have become major drivers of species decline and extinction. The fungal disease chytridiomycosis has devastated many amphibian populations and exacerbated the amphibian conservation crisis. Biologists are beginning to understand what host traits contribute to disease susceptibility, but more work is needed to determine why some species succumb to chytridiomycosis while others do not. We conducted an integrative laboratory experiment to examine how two toad species respond to infection with the pathogen Batrachochytrium dendrobatidis in a controlled environment. We selected two toad species thought to differ in susceptibility – Bufo marinus (an invasive and putatively resistant species) and Bufo boreas (an endangered and putatively susceptible species). We measured infection intensity, body weight, histological changes and genome-wide gene expression using a custom assay developed from transcriptome sequencing. Our results confirmed that the two species differ in susceptibility, with the more susceptible species, B. boreas, showing higher infection intensities, loss in body weight, more dramatic histological changes and larger perturbations in gene expression. We found key differences in skin expression responses in multiple pathways, including upregulation of skin integrity‐related genes in the resistant B. marinus. Together, our results show intrinsic differences in host response between related species, which are likely to be important in explaining variation in response to a deadly emerging pathogen in wild populations. Our study also underscores the importance of understanding differences among host species to better predict disease outcomes and reveal generalities about host response to emerging infectious diseases of wildlife.

     
  3. Abstract

    Goss's wilt, caused by the Gram-positive actinobacterium Clavibacter nebraskensis, is an important bacterial disease of maize. The molecular and genetic mechanisms of resistance to the bacterium, or, in general, Gram-positive bacteria causing plant diseases, remain poorly understood. Here, we examined the genetic basis of Goss's wilt through differential gene expression, standard genome-wide association mapping (GWAS), extreme phenotype (XP) GWAS using highly resistant (R) and highly susceptible (S) lines, and quantitative trait locus (QTL) mapping using 3 bi-parental populations, identifying 11 disease association loci. Three loci were validated using near-isogenic lines or recombinant inbred lines. Our analysis indicates that Goss's wilt resistance is highly complex and major resistance genes are not commonly present. RNA sequencing of samples separately pooled from R and S lines with or without bacterial inoculation was performed, enabling identification of common and differential gene responses in R and S lines. Based on expression, in both R and S lines, the photosynthesis pathway was silenced upon infection, while stress-responsive pathways and phytohormone pathways, namely, abscisic acid, auxin, ethylene, jasmonate, and gibberellin, were markedly activated. In addition, 65 genes showed differential responses (up- or down-regulated) to infection in R and S lines. Combining genetic mapping and transcriptional data, individual candidate genes conferring Goss's wilt resistance were identified. Collectively, aspects of the genetic architecture of Goss's wilt resistance were revealed, providing foundational data for mechanistic studies.

     
  4. Abstract

    The endoplasmic reticulum (ER) houses sensors that respond to environmental stress and underlie plants' adaptive responses. These sensors transduce signals that lead to changes in nuclear gene expression. The ER-to-nuclear signaling pathways are primarily attributed to the unfolded protein response (UPR) and are also integrated with a wide range of development, hormone, immune, and stress signaling pathways. Understanding the role of the UPR in signaling network mechanisms that associate with particular phenotypes is crucially important. While UPR‐associated genes are the subject of ongoing investigations in a few model plant systems, most remain poorly annotated, hindering the identification of candidates across plant species. This open‐source curated database provides a centralized resource of peer-reviewed knowledge of ER-to-nuclear signaling pathways for the plant community. We provide a UPRome interactive viewer for users to navigate through the pathways and to access annotated information. The plant ER UPRome website is located at http://uprome.tamu.edu. We welcome contributions from researchers studying the ER UPR to incorporate additional genes into the database through the “contact us” page.

     
  5. Extremophytes are naturally selected to survive environmental stresses, but the scarcity of genetic resources developed for them with spatiotemporal resolution limits their use in stress biology. Schrenkiella parvula is one of the leading extremophyte models, with initial molecular genomic resources developed to study its tolerance mechanisms to high salinity. Here we present a transcriptome atlas for S. parvula with subsequent analyses to highlight its diverse gene expression networks associated with salt responses. We included spatiotemporal expression profiles, expression specificity of each gene, and co-expression and functional gene networks representing 115 transcriptomes sequenced from 35 tissue and developmental stages, examining their responses before and after 27 salt treatments in our current study. The highest number of tissue-preferentially expressed genes was found in seeds and siliques, while genes in seedlings showed the broadest expression profiles among developmental stages. Seedlings had the highest magnitude of overall transcriptomic responses to salinity compared to mature tissues and developmental stages. Differentially expressed genes in response to salt were largely mutually exclusive but shared common stress response pathways spanning tissues and developmental stages. Our foundational dataset created for S. parvula, representing a stress-adapted wild plant, lays the groundwork for future functional, comparative, and evolutionary studies using extremophytes aiming to uncover novel stress tolerance mechanisms.