skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Effects of hybridization and gene flow on gene co-expression networks
Abstract Gene co-expression networks are a widely used tool for summarizing transcriptomic variation between individuals, and for inferring the transcriptional regulatory pathways that mediate genotype–phenotype relationships. However, these co-expression networks must be interpreted with caution, as they can arise from multiple processes. Here, we investigate one such process, using simulations to demonstrate that hybridization and gene flow between populations can greatly modify co-expression networks. Admixture between populations produces correlated expression between genes experiencing linkage disequilibrium. This correlated expression does not reflect functional relationships between genes but rather depends on migration rates and physical linkage on chromosomes. Given the prevalence of gene flow and hybridization between divergent populations in nature, these introgression effects likely represent a significant force in network evolution, even in populations where hybridization is historical rather than contemporary. These findings emphasize the critical importance of considering both evolutionary history and genomic architecture when analyzing gene co-expression networks in natural populations.  more » « less
Award ID(s):
2133740 2243076 2207980
PAR ID:
10597607
Author(s) / Creator(s):
; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
GENETICS
Volume:
230
Issue:
2
ISSN:
1943-2631
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract The regulation of gene expression is central to many biological processes. Gene regulatory networks (GRNs) link transcription factors (TFs) to their target genes and represent maps of potential transcriptional regulation. Here, we analyzed a large number of publically available maize (Zea mays) transcriptome data sets including >6000 RNA sequencing samples to generate 45 coexpression-based GRNs that represent potential regulatory relationships between TFs and other genes in different populations of samples (cross-tissue, cross-genotype, and tissue-and-genotype samples). While these networks are all enriched for biologically relevant interactions, different networks capture distinct TF-target associations and biological processes. By examining the power of our coexpression-based GRNs to accurately predict covarying TF-target relationships in natural variation data sets, we found that presence/absence changes rather than quantitative changes in TF gene expression are more likely associated with changes in target gene expression. Integrating information from our TF-target predictions and previous expression quantitative trait loci (eQTL) mapping results provided support for 68 TFs underlying 74 previously identified trans-eQTL hotspots spanning a variety of metabolic pathways. This study highlights the utility of developing multiple GRNs within a species to detect putative regulators of important plant pathways and provides potential targets for breeding or biotechnological applications. 
    more » « less
  2. Cherry, J M (Ed.)
    Abstract The mechanisms that coordinate cellular gene expression are highly complex and intricately interconnected. Thus, it is necessary to move beyond a fully reductionist approach to understanding genetic information flow and begin focusing on the networked connections between genes that organize cellular function. Continued advancements in computational hardware, coupled with the development of gene correlation network algorithms, provide the capacity to study networked interactions between genes rather than their isolated functions. For example, gene coexpression networks are used to construct gene relationship networks using linear metrics such as Spearman or Pearson correlation. Recently, there have been tools designed to deepen these analyses by differentiating between intrinsic vs extrinsic noise within gene expression values, identifying different modules based on tissue phenotype, and capturing potential nonlinear relationships. In this report, we introduce an algorithm with a novel application of image-based segmentation modalities utilizing blob detection techniques applied for detecting bigenic edges in a gene expression matrix. We applied this algorithm called EdgeCrafting to a bulk RNA-sequencing gene expression matrix comprised of a healthy kidney and cancerous kidney data. We then compared EdgeCrafting against 4 other RNA expression analysis techniques: Weighted Gene Correlation Network Analysis, Knowledge Independent Network Construction, NetExtractor, and Differential gene expression analysis. 
    more » « less
  3. null (Ed.)
    Gene co-expression networks (GCNs) are constructed from Gene Expression Matrices (GEMs) in a bottom up approach where all gene pairs are tested for correlation within the context of the input sample set. This approach is computationally intensive for many current GEMs and may not be scalable to millions of samples. Further, traditional GCNs do not detect non-linear relationships missed by correlation tests and do not place genetic relationships in a gene expression intensity context. In this report, we propose EdgeScaping, which constructs and analyzes the pairwise gene intensity network in a holistic, top down approach where no edges are filtered. EdgeScaping uses a novel technique to convert traditional pairwise gene expression data to an image based format. This conversion not only performs feature compression, making our algorithm highly scalable, but it also allows for exploring non-linear relationships between genes by leveraging deep learning image analysis algorithms. Using the learned embedded feature space we implement a fast, efficient algorithm to cluster the entire space of gene expression relationships while retaining gene expression intensity. Since EdgeScaping does not eliminate conventionally noisy edges, it extends the identification of co-expression relationships beyond classically correlated edges to facilitate the discovery of novel or unusual expression patterns within the network. We applied EdgeScaping to a human tumor GEM to identify sets of genes that exhibit conventional and non-conventional interdependent non-linear behavior associated with brain specific tumor sub-types that would be eliminated in conventional bottom-up construction of GCNs. Edgescaping source code is available at https://github.com/bhusain/EdgeScaping under the MIT license. 
    more » « less
  4. Abstract Identifying genes that interact to confer a biological function to an organism is one of the main goals of functional genomics. High‐throughput technologies for assessment and quantification of genome‐wide gene expression patterns have enabled systems‐level analyses to infer pathways or networks of genes involved in different functions under many different conditions. Here, we leveraged the publicly available, information‐rich RNA‐Seq datasets of the model plantArabidopsis thalianato construct a gene co‐expression network, which was partitioned into clusters or modules that harbor genes correlated by expression. Gene ontology and pathway enrichment analyses were performed to assess functional terms and pathways that were enriched within the different gene modules. By interrogating the co‐expression network for genes in different modules that associate with a gene of interest, diverse functional roles of the gene can be deciphered. By mapping genes differentially expressing under a certain condition inArabidopsisonto the co‐expression network, we demonstrate the ability of the network to uncover novel genes that are likely transcriptionally active but prone to be missed by standard statistical approaches due to their falling outside of the confidence zone of detection. To our knowledge, this is the firstA. thalianaco‐expression network constructed using the entire mRNA‐Seq datasets (>20,000) available at the NCBI SRA database. The developed network can serve as a useful resource for theArabidopsisresearch community to interrogate specific genes of interest within the network, retrieve the respective interactomes, decipher gene modules that are transcriptionally altered under certain condition or stage, and gain understanding of gene functions. 
    more » « less
  5. Gene expression has a key role in reproductive isolation, and studies of hybrid gene expression have identified mechanisms causing hybrid sterility. Here, we review the evidence for altered gene expression following hybridization and outline the mechanisms shown to contribute to altered gene expression in hybrids. Transgressive gene expression, transcending that of both parental species, is pervasive in early generation sterile hybrids, but also frequently observed in viable, fertile hybrids. We highlight studies showing that hybridization can result in transgressive gene expression, also in established hybrid lineages or species. Such extreme patterns of gene expression in stabilized hybrid taxa suggest that altered hybrid gene expression may result in hybridization‐derived evolutionary novelty. We also conclude that while patterns of misexpression in hybrids are well documented, the understanding of the mechanisms causing misexpression is lagging. We argue that jointly assessing differences in cell composition and cell‐specific changes in gene expression in hybrids, in addition to assessing changes in chromatin and methylation, will significantly advance our understanding of the basis of altered gene expression. Moreover, uncovering to what extent evolution of gene expression results in altered expression for individual genes, or entire networks of genes, will advance our understanding of how selection moulds gene expression. Finally, we argue that jointly studying the dual roles of altered hybrid gene expression, serving both as a mechanism for reproductive isolation and as a substrate for hybrid ecological adaptation, will lead to significant advances in our understanding of the evolution of gene expression. 
    more » « less