skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Gene communities in co-expression networks across different tissues
With the recent availability of tissue-specific gene expression data, e.g., provided by the GTEx Consortium, there is interest in comparing gene co-expression patterns across tissues. One promising approach to this problem is to use a multilayer network analysis framework and perform multilayer community detection. Communities in gene co-expression networks reveal groups of genes similarly expressed across individuals, potentially involved in related biological processes responding to specific environmental stimuli or sharing common regulatory variations. We construct a multilayer network in which each of the four layers is an exocrine gland tissue-specific gene co-expression network. We develop methods for multilayer community detection with correlation matrix input and an appropriate null model. Our correlation matrix input method identifies five groups of genes that are similarly co-expressed in multiple tissues (a community that spans multiple layers, which we call a generalist community) and two groups of genes that are co-expressed in just one tissue (a community that lies primarily within just one layer, which we call a specialist community). We further found gene co-expression communities where the genes physically cluster across the genome significantly more than expected by chance (on chromosomes 1 and 11). This clustering hints at underlying regulatory elements determining similar expression patterns across individuals and cell types. We suggest thatKRTAP3-1,KRTAP3-3, andKRTAP3-5share regulatory elements in skin and pancreas. Furthermore, we find thatCELA3AandCELA3Bshare associated expression quantitative trait loci in the pancreas. The results indicate that our multilayer community detection method for correlation matrix input extracts biologically interesting communities of genes.  more » « less
Award ID(s):
2049947 2123284 2052720
PAR ID:
10493890
Author(s) / Creator(s):
; ; ; ;
Editor(s):
Fu, Feng
Publisher / Repository:
Plos Computational Biology
Date Published:
Journal Name:
PLOS Computational Biology
Volume:
19
Issue:
11
ISSN:
1553-7358
Page Range / eLocation ID:
e1011616
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Early lineage diversification is central to understand what mutational events drive species divergence. Particularly, gene misregulation in interspecific hybrids can inform about what genes and pathways underlie hybrid dysfunction. InDrosophilahybrids, how regulatory evolution impacts different reproductive tissues remains understudied. Here, we generate a new genome assembly and annotation inDrosophila willistoniand analyse the patterns of transcriptome divergence between two allopatrically evolvedD. willistonisubspecies, their male sterile and female fertile hybrid progeny across testis, male accessory gland, and ovary. Patterns of transcriptome divergence and modes of regulatory evolution were tissue‐specific. Despite no indication for cell‐type differences in hybrid testis, this tissue exhibited the largest magnitude of expression differentiation between subspecies and between parentals and hybrids. No evidence for anomalous dosage compensation in hybrid male tissues was detected nor was a differential role for the neo‐ and the ancestral arms of theD. willistoni Xchromosome. Compared to the autosomes, theXchromosome appeared enriched for transgressively expressed genes in testis despite being the least differentiated in expression between subspecies. Evidence for fine genome clustering of transgressively expressed genes suggests a role of chromatin structure on hybrid gene misregulation. Lastly, transgressively expressed genes in the testis of the sterile male progeny were enriched for GO terms not typically associated with sperm function, instead hinting at anomalous development of the reproductive tissue. Our thorough tissue‐level portrait of transcriptome differentiation between recently divergedD. willistonisubspecies and their hybrids provides a more nuanced view of early regulatory changes during speciation. 
    more » « less
  2. Abstract The Soybean Gene Atlas project provides a comprehensive map for understanding gene expression patterns in major soybean tissues from flower, root, leaf, nodule, seed, and shoot and stem. The RNA‐Seq data generated in the project serve as a valuable resource for discovering tissue‐specific transcriptome behavior of soybean genes in different tissues. We developed a computational pipeline for Soybean context‐specific network (SoyCSN) inference with a suite of prediction tools to analyze, annotate, retrieve, and visualize soybean context‐specific networks at both transcriptome and interactome levels. BicMix and Cross‐Conditions Cluster Detection algorithms were applied to detect modules based on co‐expression relationships across all the tissues. Soybean context‐specific interactomes were predicted by combining soybean tissue gene expression and protein–protein interaction data. Functional analyses of these predicted networks provide insights into soybean tissue specificities. For example, under symbiotic, nitrogen‐fixing conditions, the constructed soybean leaf network highlights the connection between the photosynthesis function and rhizobium–legume symbiosis. SoyCSN data and all its results are publicly available via an interactive web service within the Soybean Knowledge Base (SoyKB) athttp://soykb.org/SoyCSN. SoyCSN provides a useful web‐based access for exploring context specificities systematically in gene regulatory mechanisms and gene relationships for soybean researchers and molecular breeders. 
    more » « less
  3. Finding the network biomarkers of cancers and the analysis of cancer driving genes that are involved in these biomarkers are essential for understanding the dynamics of cancer. Clusters of genes in co-expression networks are commonly known as functional units. This work is based on the hypothesis that the dense clusters or communities in the gene co-expression networks of cancer patients may represent functional units regarding cancer initiation and progression. In this study, RNA-seq gene expression data of three cancers - Breast Invasive Carcinoma (BRCA), Colorectal Adenocarcinoma (COAD) and Glioblastoma Multiforme (GBM) - from The Cancer Genome Atlas (TCGA) are used to construct gene co-expression networks using Pearson Correlation. Six well-known community detection algorithms are applied on these networks to identify communities with five or more genes. A permutation test is performed to further mine the communities that are conserved in other cancers, thus calling them conserved communities. Then survival analysis is performed on clinical data of three cancers using the conserved community genes as prognostic co-variates. The communities that could distinguish the cancer patients between high- and low-risk groups are considered as cancer biomarkers. In the present study, 16 such network biomarkers are discovered. 
    more » « less
  4. Abstract Transcriptome-wide association studies (TWASs) integrate expression quantitative trait loci (eQTLs) studies with genome-wide association studies (GWASs) to prioritize candidate target genes for complex traits. Several statistical methods have been recently proposed to improve the performance of TWASs in gene prioritization by integrating the expression regulatory information imputed from multiple tissues, and made significant achievements in improving the ability to detect gene-trait associations. Unfortunately, most existing multi-tissue methods focus on prioritization of candidate genes, and cannot directly infer the specific functional effects of candidate genes across different tissues. Here, we propose a tissue-specific collaborative mixed model (TisCoMM) for TWASs, leveraging the co-regulation of genetic variations across different tissues explicitly via a unified probabilistic model. TisCoMM not only performs hypothesis testing to prioritize gene-trait associations, but also detects the tissue-specific role of candidate target genes in complex traits. To make full use of widely available GWASs summary statistics, we extend TisCoMM to use summary-level data, namely, TisCoMM-S2. Using extensive simulation studies, we show that type I error is controlled at the nominal level, the statistical power of identifying associated genes is greatly improved, and the false-positive rate (FPR) for non-causal tissues is well controlled at decent levels. We further illustrate the benefits of our methods in applications to summary-level GWASs data of 33 complex traits. Notably, apart from better identifying potential trait-associated genes, we can elucidate the tissue-specific role of candidate target genes. The follow-up pathway analysis from tissue-specific genes for asthma shows that the immune system plays an essential function for asthma development in both thyroid and lung tissues. 
    more » « less
  5. Abstract The circadian clock is an internal molecular oscillator and coordinates numerous physiological processes through regulation of molecular pathways. Tissue‐specific clocks connected by mobile signals have previously been found to run at different speeds inArabidopsis thalianatissues. However, tissue variation in circadian clocks in crop species is unknown. In this study, leaf and tuber global gene expression in cultivated potato under cycling and constant environmental conditions was profiled. In addition, we used a circadian‐regulated luciferase reporter construct to study tuber gene expression rhythms. Diel and circadian expression patterns were present among 17.9% and 5.6% of the expressed genes in the tuber. Over 500 genes displayed differential tissue specific diel phases. Intriguingly, few core circadian clock genes had circadian expression patterns, while all such genes were circadian rhythmic in cultivated tomato leaves. Furthermore, robust diel and circadian transcriptional rhythms were observed among detached tubers. Our results suggest alternative regulatory mechanisms and/or clock composition is present in potato, as well as the presence of tissue‐specific independent circadian clocks. We have provided the first evidence of a functional circadian clock in below‐ground storage organs, holding important implications for other storage root and tuberous crops. 
    more » « less