Title: High-dimension to high-dimension screening for detecting genome-wide epigenetic and noncoding RNA regulators of gene expression
Abstract
Motivation: Advances in high-throughput technology have made it possible to characterize a wide variety of epigenetic modifications and noncoding RNAs across the genome that are involved in disease pathogenesis through the regulation of gene expression. The high dimensionality of both the epigenetic/noncoding RNA data and the gene expression data makes it challenging to identify the important regulators of genes. Conducting a univariate test for each possible regulator–gene pair carries a serious multiple-comparison burden, and directly applying regularization methods to select regulator–gene pairs is computationally infeasible. Applying fast screening to reduce the dimension before regularization is more efficient and stable than applying regularization methods alone.
Results: We propose a novel screening method based on robust partial correlation to detect epigenetic and noncoding RNA regulators of gene expression over the whole genome, a problem that involves both high-dimensional predictors and high-dimensional responses. Compared to existing screening methods, our method is conceptually innovative in that it reduces the dimension of both predictor and response, and screens at both the node level (regulators or genes) and the edge level (regulator–gene pairs). We develop data-driven procedures to determine the conditional sets and the optimal screening threshold, and implement a fast iterative algorithm. Simulations and applications to long noncoding RNA and microRNA regulation in kidney cancer and to DNA methylation regulation in glioblastoma multiforme illustrate the validity and advantage of our method.
Availability and implementation: The R package, related source code and real datasets used in this article are provided at https://github.com/kehongjie/rPCor.
Supplementary information: Supplementary data are available at Bioinformatics online.
Award ID(s):
2113568
PAR ID:
10370494
Publisher / Repository:
Oxford University Press
Journal Name:
Bioinformatics
Volume:
38
Issue:
17
ISSN:
1367-4803
Size(s):
p. 4078-4087
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract. Background: Alternative splicing of precursor mRNAs serves as a crucial mechanism to enhance gene expression plasticity for organismal adaptation. However, the precise regulation and function of alternative splicing in plant immune gene regulation remain elusive. Results: Here, by deploying in-depth transcriptome profiling with deep genome coverage coupled with differential expression, differential alternative splicing, and differential transcript usage analysis, we reveal profound and dynamic changes in alternative splicing following treatment with microbial pattern flg22 peptides in Arabidopsis. Our findings highlight RNA polymerase II C-terminal domain phosphatase-like 3 (CPL3) as a key regulator of alternative splicing, preferentially influencing the splicing patterns of defense genes rather than their expression levels. CPL3 mediates the production of a flg22-induced alternative splicing variant, diacylglycerol kinase 5α (DGK5α), which differs from the canonical DGK5β in its interaction with the upstream kinase BIK1 and subsequent phosphorylation, resulting in reduced flg22-triggered production of phosphatidic acid and reactive oxygen species. Furthermore, our functional analysis suggests that DGK5β, but not DGK5α, contributes to plant resistance against virulent and avirulent bacterial infections. Conclusions: These findings underscore the role of CPL3 in modulating alternative splicing dynamics of defense genes and DGK5 isoform-mediated phosphatidic acid homeostasis, shedding light on the intricate mechanisms underlying plant immune gene regulation.
  2. Abstract. Background: Genetic and epigenetic perturbation of cis-regulatory sequences can shift patterns of gene expression and result in novel phenotypes. Phased genome assemblies now enable the local dissection of linkages between cis-regulatory sequences, including their epigenetic state, and allele-specific gene expression to further characterize gene regulation and resulting phenotypes in heterozygous genomes. Results: We assembled a locally phased genome for a mandarin hybrid named ‘Fairchild’ to explore the molecular signatures of allele-specific gene expression. With local genome phasing, genes with allele-specific expression were paired with haplotype-specific chromatin states, including levels of chromatin accessibility, histone modifications, and DNA methylation. We found that 30% of variation in allele-specific expression could be attributed to haplotype-associated factors, with allelic levels of chromatin accessibility and three histone modifications in gene bodies having the most influence. Structural variants in promoter regions were also associated with allele-specific expression, including specific enrichments of hAT and MULE-MuDR DNA transposon sequences. Integration of haplotype-resolved genetic and epigenetic landscapes with high-throughput phenotypic analysis of fruit traits in a panel of 154 accessions with mandarin and pummelo ancestry revealed that trait-associated variants were enriched in regions of open chromatin. Mining of trait-associated variants uncovered a Gypsy retrotransposon insertion in a gene that regulates potassium transport and may contribute to the reduction in fruit size that is observed in mandarins. Conclusions: Using a locally phased assembly of a heterozygous cultivar of citrus, we dissected the interplay between genetic variants and molecular phenotypes to reveal cis-regulatory sequences with potential functional effects on phenotypes relevant for genetic improvement.
  3. Abstract. How the noncoding genome affects cellular functions is a key biological question. A particular challenge is to distinguish the effects of noncoding DNA elements from long noncoding RNAs (lncRNAs) that coincide at the same loci. Here, we identified the flowering-associated intergenic lncRNA (FLAIL) in Arabidopsis through early flowering flail mutants. Expression of FLAIL RNA from a different chromosomal location in combination with strand-specific RNA knockdown characterized FLAIL as a trans-acting RNA molecule. FLAIL directly binds to differentially expressed target genes that control flowering via RNA–DNA interactions through conserved sequence motifs. FLAIL interacts with protein and RNA components of the spliceosome to affect target mRNA expression through co-transcriptional alternative splicing (AS) and linked chromatin regulation. In the absence of FLAIL, splicing defects at the direct FLAIL target flowering gene LACCASE 8 (LAC8) correlated with reduced mRNA expression. Double mutant analyses support a model where FLAIL-mediated splicing of LAC8 promotes its mRNA expression and represses flowering. Our study suggests lncRNAs as accessory components of the spliceosome that regulate AS and gene expression to impact organismal development.
  4. Foundational models of transcriptional regulation involve the assembly of protein complexes at DNA elements associated with specific genes. These assemblies, which can include transcription factors, cofactors, RNA polymerase, and various chromatin regulators, form dynamic spatial compartments that contribute to both gene regulation and local genome architecture. This DNA-protein-centric view has been modified with recent evidence that RNA molecules have important roles to play in gene regulation and genome structure. Here, we discuss evidence that gene regulation by RNA occurs at multiple levels that include assembly of transcriptional complexes and genome compartments, feedback regulation of active genes, silencing of genes, and control of protein kinases. We thus provide an RNA-centric view of transcriptional regulation that must reside alongside the more traditional DNA-protein-centric perspectives on gene regulation and genome architecture. 
  5. The emergence of and transitions between distinct phenotypes in isogenic cells can be attributed to the intricate interplay of epigenetic marks, external signals, and gene-regulatory elements. These elements include chromatin remodelers, histone modifiers, transcription factors, and regulatory RNAs. Mathematical models known as gene-regulatory networks (GRNs) are an increasingly important tool to unravel the workings of such complex networks. In such models, epigenetic factors are usually proposed to act on the chromatin regions directly involved in the expression of relevant genes. However, it has been well-established that these factors operate globally and compete with each other for targets genome-wide. Therefore, a perturbation of the activity of a regulator can redistribute epigenetic marks across the genome and modulate the levels of competing regulators. In this paper, we propose a conceptual and mathematical modeling framework that incorporates both local and global competition effects between antagonistic epigenetic regulators, in addition to local transcription factors, and show the counterintuitive consequences of such interactions. We apply our approach to recent experimental findings on the epithelial–mesenchymal transition (EMT). We show that it can explain the puzzling experimental data, as well as provide verifiable predictions. 