Abstract The discovery of cancer driver mutations is a fundamental goal in cancer research. While many cancer driver mutations have been discovered in the protein-coding genome, research into potential cancer drivers in the non-coding regions showed limited success so far. Here, we present a novel comprehensive framework Dr.Nod for detection of non-coding cis-regulatory candidate driver mutations that are associated with dysregulated gene expression using tissue-matched enhancer-gene annotations. Applying the framework to data from over 1500 tumours across eight tissues revealed a 4.4-fold enrichment of candidate driver mutations in regulatory regions of known cancer driver genes. An overarching conclusion that emerges is that the non-coding driver mutations contribute to cancer by significantly altering transcription factor binding sites, leading to upregulation of tissue-matched oncogenes and down-regulation of tumour-suppressor genes. Interestingly, more than half of the detected cancer-promoting non-coding regulatory driver mutations are over 20 kb distant from the cancer-associated genes they regulate. Our results show the importance of tissue-matched enhancer-gene maps, functional impact of mutations, and complex background mutagenesis model for the prediction of non-coding regulatory drivers. In conclusion, our study demonstrates that non-coding mutations in enhancers play a previously underappreciated role in cancer and dysregulation of clinically relevant target genes.
more »
« less
Decoding Non-coding Variants: Recent Approaches to Studying Their Role in Gene Regulation and Human Diseases
Genome-wide association studies (GWAS) have mapped over 90% of disease- and quantitative-trait-associated variants within the non-coding genome. Non-coding regulatory DNA (e.g., promoters and enhancers) and RNA (e.g., 5′ and 3′ UTRs and splice sites) are essential in regulating temporal and tissue-specific gene expressions. Non-coding variants can potentially impact the phenotype of an organism by altering the molecular recognition of the cis-regulatory elements, leading to gene dysregulation. However, determining causality between non-coding variants, gene regulation, and human disease has remained challenging. Experimental and computational methods have been developed to understand the molecular mechanism involved in non-coding variant interference at the transcriptional and post-transcriptional levels. This review discusses recent approaches to evaluating disease-associated single-nucleotide variants (SNVs) and determines their impact on transcription factor (TF) binding, gene expression, chromatin conformation, post-transcriptional regulation, and translation.
more »
« less
- Award ID(s):
- 1231306
- PAR ID:
- 10587049
- Publisher / Repository:
- Frontiers in Bioscience-Scholar
- Date Published:
- Journal Name:
- Frontiers in Bioscience-Scholar
- Volume:
- 16
- Issue:
- 1
- ISSN:
- 1945-0516
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Almost all regulation of gene expression in eukaryotic genomes is mediated by the action of distant non-coding transcriptional enhancers upon proximal gene promoters. Enhancer locations cannot be accurately predicted bioinformatically because of the absence of a defined sequence code, and thus functional assays are required for their direct detection. Here we used a massively parallel reporter assay, Self-Transcribing Active Regulatory Region sequencing (STARR-seq), to generate the first comprehensive genome-wide map of enhancers in Anopheles coluzzii , a major African malaria vector in the Gambiae species complex. The screen was carried out by transfecting reporter libraries created from the genomic DNA of 60 wild A. coluzzii from Burkina Faso into A. coluzzii 4a3A cells, in order to functionally query enhancer activity of the natural population within the homologous cellular context. We report a catalog of 3,288 active genomic enhancers that were significant across three biological replicates, 74% of them located in intergenic and intronic regions. The STARR-seq enhancer screen is chromatin-free and thus detects inherent activity of a comprehensive catalog of enhancers that may be restricted in vivo to specific cell types or developmental stages. Testing of a validation panel of enhancer candidates using manual luciferase assays confirmed enhancer function in 26 of 28 (93%) of the candidates over a wide dynamic range of activity from two to at least 16-fold activity above baseline. The enhancers occupy only 0.7% of the genome, and display distinct composition features. The enhancer compartment is significantly enriched for 15 transcription factor binding site signatures, and displays divergence for specific dinucleotide repeats, as compared to matched non-enhancer genomic controls. The genome-wide catalog of A. coluzzii enhancers is publicly available in a simple searchable graphic format. This enhancer catalogue will be valuable in linking genetic and phenotypic variation, in identifying regulatory elements that could be employed in vector manipulation, and in better targeting of chromosome editing to minimize extraneous regulation influences on the introduced sequences. Importance: Understanding the role of the non-coding regulatory genome in complex disease phenotypes is essential, but even in well-characterized model organisms, identification of regulatory regions within the vast non-coding genome remains a challenge. We used a large-scale assay to generate a genome wide map of transcriptional enhancers. Such a catalogue for the important malaria vector, Anopheles coluzzii , will be an important research tool as the role of non-coding regulatory variation in differential susceptibility to malaria infection is explored and as a public resource for research on this important insect vector of disease.more » « less
-
In previous work we reconstructed the entire transcriptional network for all 2,418 clock-associated genes in the model filamentous fungus, N. crassa. Several authors have suggested that there is extensive post-transcriptional control in the genome-wide clock network (IEEE 3: 27, 2015). Here we have successfully reconstructed the entire clock network in N. crassa with a Variable Topology Ensemble Method (VTENS), assigning each clock-associated gene to the regulation of one or more of 5 transcription factors as well as to 6 RNA operons. The resulting network provides a unifying framework to explore the clock’s linkage to metabolism through post-transcriptional regulation, in which ~850 genes are predicted to fall under the regulatory control of an RNA operon. A unique feature of all of the RNA operons inferred is their functional connection to genes connected to the ribosome. We have been successful in distinguishing several hypotheses about regulatory topologies of the clock network through protein profiling of the regulators.more » « less
-
Abstract CRISPR/Cas (clustered regularly interspaced short palindromic repeats/CRISPR‐associated protein)‐mediated genome editing has revolutionized fundamental research and plant breeding. Beyond gene editing, CRISPR/Cas systems have been repurposed as a platform for programmable transcriptional regulation. Catalytically inactive Cas variants (dCas), when fused with transcriptional activation domains, allow for specific activation of any target gene in the genome without inducing DNA double‐strand breaks. CRISPR activation enables simultaneous activation of multiple genes, holding great promise in the identification of gene regulatory networks and rewiring of metabolic pathways. Here, we describe a simple protocol for constructing a dCas9‐mediated multiplexed gene activation system based on the CRISPR‐Act3.0 system. The resulting vectors are tested in rice protoplasts. © 2022 Wiley Periodicals LLC. Basic Protocol 1: sgRNA design and construction of CRISPR‐Act3.0 vectors for multiplexed gene activation Basic Protocol 2: Determining the activation efficiency of CRISPR‐Act3.0 vectors using rice protoplastsmore » « less
-
Plants have evolved variable phenotypic plasticity to counteract different pathogens and pests during immobile life. Microbial infection invokes multiple layers of host immune responses, and plant gene expression is swiftly and precisely reprogramed at both the transcriptional level and post-transcriptional level. Recently, the importance of epigenetic regulation in response to biotic stresses has been recognized. Changes in DNA methylation, histone modification, and chromatin structures have been observed after microbial infection. In addition, epigenetic modifications may be preserved as transgenerational memories to allow the progeny to better adapt to similar environments. Epigenetic regulation involves various regulatory components, including non-coding small RNAs, DNA methylation, histone modification, and chromatin remodelers. The crosstalk between these components allows precise fine-tuning of gene expression, giving plants the capability to fight infections and tolerant drastic environmental changes in nature. Fully unraveling epigenetic regulatory mechanisms could aid in the development of more efficient and eco-friendly strategies for crop protection in agricultural systems. In this review, we discuss the recent advances on the roles of epigenetic regulation in plant biotic stress responses.more » « less
An official website of the United States government

