skip to main content


Title: Hybrid Epigenomes Reveal Extensive Local Genetic Changes to Chromatin Accessibility Contribute to Divergence in Embryonic Gene Expression Between Species
Abstract

Chromatin accessibility plays an important role in shaping gene expression, yet little is known about the genetic and molecular mechanisms that influence the evolution of chromatin configuration. Both local (cis) and distant (trans) genetic influences can in principle influence chromatin accessibility and are based on distinct molecular mechanisms. We, therefore, sought to characterize the role that each of these plays in altering chromatin accessibility in 2 closely related sea urchin species. Using hybrids of Heliocidaris erythrogramma and Heliocidaris tuberculata, and adapting a statistical framework previously developed for the analysis of cis and trans influences on the transcriptome, we examined how these mechanisms shape the regulatory landscape at 3 important developmental stages, and compared our results to similar analyses of the transcriptome. We found extensive cis- and trans-based influences on evolutionary changes in chromatin, with cis effects generally larger in effect. Evolutionary changes in accessibility and gene expression are correlated, especially when expression has a local genetic basis. Maternal influences appear to have more of an effect on chromatin accessibility than on gene expression, persisting well past the maternal-to-zygotic transition. Chromatin accessibility near gene regulatory network genes appears to be distinctly regulated, with trans factors appearing to play an outsized role in the configuration of chromatin near these genes. Together, our results represent the first attempt to quantify cis and trans influences on evolutionary divergence in chromatin configuration in an outbred natural study system and suggest that chromatin regulation is more genetically complex than was previously appreciated.

 
more » « less
Award ID(s):
1929934
PAR ID:
10538962
Author(s) / Creator(s):
; ; ;
Editor(s):
Wittkopp, Patricia
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Molecular Biology and Evolution
Volume:
40
Issue:
11
ISSN:
0737-4038
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Wittkopp, Patricia (Ed.)
    Abstract Chromatin configuration is highly dynamic during embryonic development in animals, exerting an important point of control in transcriptional regulation. Yet there exists remarkably little information about the role of evolutionary changes in chromatin configuration to the evolution of gene expression and organismal traits. Genome-wide assays of chromatin configuration, coupled with whole-genome alignments, can help address this gap in knowledge in several ways. In this study we present a comparative analysis of regulatory element sequences and accessibility throughout embryogenesis in three sea urchin species with divergent life histories: a lecithotroph Heliocidaris erythrogramma, a closely related planktotroph H. tuberculata, and a distantly related planktotroph Lytechinus variegatus. We identified distinct epigenetic and mutational signatures of evolutionary modifications to the function of putative cis-regulatory elements in H. erythrogramma that have accumulated nonuniformly throughout the genome, suggesting selection, rather than drift, underlies many modifications associated with the derived life history. Specifically, regulatory elements composing the sea urchin developmental gene regulatory network are enriched for signatures of positive selection and accessibility changes which may function to alter binding affinity and access of developmental transcription factors to these sites. Furthermore, regulatory element changes often correlate with divergent expression patterns of genes involved in cell type specification, morphogenesis, and development of other derived traits, suggesting these evolutionary modifications have been consequential for phenotypic evolution in H. erythrogramma. Collectively, our results demonstrate that selective pressures imposed by changes in developmental life history rapidly reshape the cis-regulatory landscape of core developmental genes to generate novel traits and embryonic programs. 
    more » « less
  2. Changes in developmental gene regulatory networks (dGRNs) underlie much of the diversity of life, but the evolutionary mechanisms that operate on regulatory interactions remain poorly understood. Closely related species with extreme phenotypic divergence provide a valuable window into the genetic and molecular basis for changes in dGRNs and their relationship to adaptive changes in organismal traits. Here we analyse genomes, epigenomes and transcriptomes during early development in two Heliocidaris sea urchin species that exhibit highly divergent life histories and in an outgroup species. Positive selection and chromatin accessibility modifications within putative regulatory elements are enriched on the branch leading to the derived life history, particularly near dGRN genes. Single-cell transcriptomes reveal a dramatic delay in cell fate specification in the derived state, which also has far fewer open chromatin regions, especially near conserved cell fate specification genes. Experimentally perturbing key transcription factors reveals profound evolutionary changes to early embryonic patterning events, disrupting regulatory interactions previously conserved for ~225 million years. These results demonstrate that natural selection can rapidly reshape developmental gene expression on a broad scale when selective regimes abruptly change. More broadly, even highly conserved dGRNs and patterning mechanisms in the early embryo remain evolvable under appropriate ecological circumstances. 
    more » « less
  3. Abstract Motivation Genome-wide profiles of chromatin accessibility and gene expression in diverse cellular contexts are critical to decipher the dynamics of transcriptional regulation. Recently, convolutional neural networks have been used to learn predictive cis-regulatory DNA sequence models of context-specific chromatin accessibility landscapes. However, these context-specific regulatory sequence models cannot generalize predictions across cell types. Results We introduce multi-modal, residual neural network architectures that integrate cis-regulatory sequence and context-specific expression of trans-regulators to predict genome-wide chromatin accessibility profiles across cellular contexts. We show that the average accessibility of a genomic region across training contexts can be a surprisingly powerful predictor. We leverage this feature and employ novel strategies for training models to enhance genome-wide prediction of shared and context-specific chromatin accessible sites across cell types. We interpret the models to reveal insights into cis- and trans-regulation of chromatin dynamics across 123 diverse cellular contexts. Availability and implementation The code is available at https://github.com/kundajelab/ChromDragoNN. Supplementary information Supplementary data are available at Bioinformatics online. 
    more » « less
  4. Lasky, Jesse R. (Ed.)

    Gene expression can be influenced by genetic variants that are closely linked to the expressed gene (cis eQTLs) and variants in other parts of the genome (trans eQTLs). We created a multiparental mapping population by sampling genotypes from a single natural population ofMimulus guttatusand scored gene expression in the leaves of 1,588 plants. We find that nearly every measured gene exhibits cis regulatory variation (91% have FDR < 0.05). cis eQTLs are usually allelic series with three or more functionally distinct alleles. The cis locus explains about two thirds of the standing genetic variance (on average) but varies among genes and tends to be greatest when there is high indel variation in the upstream regulatory region and high nucleotide diversity in the coding sequence. Despite mapping over 10,000 trans eQTL / affected gene pairs, most of the genetic variance generated by trans acting loci remains unexplained. This implies a large reservoir of trans acting genes with subtle or diffuse effects. Mapped trans eQTLs show lower allelic diversity but much higher genetic dominance than cis eQTLs. Several analyses also indicate that trans eQTLs make a substantial contribution to the genetic correlations in expression among different genes. They may thus be essential determinants of “gene expression modules,” which has important implications for the evolution of gene expression and how it is studied by geneticists.

     
    more » « less
  5. INTRODUCTION Genome-wide association studies (GWASs) have identified thousands of human genetic variants associated with diverse diseases and traits, and most of these variants map to noncoding loci with unknown target genes and function. Current approaches to understand which GWAS loci harbor causal variants and to map these noncoding regulators to target genes suffer from low throughput. With newer multiancestry GWASs from individuals of diverse ancestries, there is a pressing and growing need to scale experimental assays to connect GWAS variants with molecular mechanisms. Here, we combined biobank-scale GWASs, massively parallel CRISPR screens, and single-cell sequencing to discover target genes of noncoding variants for blood trait loci with systematic targeting and inhibition of noncoding GWAS loci with single-cell sequencing (STING-seq). RATIONALE Blood traits are highly polygenic, and GWASs have identified thousands of noncoding loci that map to candidate cis -regulatory elements (CREs). By combining CRE-silencing CRISPR perturbations and single-cell readouts, we targeted hundreds of GWAS loci in a single assay, revealing target genes in cis and in trans . For select CREs that regulate target genes, we performed direct variant insertion. Although silencing the CRE can identify the target gene, direct variant insertion can identify magnitude and direction of effect on gene expression for the GWAS variant. In select cases in which the target gene was a transcription factor or microRNA, we also investigated the gene-regulatory networks altered upon CRE perturbation and how these networks differ across blood cell types. RESULTS We inhibited candidate CREs from fine-mapped blood trait GWAS variants (from ~750,000 individual of diverse ancestries) in human erythroid progenitors. In total, we targeted 543 variants (254 loci) mapping to candidate CREs, generating multimodal single-cell data including transcriptome, direct CRISPR gRNA capture, and cell surface proteins. We identified target genes in cis (within 500 kb) for 134 CREs. In most cases, we found that the target gene was the closest gene and that specific enhancer-associated biochemical hallmarks (H3K27ac and accessible chromatin) are essential for CRE function. Using multiple perturbations at the same locus, we were able to distinguished between causal variants from noncausal variants in linkage disequilibrium. For a subset of validated CREs, we also inserted specific GWAS variants using base-editing STING-seq (beeSTING-seq) and quantified the effect size and direction of GWAS variants on gene expression. Given our transcriptome-wide data, we examined dosage effects in cis and trans in cases in which the cis target is a transcription factor or microRNA. We found that trans target genes are also enriched for GWAS loci, and identified gene clusters within trans gene networks with distinct biological functions and expression patterns in primary human blood cells. CONCLUSION In this work, we investigated noncoding GWAS variants at scale, identifying target genes in single cells. These methods can help to address the variant-to-function challenges that are a barrier for translation of GWAS findings (e.g., drug targets for diseases with a genetic basis) and greatly expand our ability to understand mechanisms underlying GWAS loci. Identifying causal variants and their target genes with STING-seq. Uncovering causal variants and their target genes or function are a major challenge for GWASs. STING-seq combines perturbation of noncoding loci with multimodal single-cell sequencing to profile hundreds of GWAS loci in parallel. This approach can identify target genes in cis and trans , measure dosage effects, and decipher gene-regulatory networks. 
    more » « less