skip to main content


Title: Comparative analysis reveals distinctive epigenetic features of the human cerebellum
Identifying the molecular underpinnings of the neural specializations that underlie human cognitive and behavioral traits has long been of considerable interest. Much research on human-specific changes in gene expression and epigenetic marks has focused on the prefrontal cortex, a brain structure distinguished by its role in executive functions. The cerebellum shows expansion in great apes and is gaining increasing attention for its role in motor skills and cognitive processing, including language. However, relatively few molecular studies of the cerebellum in a comparative evolutionary context have been conducted. Here, we identify human-specific methylation in the lateral cerebellum relative to the dorsolateral prefrontal cortex, in a comparative study with chimpanzees ( Pan troglodytes ) and rhesus macaques ( Macaca mulatta ). Specifically, we profiled genome-wide methylation levels in the three species for each of the two brain structures and identified human-specific differentially methylated genomic regions unique to each structure. We further identified which differentially methylated regions (DMRs) overlap likely regulatory elements and determined whether associated genes show corresponding species differences in gene expression. We found greater human-specific methylation in the cerebellum than the dorsolateral prefrontal cortex, with differentially methylated regions overlapping genes involved in several conditions or processes relevant to human neurobiology, including synaptic plasticity, lipid metabolism, neuroinflammation and neurodegeneration, and neurodevelopment, including developmental disorders. Moreover, our results show some overlap with those of previous studies focused on the neocortex, indicating that such results may be common to multiple brain structures. These findings further our understanding of the cerebellum in human brain evolution.  more » « less
Award ID(s):
2021785
NSF-PAR ID:
10281661
Author(s) / Creator(s):
; ; ; ; ;
Editor(s):
Gojobori, Takashi
Date Published:
Journal Name:
PLOS Genetics
Volume:
17
Issue:
5
ISSN:
1553-7404
Page Range / eLocation ID:
e1009506
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Background

    Neuropsychiatric disorders afflict a large portion of the global population and constitute a significant source of disability worldwide. Although Genome-wide Association Studies (GWAS) have identified many disorder-associated variants, the underlying regulatory mechanisms linking them to disorders remain elusive, especially those involving distant genomic elements. Expression quantitative trait loci (eQTLs) constitute a powerful means of providing this missing link. However, most eQTL studies in human brains have focused exclusively on cis-eQTLs, which link variants to nearby genes (i.e., those within 1 Mb of a variant). A complete understanding of disease etiology requires a clearer understanding of trans-regulatory mechanisms, which, in turn, entails a detailed analysis of the relationships between variants and expression changes in distant genes.

    Methods

    By leveraging large datasets from the PsychENCODE consortium, we conducted a genome-wide survey of trans-eQTLs in the human dorsolateral prefrontal cortex. We also performed colocalization and mediation analyses to identify mediators in trans-regulation and use trans-eQTLs to link GWAS loci to schizophrenia risk genes.

    Results

    We identified ~80,000 candidate trans-eQTLs (at FDR<0.25) that influence the expression of ~10K target genes (i.e., “trans-eGenes”). We found that many variants associated with these candidate trans-eQTLs overlap with known cis-eQTLs. Moreover, for >60% of these variants (by colocalization), the cis-eQTL’s target gene acts as a mediator for the trans-eQTL SNP's effect on the trans-eGene, highlighting examples of cis-mediation as essential for trans-regulation. Furthermore, many of these colocalized variants fall into a discernable pattern wherein cis-eQTL’s target is a transcription factor or RNA-binding protein, which, in turn, targets the gene associated with the candidate trans-eQTL. Finally, we show that trans-regulatory mechanisms provide valuable insights into psychiatric disorders: beyond what had been possible using only cis-eQTLs, we link an additional 23 GWAS loci and 90 risk genes (using colocalization between candidate trans-eQTLs and schizophrenia GWAS loci).

    Conclusions

    We demonstrate that the transcriptional architecture of the human brain is orchestrated by both cis- and trans-regulatory variants and found that trans-eQTLs provide insights into brain-disease biology.

     
    more » « less
  2. INTRODUCTION Transposable elements (TEs), repeat expansions, and repeat-mediated structural rearrangements play key roles in chromosome structure and species evolution, contribute to human genetic variation, and substantially influence human health through copy number variants, structural variants, insertions, deletions, and alterations to gene transcription and splicing. Despite their formative role in genome stability, repetitive regions have been relegated to gaps and collapsed regions in human genome reference GRCh38 owing to the technological limitations during its development. The lack of linear sequence in these regions, particularly in centromeres, resulted in the inability to fully explore the repeat content of the human genome in the context of both local and regional chromosomal environments. RATIONALE Long-read sequencing supported the complete, telomere-to-telomere (T2T) assembly of the pseudo-haploid human cell line CHM13. This resource affords a genome-scale assessment of all human repetitive sequences, including TEs and previously unknown repeats and satellites, both within and outside of gaps and collapsed regions. Additionally, a complete genome enables the opportunity to explore the epigenetic and transcriptional profiles of these elements that are fundamental to our understanding of chromosome structure, function, and evolution. Comparative analyses reveal modes of repeat divergence, evolution, and expansion or contraction with locus-level resolution. RESULTS We implemented a comprehensive repeat annotation workflow using previously known human repeats and de novo repeat modeling followed by manual curation, including assessing overlaps with gene annotations, segmental duplications, tandem repeats, and annotated repeats. Using this method, we developed an updated catalog of human repetitive sequences and refined previous repeat annotations. We discovered 43 previously unknown repeats and repeat variants and characterized 19 complex, composite repetitive structures, which often carry genes, across T2T-CHM13. Using precision nuclear run-on sequencing (PRO-seq) and CpG methylated sites generated from Oxford Nanopore Technologies long-read sequencing data, we assessed RNA polymerase engagement across retroelements genome-wide, revealing correlations between nascent transcription, sequence divergence, CpG density, and methylation. These analyses were extended to evaluate RNA polymerase occupancy for all repeats, including high-density satellite repeats that reside in previously inaccessible centromeric regions of all human chromosomes. Moreover, using both mapping-dependent and mapping-independent approaches across early developmental stages and a complete cell cycle time series, we found that engaged RNA polymerase across satellites is low; in contrast, TE transcription is abundant and serves as a boundary for changes in CpG methylation and centromere substructure. Together, these data reveal the dynamic relationship between transcriptionally active retroelement subclasses and DNA methylation, as well as potential mechanisms for the derivation and evolution of new repeat families and composite elements. Focusing on the emerging T2T-level assembly of the HG002 X chromosome, we reveal that a high level of repeat variation likely exists across the human population, including composite element copy numbers that affect gene copy number. Additionally, we highlight the impact of repeats on the structural diversity of the genome, revealing repeat expansions with extreme copy number differences between humans and primates while also providing high-confidence annotations of retroelement transduction events. CONCLUSION The comprehensive repeat annotations and updated repeat models described herein serve as a resource for expanding the compendium of human genome sequences and reveal the impact of specific repeats on the human genome. In developing this resource, we provide a methodological framework for assessing repeat variation within and between human genomes. The exhaustive assessment of the transcriptional landscape of repeats, at both the genome scale and locally, such as within centromeres, sets the stage for functional studies to disentangle the role transcription plays in the mechanisms essential for genome stability and chromosome segregation. Finally, our work demonstrates the need to increase efforts toward achieving T2T-level assemblies for nonhuman primates and other species to fully understand the complexity and impact of repeat-derived genomic innovations that define primate lineages, including humans. Telomere-to-telomere assembly of CHM13 supports repeat annotations and discoveries. The human reference T2T-CHM13 filled gaps and corrected collapsed regions (triangles) in GRCh38. Combining long read–based methylation calls, PRO-seq, and multilevel computational methods, we provide a compendium of human repeats, define retroelement expression and methylation profiles, and delineate locus-specific sites of nascent transcription genome-wide, including previously inaccessible centromeres. SINE, short interspersed element; SVA, SINE–variable number tandem repeat– Alu ; LINE, long interspersed element; LTR, long terminal repeat; TSS, transcription start site; pA, xxxxxxxxxxxxxxxx. 
    more » « less
  3. Abstract Background Mutations in LMNA , encoding lamin A/C, lead to a variety of diseases known as laminopathies including dilated cardiomyopathy (DCM) and skeletal abnormalities. Though previous studies have investigated the dysregulation of gene expression in cells from patients with DCM, the role of epigenetic (gene regulatory) mechanisms, such as DNA methylation, has not been thoroughly investigated. Furthermore, the impact of family-specific LMNA mutations on DNA methylation is unknown. Here, we performed reduced representation bisulfite sequencing on ten pairs of fibroblasts and their induced pluripotent stem cell (iPSC) derivatives from two families with DCM due to distinct LMNA mutations, one of which also induces brachydactyly. Results Family-specific differentially methylated regions (DMRs) were identified by comparing the DNA methylation landscape of patient and control samples. Fibroblast DMRs were found to enrich for distal regulatory features and transcriptionally repressed chromatin and to associate with genes related to phenotypes found in tissues affected by laminopathies. These DMRs, in combination with transcriptome-wide expression data and lamina-associated domain (LAD) organization, revealed the presence of inter-family epimutation hotspots near differentially expressed genes, most of which were located outside LADs redistributed in LMNA -related DCM. Comparison of DMRs found in fibroblasts and iPSCs identified regions where epimutations were persistent across both cell types. Finally, a network of aberrantly methylated disease-associated genes revealed a potential molecular link between pathways involved in bone and heart development. Conclusions Our results identified both shared and mutation-specific laminopathy epimutation landscapes that were consistent with lamin A/C mutation-mediated epigenetic aberrancies that arose in somatic and early developmental cell stages. 
    more » « less
  4. Abstract Background DNA methylation dynamics in the brain are associated with normal development and neuropsychiatric disease and differ across functionally distinct brain regions. Previous studies of genome-wide methylation differences among human brain regions focus on limited numbers of individuals and one to two brain regions. Results Using GTEx samples, we generate a resource of DNA methylation in purified neuronal nuclei from 8 brain regions as well as lung and thyroid tissues from 12 to 23 donors. We identify differentially methylated regions between brain regions among neuronal nuclei in both CpG (181,146) and non-CpG (264,868) contexts, few of which were unique to a single pairwise comparison. This significantly expands the knowledge of differential methylation across the brain by 10-fold. In addition, we present the first differential methylation analysis among neuronal nuclei from basal ganglia tissues and identify unique CpG differentially methylated regions, many associated with ion transport. We also identify 81,130 regions of variably CpG methylated regions, i.e., variable methylation among individuals in the same brain region, which are enriched in regulatory regions and in CpG differentially methylated regions. Many variably methylated regions are unique to a specific brain region, with only 202 common across all brain regions, as well as lung and thyroid. Variably methylated regions identified in the amygdala, anterior cingulate cortex, and hippocampus are enriched for heritability of schizophrenia. Conclusions These data suggest that epigenetic variation in these particular human brain regions could be associated with the risk for this neuropsychiatric disorder. 
    more » « less
  5. Post-transcriptional RNA modifications have been recognized as key regulators of neuronal differentiation and synapse development in the mammalian brain. While distinct sets of 5-methylcytosine (m5C) modified mRNAs have been detected in neuronal cells and brain tissues, no study has been performed to characterize methylated mRNA profiles in the developing brain. Here, together with regular RNA-seq, we performed transcriptome-wide bisulfite sequencing to compare RNA cytosine methylation patterns in neural stem cells (NSCs), cortical neuronal cultures, and brain tissues at three postnatal stages. Among 501 m5C sites identified, approximately 6% are consistently methylated across all five conditions. Compared to m5C sites identified in NSCs, 96% of them were hypermethylated in neurons and enriched for genes involved in positive transcriptional regulation and axon extension. In addition, brains at the early postnatal stage demonstrated substantial changes in both RNA cytosine methylation and gene expression of RNA cytosine methylation readers, writers, and erasers. Furthermore, differentially methylated transcripts were significantly enriched for genes regulating synaptic plasticity. Altogether, this study provides a brain epitranscriptomic dataset as a new resource and lays the foundation for further investigations into the role of RNA cytosine methylation during brain development. 
    more » « less