Title: Evolutionary Dynamics of the Pericentromeric Heterochromatin in Drosophila virilis and Related Species
Pericentromeric heterochromatin in Drosophila generally consists of repetitive DNA, forming an environment associated with gene silencing. Despite expanding knowledge of the impact of transposable elements (TEs) on the host genome, little is known about the evolution of pericentromeric heterochromatin, its structural composition, and its age. During the evolution of the Drosophilidae, hundreds of genes have become embedded within pericentromeric regions yet retained activity. We investigated a pericentromeric heterochromatin fragment found in D. virilis and related species, describing the evolution of genes in this region and the age of TE invasion. Despite the heterochromatic environment, the amino acid composition of the genes is under purifying selection. However, the selective pressure affects different parts of genes to varying degrees, resulting in expansion of gene introns due to TE invasion. According to the divergence of TEs, the pericentromeric heterochromatin of species of the virilis group began to form more than 20 million years ago through invasions of retroelements, miniature inverted repeat transposable elements (MITEs), and Helitrons. Importantly, invasions into the heterochromatin continue to occur by TEs that fall under the scope of piRNA silencing. Thus, the pericentromeric heterochromatin, in spite of its ability to induce silencing, has the means for being dynamic, incorporating regions of active transcription.
Award ID(s):
2025197
NSF-PAR ID:
10295386
Journal Name:
Genes
Volume:
12
Issue:
2
ISSN:
2073-4425
Page Range / eLocation ID:
175
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Lerat, Emmanuelle (Ed.)
    Abstract Methylated CHH (mCHH) islands are peaks of CHH methylation that occur primarily upstream of genes. These regions are actively targeted by the methylation machinery, occur at boundaries between heterochromatin and euchromatin, and tend to be near highly expressed genes. Here we took an evolutionary perspective by studying upstream mCHH islands across a sample of eight grass species. Using a statistical approach to define mCHH islands as regions that differ from genome-wide background CHH methylation levels, we demonstrated that mCHH islands are common and associate with 39% of genes, on average. We hypothesized that islands should be more frequent in genomes of large size, because they have more heterochromatin and hence more need for defined boundaries. We found, however, that smaller genomes tended to have a higher proportion of genes associated with 5′ mCHH islands. Consistent with previous work suggesting that islands reflect the silencing of the edges of transposable elements (TEs), genes with nearby TEs were more likely to have mCHH islands. However, the presence of mCHH islands was not a function solely of TEs, both because the underlying sequences of islands were often not homologous to TEs and because genic properties also predicted the presence of 5′ mCHH islands. These genic properties included length and gene-body methylation (gbM); in fact, in three of eight species, the absence of gbM was a stronger predictor of a 5′ mCHH island than TE proximity. In contrast, gene expression level was a positive but weak predictor of the presence of an island. Finally, we assessed whether mCHH islands were evolutionarily conserved by focusing on a set of 2,720 orthologs across the eight species. They were generally not conserved across evolutionary time. Overall, our data establish additional genic properties that are associated with mCHH islands and suggest that they are not just a consequence of the TE silencing machinery.
  2. Abstract

    Understanding the mechanisms underlying biological invasions and rapid adaptation to global change remains a fundamental challenge, particularly in small populations lacking in genetic variation. Two understudied mechanisms that could facilitate adaptive evolution and adaptive plasticity are the increased genetic variation due to transposable elements (TEs), and associated or independent modification of gene expression through epigenetic changes.

    Here, we focus on the potential role of these genetic and non‐genetic mechanisms for facilitating invasion success. Because novel or stressful environments are known to induce both epigenetic changes and TE activity, these mechanisms may play an underappreciated role in generating phenotypic and genetic variation for selection to act on. We review how these mechanisms operate, the evidence for how they respond to novel or stressful environments, and how these mechanisms can contribute to the success of biological invasions by facilitating adaptive evolution and phenotypic plasticity.

    Because genetic and phenotypic variations due to TEs and epigenetic changes are often well regulated or “hidden” in the native environment, the independent and combined contribution of these mechanisms may only become important when populations colonize novel environments. A focus on the mechanisms that generate and control the expression of this variation in new environments may provide insights into biological invasions that would otherwise not be obvious.

    Global changes and human activities impact on ecosystems and allow new opportunities for biological invasions. Invasive species succeed by adapting rapidly to new environments. The degree to which rapid responses to environmental change could be mediated by the epigenome—the regulatory system that integrates how environmental and genomic variation jointly shape phenotypic variation—requires greater attention if we want to understand the mechanisms by which populations successfully colonize and adapt to new environments.

    A free Plain Language Summary can be found within the Supporting Information of this article.
  3. Bosco, Giovanni (Ed.)
    Transposable elements (TEs) are selfish genetic elements that can cause harmful mutations. In Drosophila, it has been estimated that half of all spontaneous visible marker phenotypes are mutations caused by TE insertions. Several factors likely limit the accumulation of exponentially amplifying TEs within genomes. First, synergistic interactions between TEs that amplify their harm with increasing copy number are proposed to limit TE copy number. However, the nature of this synergy is poorly understood. Second, because of the harm posed by TEs, eukaryotes have evolved systems of small RNA-based genome defense to limit transposition. However, as in all immune systems, there is a cost of autoimmunity and small RNA-based systems that silence TEs can inadvertently silence genes flanking TE insertions. In a screen for essential meiotic genes in Drosophila melanogaster, a truncated Doc retrotransposon within a neighboring gene was found to trigger the germline silencing of ald, the Drosophila Mps1 homolog, a gene essential for proper chromosome segregation in meiosis. A subsequent screen for suppressors of this silencing identified a new insertion of a Hobo DNA transposon in the same neighboring gene. Here we describe how the original Doc insertion triggers flanking piRNA biogenesis and local gene silencing. We show that this local gene silencing occurs in cis and is dependent on deadlock, a component of the Rhino-Deadlock-Cutoff (RDC) complex, to trigger dual-strand piRNA biogenesis at TE insertions. We further show how the additional Hobo insertion leads to de-silencing by reducing flanking piRNA biogenesis triggered by the original Doc insertion. These results support a model of TE-mediated gene silencing by piRNA biogenesis in cis that depends on local determinants of transcription. This may explain complex patterns of off-target gene silencing triggered by TEs within populations and in the laboratory. It also provides a mechanism of sign epistasis among TE insertions, illuminates the complex nature of their interactions and supports a model in which off-target gene silencing shapes the evolution of the RDC complex.
  4. INTRODUCTION Transposable elements (TEs), repeat expansions, and repeat-mediated structural rearrangements play key roles in chromosome structure and species evolution, contribute to human genetic variation, and substantially influence human health through copy number variants, structural variants, insertions, deletions, and alterations to gene transcription and splicing. Despite their formative role in genome stability, repetitive regions have been relegated to gaps and collapsed regions in human genome reference GRCh38 owing to the technological limitations during its development. The lack of linear sequence in these regions, particularly in centromeres, resulted in the inability to fully explore the repeat content of the human genome in the context of both local and regional chromosomal environments.

    RATIONALE Long-read sequencing supported the complete, telomere-to-telomere (T2T) assembly of the pseudo-haploid human cell line CHM13. This resource affords a genome-scale assessment of all human repetitive sequences, including TEs and previously unknown repeats and satellites, both within and outside of gaps and collapsed regions. Additionally, a complete genome enables the opportunity to explore the epigenetic and transcriptional profiles of these elements that are fundamental to our understanding of chromosome structure, function, and evolution. Comparative analyses reveal modes of repeat divergence, evolution, and expansion or contraction with locus-level resolution.

    RESULTS We implemented a comprehensive repeat annotation workflow using previously known human repeats and de novo repeat modeling followed by manual curation, including assessing overlaps with gene annotations, segmental duplications, tandem repeats, and annotated repeats. Using this method, we developed an updated catalog of human repetitive sequences and refined previous repeat annotations. We discovered 43 previously unknown repeats and repeat variants and characterized 19 complex, composite repetitive structures, which often carry genes, across T2T-CHM13. Using precision nuclear run-on sequencing (PRO-seq) and CpG methylated sites generated from Oxford Nanopore Technologies long-read sequencing data, we assessed RNA polymerase engagement across retroelements genome-wide, revealing correlations between nascent transcription, sequence divergence, CpG density, and methylation. These analyses were extended to evaluate RNA polymerase occupancy for all repeats, including high-density satellite repeats that reside in previously inaccessible centromeric regions of all human chromosomes. Moreover, using both mapping-dependent and mapping-independent approaches across early developmental stages and a complete cell cycle time series, we found that engaged RNA polymerase across satellites is low; in contrast, TE transcription is abundant and serves as a boundary for changes in CpG methylation and centromere substructure. Together, these data reveal the dynamic relationship between transcriptionally active retroelement subclasses and DNA methylation, as well as potential mechanisms for the derivation and evolution of new repeat families and composite elements. Focusing on the emerging T2T-level assembly of the HG002 X chromosome, we reveal that a high level of repeat variation likely exists across the human population, including composite element copy numbers that affect gene copy number. Additionally, we highlight the impact of repeats on the structural diversity of the genome, revealing repeat expansions with extreme copy number differences between humans and primates while also providing high-confidence annotations of retroelement transduction events.

    CONCLUSION The comprehensive repeat annotations and updated repeat models described herein serve as a resource for expanding the compendium of human genome sequences and reveal the impact of specific repeats on the human genome. In developing this resource, we provide a methodological framework for assessing repeat variation within and between human genomes. The exhaustive assessment of the transcriptional landscape of repeats, at both the genome scale and locally, such as within centromeres, sets the stage for functional studies to disentangle the role transcription plays in the mechanisms essential for genome stability and chromosome segregation. Finally, our work demonstrates the need to increase efforts toward achieving T2T-level assemblies for nonhuman primates and other species to fully understand the complexity and impact of repeat-derived genomic innovations that define primate lineages, including humans.

    Telomere-to-telomere assembly of CHM13 supports repeat annotations and discoveries. The human reference T2T-CHM13 filled gaps and corrected collapsed regions (triangles) in GRCh38. Combining long read–based methylation calls, PRO-seq, and multilevel computational methods, we provide a compendium of human repeats, define retroelement expression and methylation profiles, and delineate locus-specific sites of nascent transcription genome-wide, including previously inaccessible centromeres. SINE, short interspersed element; SVA, SINE–variable number tandem repeat–Alu; LINE, long interspersed element; LTR, long terminal repeat; TSS, transcription start site; pA, xxxxxxxxxxxxxxxx.
  5. Purugganan, Michael (Ed.)
    Abstract Subgenome dominance after whole-genome duplication (WGD) has been observed in many plant species. However, the degree to which the chromatin environment affects this bias has not been explored. Here, we compared the dominant subgenome (maize1) and the recessive subgenome (maize2) with respect to patterns of sequence substitutions, gene expression, transposable element accumulation, small interfering RNAs, DNA methylation, histone modifications, and accessible chromatin regions (ACRs). Our data show that the degree of bias between subgenomes for all the measured variables does not vary significantly when both of the WGD genes are located in pericentromeric regions. Our data further indicate that the location of maize1 genes in chromosomal arms is pivotal for maize1 to maintain its dominance, but location has less effect on maize2 homoeologs. In addition to homoeologous genes, we compared ACRs, which often harbor cis-regulatory elements, between the two subgenomes and demonstrate that maize1 ACRs have a higher level of chromatin accessibility, a lower level of sequence substitution, and are enriched in chromosomal arms. Furthermore, we find that a loss of maize1 ACRs near genes is associated with a reduction in purifying selection and expression of maize1 genes relative to their maize2 homoeologs. Taken together, our data suggest that chromatin environment and cis-regulatory elements are important determinants shaping the divergence and evolution of duplicated genes.