skip to main content


Title: Unique structure and positive selection promote the rapid divergence of Drosophila Y chromosomes
Y chromosomes across diverse species convergently evolve a gene-poor, heterochromatic organization enriched for duplicated genes, LTR retrotransposons, and satellite DNA. Sexual antagonism and a loss of recombination play major roles in the degeneration of young Y chromosomes. However, the processes shaping the evolution of mature, already degenerated Y chromosomes are less well-understood. Because Y chromosomes evolve rapidly, comparisons between closely related species are particularly useful. We generated de novo long-read assemblies complemented with cytological validation to reveal Y chromosome organization in three closely related species of the Drosophila simulans complex, which diverged only 250,000 years ago and share >98% sequence identity. We find these Y chromosomes are divergent in their organization and repetitive DNA composition and discover new Y-linked gene families whose evolution is driven by both positive selection and gene conversion. These Y chromosomes are also enriched for large deletions, suggesting that the repair of double-strand breaks on Y chromosomes may be biased toward microhomology-mediated end joining over canonical non-homologous end-joining. We propose that this repair mechanism contributes to the convergent evolution of Y chromosome organization across organisms.  more » « less
Award ID(s):
1844693
NSF-PAR ID:
10382224
Author(s) / Creator(s):
; ; ; ;
Date Published:
Journal Name:
eLife
Volume:
11
ISSN:
2050-084X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Sex chromosome dosage compensation is a model to understand the coordinated evolution of transcription; however, the advanced age of the sex chromosomes in model systems makes it difficult to study how the complex regulatory mechanisms underlying chromosome-wide dosage compensation can evolve. The sex chromosomes ofPoecilia pictahave undergone recent and rapid divergence, resulting in widespread gene loss on the male Y, coupled with complete X Chromosome dosage compensation, the first case reported in a fish. The recent de novo origin of dosage compensation presents a unique opportunity to understand the genetic and evolutionary basis of coordinated chromosomal gene regulation. By combining a new chromosome-level assembly ofP. pictawith whole-genome bisulfite sequencing and RNA-seq data, we determine that the YY1 transcription factor (YY1) DNA binding motif is associated with male-specific hypomethylated regions on the X, but not the autosomes. These YY1 motifs are the result of a recent and rapid repetitive element expansion on theP. pictaX Chromosome, which is absent in closely related species that lack dosage compensation. Taken together, our results present compelling support that a disruptive wave of repetitive element insertions carrying YY1 motifs resulted in the remodeling of the X Chromosome epigenomic landscape and the rapid de novo origin of a dosage compensation system.

     
    more » « less
  2. Genes that originate during evolution are an important source of novel biological functions. Retrogenes are functional copies of genes produced by retroduplication and as such are located in different genomic positions. To investigate retroposition patterns and retrogene expression, we computationally identified interchromosomal retroduplication events in nine portions of the phylogenetic history of malaria mosquitoes, making use of species that do or do not have classical sex chromosomes to test the roles of sex-linkage. We found 40 interchromosomal events and a significant excess of retroduplications from the X chromosome to autosomes among a set of young retrogenes. These young retroposition events occurred within the last 100 million years in lineages where all species possessed differentiated sex chromosomes. An analysis of available microarray and RNA-seq expression data for Anopheles gambiae showed that many of the young retrogenes evolved male-biased expression in the reproductive organs. Young autosomal retrogenes with increased meiotic or postmeiotic expression in the testes tend to be male biased. In contrast, older retrogenes, i.e., in lineages with undifferentiated sex chromosomes, do not show this particular chromosomal bias and are enriched for female-biased expression in reproductive organs. Our reverse-transcription PCR data indicates that most of the youngest retrogenes, which originated within the last 47.6 million years in the subgenus Cellia, evolved non-uniform expression patterns across body parts in the males and females of An. coluzzii. Finally, gene annotation revealed that mitochondrial function is a prominent feature of the young autosomal retrogenes. We conclude that mRNA-mediated gene duplication has produced a set of genes that contribute to mosquito reproductive functions and that different biases are revealed after the sex chromosomes evolve. Overall, these results suggest potential roles for the evolution of meiotic sex chromosome inactivation in males and of sexually antagonistic conflict related to mitochondrial energy function as the main selective pressures for X-to-autosome gene reduplication and testis-biased expression in these mosquito lineages. 
    more » « less
  3. Abstract

    Sex determination, the developmental process by which sexually dimorphic phenotypes are established, evolves fast. Evolutionary turnover in a sex determination pathway may occur via selection on alleles that are genetically linked to a new master sex determining locus on a newly formed proto‐sex chromosome. Species with polygenic sex determination, in which master regulatory genes are found on multiple different proto‐sex chromosomes, are informative models to study the evolution of sex determination and sex chromosomes. House flies are such a model system, with male determining loci possible on all six chromosomes and a female‐determiner on one of the chromosomes as well. The two most common male‐determining proto‐Y chromosomes form latitudinal clines on multiple continents, suggesting that temperature variation is an important selection pressure responsible for maintaining polygenic sex determination in this species. Temperature‐dependent fitness effects could be manifested through temperature‐dependent gene expression differences across proto‐Y chromosome genotypes. These gene expression differences may be the result ofcisregulatory variants that affect the expression of genes on the proto‐sex chromosomes, ortranseffects of the proto‐Y chromosomes on genes elswhere in the genome. We used RNA‐seq to identify genes whose expression depends on proto‐Y chromosome genotype and temperature in adult male house flies. We found no evidence for ecologically meaningful temperature‐dependent expression differences of sex determining genes between male genotypes, but we were probably not sampling an appropriate developmental time‐point to identify such effects. In contrast, we identified many other genes whose expression depends on the interaction between proto‐Y chromosome genotype and temperature, including genes that encode proteins involved in reproduction, metabolism, lifespan, stress response, and immunity. Notably, genes with genotype‐by‐temperature interactions on expression were not enriched on the proto‐sex chromosomes. Moreover, there was no evidence that temperature‐dependent expression is driven by chromosome‐widecis‐regulatory divergence between the proto‐Y and proto‐X alleles. Therefore, if temperature‐dependent gene expression is responsible for differences in phenotypes and fitness of proto‐Y genotypes across house fly populations, these effects are driven by a small number of temperature‐dependent alleles on the proto‐Y chromosomes that may havetranseffects on the expression of genes on other chromosomes.

     
    more » « less
  4. INTRODUCTION To faithfully distribute genetic material to daughter cells during cell division, spindle fibers must couple to DNA by means of a structure called the kinetochore, which assembles at each chromosome’s centromere. Human centromeres are located within large arrays of tandemly repeated DNA sequences known as alpha satellite (αSat), which often span millions of base pairs on each chromosome. Arrays of αSat are frequently surrounded by other types of tandem satellite repeats, which have poorly understood functions, along with nonrepetitive sequences, including transcribed genes. Previous genome sequencing efforts have been unable to generate complete assemblies of satellite-rich regions because of their scale and repetitive nature, limiting the ability to study their organization, variation, and function. RATIONALE Pericentromeric and centromeric (peri/centromeric) satellite DNA sequences have remained almost entirely missing from the assembled human reference genome for the past 20 years. Using a complete, telomere-to-telomere (T2T) assembly of a human genome, we developed and deployed tailored computational approaches to reveal the organization and evolutionary patterns of these satellite arrays at both large and small length scales. We also performed experiments to map precisely which αSat repeats interact with kinetochore proteins. Last, we compared peri/centromeric regions among multiple individuals to understand how these sequences vary across diverse genetic backgrounds. RESULTS Satellite repeats constitute 6.2% of the T2T-CHM13 genome assembly, with αSat representing the single largest component (2.8% of the genome). By studying the sequence relationships of αSat repeats in detail across each centromere, we found genome-wide evidence that human centromeres evolve through “layered expansions.” Specifically, distinct repetitive variants arise within each centromeric region and expand through mechanisms that resemble successive tandem duplications, whereas older flanking sequences shrink and diverge over time. We also revealed that the most recently expanded repeats within each αSat array are more likely to interact with the inner kinetochore protein Centromere Protein A (CENP-A), which coincides with regions of reduced CpG methylation. This suggests a strong relationship between local satellite repeat expansion, kinetochore positioning, and DNA hypomethylation. Furthermore, we uncovered large and unexpected structural rearrangements that affect multiple satellite repeat types, including active centromeric αSat arrays. Last, by comparing sequence information from nearly 1600 individuals’ X chromosomes, we observed that individuals with recent African ancestry possess the greatest genetic diversity in the region surrounding the centromere, which sometimes contains a predominantly African αSat sequence variant. CONCLUSION The genetic and epigenetic properties of centromeres are closely interwoven through evolution. These findings raise important questions about the specific molecular mechanisms responsible for the relationship between inner kinetochore proteins, DNA hypomethylation, and layered αSat expansions. Even more questions remain about the function and evolution of non-αSat repeats. To begin answering these questions, we have produced a comprehensive encyclopedia of peri/centromeric sequences in a human genome, and we demonstrated how these regions can be studied with modern genomic tools. Our work also illuminates the rich genetic variation hidden within these formerly missing regions of the genome, which may contribute to health and disease. This unexplored variation underlines the need for more T2T human genome assemblies from genetically diverse individuals. Gapless assemblies illuminate centromere evolution. ( Top ) The organization of peri/centromeric satellite repeats. ( Bottom left ) A schematic portraying (i) evidence for centromere evolution through layered expansions and (ii) the localization of inner-kinetochore proteins in the youngest, most recently expanded repeats, which coincide with a region of DNA hypomethylation. ( Bottom right ) An illustration of the global distribution of chrX centromere haplotypes, showing increased diversity in populations with recent African ancestry. 
    more » « less
  5. Schaack, Sarah (Ed.)
    Abstract Sex chromosomes diverge after the establishment of recombination suppression, resulting in differential sex-linkage of genes involved in genetic sex determination and dimorphic traits. This process produces systems of male or female heterogamety wherein the Y and W chromosomes are only present in one sex and are often highly degenerated. Sex-limited Y and W chromosomes contain valuable information about the evolutionary transition from autosomes to sex chromosomes, yet detailed characterizations of the structure, composition, and gene content of sex-limited chromosomes are lacking for many species. In this study, we characterize the female-specific W chromosome of the prairie rattlesnake (Crotalus viridis) and evaluate how recombination suppression and other processes have shaped sex chromosome evolution in ZW snakes. Our analyses indicate that the rattlesnake W chromosome is over 80% repetitive and that an abundance of GC-rich mdg4 elements has driven an overall high degree of GC-richness despite a lack of recombination. The W chromosome is also highly enriched for repeat sequences derived from endogenous retroviruses and likely acts as a “refugium” for these and other retroelements. We annotated 219 putatively functional W-linked genes across at least two evolutionary strata identified based on estimates of sequence divergence between Z and W gametologs. The youngest of these strata is relatively gene-rich, however gene expression across strata suggests retained gene function amidst a greater degree of degeneration following ancient recombination suppression. Functional annotation of W-linked genes indicates a specialization of the W chromosome for reproductive and developmental function since recombination suppression from the Z chromosome. 
    more » « less