Title: Chromosome-scale assemblies reveal the structural evolution of African cichlid genomes
Abstract:
Background: African cichlid fishes are well known for their rapid radiations and are a model system for studying evolutionary processes. Here we compare multiple, high-quality, chromosome-scale genome assemblies to elucidate the genetic mechanisms underlying cichlid diversification and study how genome structure evolves in rapidly radiating lineages.
Results: We re-anchored our recent assembly of the Nile tilapia (Oreochromis niloticus) genome using a new high-density genetic map. We also developed a new de novo genome assembly of the Lake Malawi cichlid, Metriaclima zebra, using high-coverage Pacific Biosciences sequencing, and anchored contigs to linkage groups (LGs) using 4 different genetic maps. These new anchored assemblies allow the first chromosome-scale comparisons of African cichlid genomes. Large intra-chromosomal structural differences (~2–28 megabase pairs) among species are common, while inter-chromosomal differences are rare (<10 megabase pairs total). Placement of the centromeres within the chromosome-scale assemblies identifies large structural differences that explain many of the karyotype differences among species. Structural differences are also associated with unique patterns of recombination on sex chromosomes. Structural differences on LG9, LG11, and LG20 are associated with reduced recombination, indicative of inversions between the rock- and sand-dwelling clades of Lake Malawi cichlids. M. zebra has a larger number of recent transposable element insertions compared with O. niloticus, suggesting that several transposable element families have a higher rate of insertion in the haplochromine cichlid lineage.
Conclusion: This study identifies novel structural variation among East African cichlid genomes and provides a new set of genomic resources to support research on the mechanisms driving cichlid adaptation and speciation.
Award ID(s):
1830753
PAR ID:
10555341
Author(s) / Creator(s):
; ; ; ; ; ; ; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
GigaScience
Volume:
8
Issue:
4
ISSN:
2047-217X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Vieira, Cristina (Ed.)
    Abstract Structural genomic variants are key drivers of phenotypic evolution. They can span hundreds to millions of base pairs and can thus affect large numbers of genetic elements. Although structural variation is quite common within and between species, its characterization depends upon the quality of genome assemblies and the proportion of repetitive elements. Using new high-quality genome assemblies, we report a complex and previously hidden landscape of structural divergence between the genomes of Drosophila persimilis and D. pseudoobscura, two classic species in speciation research, and study the relationships among structural variants, transposable elements, and gene expression divergence. The new assemblies confirm the already known fixed inversion differences between these species. Consistent with previous studies showing higher levels of nucleotide divergence between fixed inversions relative to collinear regions of the genome, we also find a significant overrepresentation of INDELs inside the inversions. We find that transposable elements accumulate in regions with low levels of recombination, and spatial correlation analyses reveal a strong association between transposable elements and structural variants. We also report a strong association between differentially expressed (DE) genes and structural variants and an overrepresentation of DE genes inside the fixed chromosomal inversions that separate this species pair. Interestingly, species-specific structural variants are overrepresented in DE genes involved in neural development, spermatogenesis, and oocyte-to-embryo transition. Overall, our results highlight the association of transposable elements with structural variants and their importance in driving evolutionary divergence. 
  2. Abstract African cichlid fishes are a prime model for studying speciation mechanisms. Despite the development of extensive genomic resources, it has been difficult to determine which sources of genetic variation are responsible for cichlid phenotypic variation. One of their most variable phenotypes is visual sensitivity, with some of the largest spectral shifts among vertebrates. These shifts arise primarily from differential expression of seven cone opsin genes. By mapping expression quantitative trait loci (eQTL) in intergeneric crosses of Lake Malawi cichlids, we previously identified four causative genetic variants that correspond to indels in the promoters of either key transcription factors or an opsin gene. In this comprehensive study, we show that these indels are the result of the movement of transposable elements (TEs) that correlate with opsin expression variation across the Malawi flock. In tracking the evolutionary history of these particular indels, we found they are endemic to Lake Malawi, suggesting that these TEs are recently active and are segregating within the Malawi cichlid lineage. However, an independent indel has arisen at a similar genomic location in one locus outside of the Malawi flock. The convergence in TE movement suggests these loci are primed for TE insertion and subsequent deletions. Increased TE mobility may be associated with interspecific hybridization, which disrupts mechanisms of TE suppression. This might provide a link between cichlid hybridization and accelerated regulatory variation. Overall, our study suggests that TEs may be an important driver of key regulatory changes, facilitating rapid phenotypic change and possibly speciation in African cichlids. 
  3. Advances in genome sequencing have greatly accelerated the identification of sex chromosomes in a variety of species. Many of these species have experienced structural rearrangements that reduce recombination between the sex chromosomes, allowing the accumulation of sequence differences over many megabases. Identification of the genes that are responsible for sex determination within these sometimes large regions has proved difficult. Here, we identify an XY sex chromosome system on LG19 in the West African cichlid fish Chromidotilapia guntheri in which the region of differentiation extends over less than 400 kb. We develop high-quality male and female genome assemblies for this species, which confirm the absence of structural variants, and which facilitate the annotation of genes in the region. The peak of differentiation lies within rin3, which has experienced several debilitating mutations on the Y chromosome. We suggest two hypotheses about how these mutations might disrupt endocytosis, leading to Mendelian effects on sexual development.
  4. VITTE, Clémentine (Ed.)
    Structural differences between genomes are a major source of genetic variation that contributes to phenotypic differences. Transposable elements, mobile genetic sequences capable of increasing their copy number and propagating themselves within genomes, can generate structural variation. However, their repetitive nature makes it difficult to characterize fine-scale differences in their presence at specific positions, limiting our understanding of their impact on genome variation. Domesticated maize is a particularly good system for exploring the impact of transposable element proliferation as over 70% of the genome is annotated as transposable elements. High-quality transposable element annotations were recently generated for de novo genome assemblies of 26 diverse inbred maize lines. We generated base-pair resolved pairwise alignments between the B73 maize reference genome and the remaining 25 inbred maize line assemblies. From these data, we classified transposable elements as either shared or polymorphic in a given pairwise comparison. Our analysis uncovered substantial structural variation between lines, representing both simple and complex connections between TEs and structural variants. Putative insertions in SNP-depleted regions, which represent recently diverged identity-by-state blocks, suggest some TE families may still be active. However, our analysis reveals that within these recently diverged genomic regions, deletions of transposable elements likely account for more structural variation events and base pairs than insertions. These deletions are often large structural variants containing multiple transposable elements. Combined, our results highlight how transposable elements contribute to structural variation and demonstrate that deletion events are a major contributor to genomic differences.
  5. Larracuente, Amanda (Ed.)
    Abstract Chromosome size and morphology vary within and among species, but little is known about the proximate or ultimate causes of these differences. Cichlid fish species in the tribe Oreochromini share an unusual giant chromosome that is ∼3 times longer than the other chromosomes. This giant chromosome functions as a sex chromosome in some of these species. We test two hypotheses of how this giant sex chromosome may have evolved. The first hypothesis proposes that it evolved by accumulating repetitive elements as recombination was reduced around a dominant sex determination locus, as suggested by canonical models of sex chromosome evolution. An alternative hypothesis is that the giant sex chromosome originated via the fusion of an autosome with a highly repetitive B chromosome, one of which carried a sex determination locus. We test these hypotheses using comparative analysis of chromosome-scale cichlid and teleost genomes. We find that the giant sex chromosome consists of three distinct regions based on patterns of recombination, gene and transposable element content, and synteny to the ancestral autosome. The WZ sex determination locus encompasses the last ∼105 Mb of the 134-Mb giant chromosome. The last 47 Mb of the giant chromosome shares no obvious homology to any ancestral chromosome. Comparisons across 69 teleost genomes reveal that the giant sex chromosome contains unparalleled amounts of endogenous retroviral elements, immunoglobulin genes, and long noncoding RNAs. The results favor the B chromosome fusion hypothesis for the origin of the giant chromosome. 