skip to main content


Title: Haplogenome assembly reveals structural variation in Eucalyptus interspecific hybrids
Abstract Background

De novo phased (haplo)genome assembly using long-read DNA sequencing data has improved the detection and characterization of structural variants (SVs) in plant and animal genomes. Able to span across haplotypes, long reads allow phased, haplogenome assembly in highly outbred organisms such as forest trees. Eucalyptus tree species and interspecific hybrids are the most widely planted hardwood trees with F1 hybrids of Eucalyptus grandis and E. urophylla forming the bulk of fast-growing pulpwood plantations in subtropical regions. The extent of structural variation and its effect on interspecific hybridization is unknown in these trees. As a first step towards elucidating the extent of structural variation between the genomes of E. grandis and E. urophylla, we sequenced and assembled the haplogenomes contained in an F1 hybrid of the two species.

Findings

Using Nanopore sequencing and a trio-binning approach, we assembled the separate haplogenomes (566.7 Mb and 544.5 Mb) to 98.0% BUSCO completion. High-density SNP genetic linkage maps of both parents allowed scaffolding of 88.0% of the haplogenome contigs into 11 pseudo-chromosomes (scaffold N50 of 43.8 Mb and 42.5 Mb for the E. grandis and E. urophylla haplogenomes, respectively). We identify 48,729 SVs between the two haplogenomes providing the first detailed insight into genome structural rearrangement in these species. The two haplogenomes have similar gene content, 35,572 and 33,915 functionally annotated genes, of which 34.7% are contained in genome rearrangements.

Conclusions

Knowledge of SV and haplotype diversity in the two species will form the basis for understanding the genetic basis of hybrid superiority in these trees.

 
more » « less
Award ID(s):
1943371
PAR ID:
10491504
Author(s) / Creator(s):
; ; ; ; ;
Publisher / Repository:
GSA
Date Published:
Journal Name:
GigaScience
Volume:
12
ISSN:
2047-217X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Structural variants (SVs) can promote speciation by directly causing reproductive isolation or by suppressing recombination across large genomic regions. Whereas examples of each mechanism have been documented, systematic tests of the role of SVs in speciation are lacking. Here, we take advantage of long‐read (Oxford nanopore) whole‐genome sequencing and a hybrid zone between twoLycaeidesbutterfly taxa (L.melissaand Jackson HoleLycaeides) to comprehensively evaluate genome‐wide patterns of introgression for SVs and relate these patterns to hypotheses about speciation. We found >100,000 SVs segregating within or between the two hybridizing species. SVs and SNPs exhibited similar levels of genetic differentiation between species, with the exception of inversions, which were more differentiated. We detected credible variation in patterns of introgression among SV loci in the hybrid zone, with 562 of 1419 ancestry‐informative SVs exhibiting genomic clines that deviated from null expectations based on genome‐average ancestry. Overall, hybrids exhibited a directional shift towards Jackson HoleLycaeidesancestry at SV loci, consistent with the hypothesis that these loci experienced more selection on average than SNP loci. Surprisingly, we found that deletions, rather than inversions, showed the highest skew towards excess ancestry from Jackson HoleLycaeides. Excess Jackson HoleLycaeidesancestry in hybrids was also especially pronounced for Z‐linked SVs and inversions containing many genes. In conclusion, our results show that SVs are ubiquitous and suggest that SVs in general, but especially deletions, might disproportionately affect hybrid fitness and thus contribute to reproductive isolation.

     
    more » « less
  2. Purugganan, Michael (Ed.)
    Abstract Structural variants (SVs) are a largely unstudied feature of plant genome evolution, despite the fact that SVs contribute substantially to phenotypes. In this study, we discovered SVs across a population sample of 347 high-coverage, resequenced genomes of Asian rice (Oryza sativa) and its wild ancestor (O. rufipogon). In addition to this short-read data set, we also inferred SVs from whole-genome assemblies and long-read data. Comparisons among data sets revealed different features of genome variability. For example, genome alignment identified a large (∼4.3 Mb) inversion in indica rice varieties relative to japonica varieties, and long-read analyses suggest that ∼9% of genes from the outgroup (O. longistaminata) are hemizygous. We focused, however, on the resequencing sample to investigate the population genomics of SVs. Clustering analyses with SVs recapitulated the rice cultivar groups that were also inferred from SNPs. However, the site-frequency spectrum of each SV type—which included inversions, duplications, deletions, translocations, and mobile element insertions—was skewed toward lower frequency variants than synonymous SNPs, suggesting that SVs may be predominantly deleterious. Among transposable elements, SINE and mariner insertions were found at especially low frequency. We also used SVs to study domestication by contrasting between rice and O. rufipogon. Cultivated genomes contained ∼25% more derived SVs and mobile element insertions than O. rufipogon, indicating that SVs contribute to the cost of domestication in rice. Peaks of SV divergence were enriched for known domestication genes, but we also detected hundreds of genes gained and lost during domestication, some of which were enriched for traits of agronomic interest. 
    more » « less
  3. Vogel, K (Ed.)
    Abstract

    Coral species in the genus Acropora are key ecological components of coral reefs worldwide and represent the most diverse genus of scleractinian corals. While key species of Indo-Pacific Acropora have annotated genomes, no annotated genome has been published for either of the two species of Caribbean Acropora. Here we present the first fully annotated genome of the endangered Caribbean staghorn coral, Acropora cervicornis. We assembled and annotated this genome using high-fidelity nanopore long-read sequencing with gene annotations validated with mRNA sequencing. The assembled genome size is 318 Mb, with 28,059 validated genes. Comparative genomic analyses with other Acropora revealed unique features in A. cervicornis, including contractions in immune pathways and expansions in signaling pathways. Phylogenetic analysis confirms previous findings showing that A. cervicornis diverged from Indo-Pacific relatives around 41 million years ago, with the closure of the western Tethys Sea, prior to the primary radiation of Indo-Pacific Acropora. This new A. cervicornis genome enriches our understanding of the speciose Acropora and addresses evolutionary inquiries concerning speciation and hybridization in this diverse clade.

     
    more » « less
  4. SUMMARY

    Maples (the genusAcer) represent important and beloved forest, urban, and ornamental trees distributed throughout the Northern hemisphere. They exist in a diverse array of native ranges and distributions, across spectrums of tolerance or decline, and have varying levels of susceptibility to biotic and abiotic stress. AmongAcerspecies, several stand out in their importance to economic interest. Here we report the first two chromosome‐scale genomes for North American species,Acer negundoandAcer saccharum. Both assembled genomes contain scaffolds corresponding to 13 chromosomes, withA. negundoat a length of 442 Mb, an N50 of 32 Mb, and 30 491 genes, andA. saccharumat a length of 626 Mb, an N50 of 46 Mb, and 40 074 genes. No recent whole genome duplications were detected, thoughA. saccharumhas local gene duplication and more recent bursts of transposable elements, as well as a large‐scale translocation between two chromosomes. Genomic comparison revealed thatA. negundohas a smaller genome with recent gene family evolution that is predominantly contracted and expansions that are potentially related to invasive tendencies and tolerance to abiotic stress. Examination of RNA sequencing data obtained fromA. saccharumgiven long‐term aluminum and calcium soil treatments at the Hubbard Brook Experimental Forest provided insights into genes involved in the aluminum stress response at the systemic level, as well as signs of compromised processes upon calcium deficiency, a condition contributing to maple decline.

     
    more » « less
  5. Abstract

    In F1 hybrids, phenotypic values are expected to be near the parental means under additive effects or close to one parent under dominance. However, F1 traits can fall outside the parental range, and outbreeding depression occurs when inferior fitness is observed in hybrids. Another possible outcome is heterosis, a phenomenon that interspecific hybrids or intraspecific crossbred F1s exhibit improved fitness compared to both parental species or strains. As an application of heterosis, hybrids between channel catfish females and blue catfish males are superior in feed conversion efficiency, carcass yield, and harvestability. Over 20 years of hybrid catfish production in experimental settings and farming practices generated abundant phenotypic data, making it an ideal system to investigate heterosis. In this study, we characterized fitness in terms of growth and survival longitudinally, revealing environment-dependent heterosis. In ponds, hybrids outgrow both parents due to an extra rapid growth phase of 2–4 months in year 2. This bimodal growth pattern is unique to F1 hybrids in pond culture environments only. In sharp contrast, the same genetic types cultured in tanks display outbreeding depression, where hybrids perform poorly, while channel catfish demonstrate superiority in growth throughout development. Our findings represent the first example, known to the authors, of opposite fitness shifts in response to environmental changes in interspecific vertebrate hybrids, suggesting a broader fitness landscape for F1 hybrids. Future genomic studies based on this experiment will help understand genome-environment interaction in shaping the F1 progeny fitness in the scenario of environment-dependent heterosis and outbreeding depression.

     
    more » « less