skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Chromosome Level Genome Assembly and Annotation of Highly Invasive Japanese Stiltgrass ( Microstegium vimineum )
Abstract The invasive Japanese stiltgrass (Microstegium vimineum) affects a wide range of ecosystems and threatens biodiversity across the eastern USA. However, the mechanisms underlying rapid adaptation, plasticity, and epigenetics in the invasive range are largely unknown. We present a chromosome-level assembly for M. vimineum to investigate genome dynamics, evolution, adaptation, and the genomics of phenotypic plasticity. We generated a 1.12-Gb genome with scaffold N50 length of 53.44 Mb respectively, taking a de novo assembly approach that combined PacBio and Dovetail Genomics Omni-C sequencing. The assembly contains 23 pseudochromosomes, representing 99.96% of the genome. BUSCO assessment indicated that 80.3% of Poales gene groups are present in the assembly. The genome is predicted to contain 39,604 protein-coding genes, of which 26,288 are functionally annotated. Furthermore, 66.68% of the genome is repetitive, of which unclassified (35.63%) and long-terminal repeat (LTR) retrotransposons (26.90%) are predominant. Similar to other grasses, Gypsy (41.07%) and Copia (32%) are the most abundant LTR-retrotransposon families. The majority of LTR-retrotransposons are derived from a significant expansion in the past 1–2 Myr, suggesting the presence of relatively young LTR-retrotransposon lineages. We find corroborating evidence from Ks plots for a stiltgrass-specific duplication event, distinct from the more ancient grass-specific duplication event. The assembly and annotation of M. vimineum will serve as an essential genomic resource facilitating studies of the invasion process, the history and consequences of polyploidy in grasses, and provides a crucial tool for natural resource managers.  more » « less
Award ID(s):
1726534 1920858
PAR ID:
10314863
Author(s) / Creator(s):
; ; ; ; ;
Editor(s):
Eyre-Walker, Adam
Date Published:
Journal Name:
Genome Biology and Evolution
Volume:
13
Issue:
11
ISSN:
1759-6653
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. na (Ed.)
    Long Terminal Repeat (LTR) retrotransposons are a class of repetitive elements that are widespread in the genomes of plants and many fungi. LTR retrotransposons have been associated with rapidly evolving gene clusters in plants and virulence factor transfer in fungal-plant parasite-host interactions. We report here the abundance and transcriptional activity of LTR retrotransposons across several species of the early-branching Neocallimastigomycota, otherwise known as the anaerobic gut fungi (AGF). The ubiquity of LTR retrotransposons in these genomes suggests key evolutionary roles in these rumen-dwelling biomass degraders, whose genomes also contain many enzymes that are horizontally transferred from other rumen-dwelling prokaryotes. Up to 10% of anaerobic fungal genomes consist of LTR retrotransposons, and the mapping of sequences from LTR retrotransposons to transcriptomes shows that the majority of clusters are transcribed, with some exhibiting expression greater than 104 reads per kilobase million mapped reads (rpkm). Many LTR retrotransposons are strongly differentially expressed upon heat stress during fungal cultivation, with several exhibiting a nearly three-log10 fold increase in expression, whereas growth substrate variation modulated transcription to a lesser extent. We show that some LTR retrotransposons contain carbohydrate-active enzymes (CAZymes), and the expansion of CAZymes within genomes and among anaerobic fungal species may be linked to retrotransposon activity. We further discuss how these widespread sequences may be a source of promoters and other parts towards the bioengineering of anaerobic fungi. 
    more » « less
  2. Abstract Background The hard clam Mercenaria mercenaria is a major marine resource along the Atlantic coasts of North America and has been introduced to other continents for resource restoration or aquaculture activities. Significant mortality events have been reported in the species throughout its native range as a result of diseases (microbial infections, leukemia) and acute environmental stress. In this context, the characterization of the hard clam genome can provide highly needed resources to enable basic (e.g., oncogenesis and cancer transmission, adaptation biology) and applied (clam stock enhancement, genomic selection) sciences. Results Using a combination of long and short-read sequencing technologies, a 1.86 Gb chromosome-level assembly of the clam genome was generated. The assembly was scaffolded into 19 chromosomes, with an N50 of 83 Mb. Genome annotation yielded 34,728 predicted protein-coding genes, markedly more than the few other members of the Venerida sequenced so far, with coding regions representing only 2% of the assembly. Indeed, more than half of the genome is composed of repeated elements, including transposable elements. Major chromosome rearrangements were detected between this assembly and another recent assembly derived from a genetically segregated clam stock. Comparative analysis of the clam genome allowed the identification of a marked diversification in immune-related proteins, particularly extensive tandem duplications and expansions in tumor necrosis factors (TNFs) and C1q domain-containing proteins, some of which were previously shown to play a role in clam interactions with infectious microbes. The study also generated a comparative repertoire highlighting the diversity and, in some instances, the specificity of LTR-retrotransposons elements, particularly Steamer elements in bivalves. Conclusions The diversity of immune molecules in M. mercenaria may allow this species to cope with varying and complex microbial and environmental landscapes. The repertoire of transposable elements identified in this study, particularly Steamer elements, should be a prime target for the investigation of cancer cell development and transmission among bivalve mollusks. 
    more » « less
  3. Abstract Mimulus laciniatus (syn. Erythranthe lacinata) is an annual plant endemic to the Sierra Nevada region of California. Mimulus laciniatus is notable for its specialized ecological niche, thriving in granite outcrops of alpine environments characterized by shallow soils that dry out rapidly as the snowpack is exhausted during season-ending droughts. Due to its narrow habitat range and sensitivity to environmental change, this species serves as an important model for studying adaptation and survival in marginal habitats. As part of the California Conservation Genomics Project, here we report the sequencing and assembly of a high-quality nuclear genome and chloroplast genome of M. laciniatus. The primary assembly is 309.96 Mb and consists of 104 scaffolds with a scaffold N50 of 20.99 Mb, a largest contig size of 24.29 Mb and a contig N50 of 11.09 Mb, The alternate haplotype assembly consists of 194 scaffolds spanning 213.84 Mb. BUSCO completeness of the primary assembly is 98.6%. This high quality genome adds a valuable resource to the expanding collection of sequenced genomes of the monkeyflowers (Mimulus sensu lato), which have become a model clade for studying ecological adaptation, speciation, and evolutionary genetics. 
    more » « less
  4. Haplotype-level allelic characterization facilitates research on the functional, evolutionary and breeding-related features of extremely large and complex plant genomes. We report a 21.7-Gb chromosome-level haplotype-resolved assembly in Pinus densiflora. We found genome rearrangements involving translocations and inversions between chromosomes 1 and 3 of Pinus species and a proliferation of specific long terminal repeat (LTR) retrotransposons (LTR-RTs) in P. densiflora. Evolutionary analyses illustrated that tandem and LTR-RT-mediated duplications led to an increment of transcription factor (TF) genes in P. densiflora. The haplotype sequence comparison showed allelic imbalances, including presence–absence variations of genes (PAV genes) and their functional contributions to flowering and abiotic stress-related traits in P. densiflora. Allele-aware resequencing analysis revealed PAV gene diversity across P. densiflora accessions. Our study provides insights into key mechanisms underlying the evolution of genome structure, LTR-RTs and TFs within the Pinus lineage as well as allelic imbalances and diversity across P. densiflora. 
    more » « less
  5. Abstract Eukaryotic retroelements are generally divided into two classes: long terminal repeat (LTR) retrotransposons and non-LTR retrotransposons. A third class of eukaryotic retroelement, the Penelope-like elements (PLEs), has been well-characterized bioinformatically, but relatively little is known about the transposition mechanism of these elements. PLEs share some features with the R2 retrotransposon fromBombyx mori, which uses a target-primed reverse transcription (TPRT) mechanism, but their distinct phylogeny suggests PLEs may utilize a novel mechanism of mobilization. Using protein purified fromE. coli, we report unique in vitro properties of a PLE from the green anole (Anolis carolinensis), revealing mechanistic aspects not shared by other retrotransposons. We found that reverse transcription is initiated at two adjacent sites within the transposon RNA that is not homologous to the cleaved DNA, a feature that is reflected in the genomic “tail” signature shared between and unique to PLEs. Our results for the first active PLE in vitro provide a starting point for understanding PLE mobilization and biology. 
    more » « less