- Award ID(s):
- 1942437
- PAR ID:
- 10399409
- Editor(s):
- Slotte, Tanja
- Date Published:
- Journal Name:
- Genome Biology and Evolution
- ISSN:
- 1759-6653
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
Abstract Senna tora is a widely used medicinal plant. Its health benefits have been attributed to the large quantity of anthraquinones, but how they are made in plants remains a mystery. To identify the genes responsible for plant anthraquinone biosynthesis, we reveal the genome sequence ofS. tora at the chromosome level with 526 Mb (96%) assembled into 13 chromosomes. Comparison among related plant species shows that a chalcone synthase-like (CHS-L) gene family has lineage-specifically and rapidly expanded inS. tora . Combining genomics, transcriptomics, metabolomics, and biochemistry, we identify a CHS-L gene contributing to the biosynthesis of anthraquinones. TheS. tora reference genome will accelerate the discovery of biologically active anthraquinone biosynthesis pathways in medicinal plants. -
Within the arachnids, chromosome-level genome assemblies have greatly accelerated the understanding of gene family evolution and developmental genomics in key groups, such as spiders (Araneae), mites and ticks (Acariformes and Parasitiformes). Among other poorly studied arachnid orders that lack genome assemblies altogether are the clade Pedipalpi, which is comprised of three orders that form the sister group of spiders, which diverged over 400 Mya. We close this gap by generating the first chromosome-level assembly from a single specimen of the vinegaroon Mastigoproctus giganteus (Uropygi). We show that this highly complete genome retains plesiomorphic conditions for many gene families that have undergone lineage-specific derivations within the more diverse spiders. Consistent with the phylogenetic position of Uropygi, macrosynteny in the M. giganteus genome substantiates the signature of an ancient whole genome duplication.more » « less
-
Summary White oak (
Quercus alba ) is an abundant forest tree species across eastern North America that is ecologically, culturally, and economically important.We report the first haplotype‐resolved chromosome‐scale genome assembly of
Q. alba and conduct comparative analyses of genome structure and gene content against other published Fagaceae genomes. We investigate the genetic diversity of this widespread species and the phylogenetic relationships among oaks using whole genome data.Despite strongly conserved chromosome synteny and genome size across
Quercus , certain gene families have undergone rapid changes in size, including defense genes. Unbiased annotation of resistance (R) genes across oaks revealed that the overall number of R genes is similar across species – as are the chromosomal locations of R gene clusters – but, gene number within clusters is more labile. We found thatQ. alba has high genetic diversity, much of which predates its divergence from other oaks and likely impacts divergence time estimations. Our phylogenetic results highlight widespread phylogenetic discordance across the genus.The white oak genome represents a major new resource for studying genome diversity and evolution in
Quercus . Additionally, we show that unbiased gene annotation is key to accurately assessing R gene evolution inQuercus . -
Abstract As a model organism for studies of cell and environmental biology, the free‐living and cosmopolitan ciliate
Euplotes vannus shows intriguing features like dual genome architecture (i.e., separate germline and somatic nuclei in each cell/organism), “gene‐sized” chromosomes, stop codon reassignment, programmed ribosomal frameshifting (PRF) and strong resistance to environmental stressors. However, the molecular mechanisms that account for these remarkable traits remain largely unknown. Here we report a combined analysis of de novo assembled high‐quality macronuclear (MAC; i.e., somatic) and partial micronuclear (MIC; i.e., germline) genome sequences forE. vannus , and transcriptome profiling data under varying conditions. The results demonstrate that: (a) the MAC genome contains more than 25,000 complete “gene‐sized” nanochromosomes (~85 Mb haploid genome size) with the N50 ~2.7 kb; (b) although there is a high frequency of frameshifting at stop codons UAA and UAG, we did not observe impaired transcript abundance as a result of PRF in this species as has been reported for other euplotids; (c) the sequence motif 5′‐TA‐3′ is conserved at nearly all internally‐eliminated sequence (IES) boundaries in the MIC genome, and chromosome breakage sites (CBSs) are duplicated and retained in the MAC genome; (d) by profiling the weighted correlation network of genes in the MAC under different environmental stressors, including nutrient scarcity, extreme temperature, salinity and the presence of ammonia, we identified gene clusters that respond to these external physical or chemical stimulations, and (e) we observed a dramatic increase in HSP70 gene transcription under salinity and chemical stresses but surprisingly, not under temperature changes; we link this temperature‐resistance to the evolved loss of temperature stress‐sensitive elements in regulatory regions. Together with the genome resources generated in this study, which are available online atEuplotes vannus Genome Database (http://evan.ciliate.org ), these data provide molecular evidence for understanding the unique biology of highly adaptable microorganisms. -
Abstract Background Teak, a member of the Lamiaceae family, produces one of the most expensive hardwoods in the world. High demand coupled with deforestation have caused a decrease in natural teak forests, and future supplies will be reliant on teak plantations. Hence, selection of teak tree varieties for clonal propagation with superior growth performance is of great importance, and access to high-quality genetic and genomic resources can accelerate the selection process by identifying genes underlying desired traits.
Findings To facilitate teak research and variety improvement, we generated a highly contiguous, chromosomal-scale genome assembly using high-coverage Pacific Biosciences long reads coupled with high-throughput chromatin conformation capture. Of the 18 teak chromosomes, we generated 17 near-complete pseudomolecules with one chromosome present as two chromosome arm scaffolds. Genome annotation yielded 31,168 genes encoding 46,826 gene models, of which, 39,930 and 41,155 had Pfam domain and expression evidence, respectively. We identified 14 clusters of tandem-duplicated terpene synthases (TPSs), genes central to the biosynthesis of terpenes, which are involved in plant defense and pollinator attraction. Transcriptome analysis revealed 10 TPSs highly expressed in woody tissues, of which, 8 were in tandem, revealing the importance of resolving tandemly duplicated genes and the quality of the assembly and annotation. We also validated the enzymatic activity of four TPSs to demonstrate the function of key TPSs.
Conclusions In summary, this high-quality chromosomal-scale assembly and functional annotation of the teak genome will facilitate the discovery of candidate genes related to traits critical for sustainable production of teak and for anti-insecticidal natural products.