- Award ID(s):
- 1655624
- NSF-PAR ID:
- 10292547
- Editor(s):
- Sethuraman, A
- Date Published:
- Journal Name:
- G3 Genes|Genomes|Genetics
- ISSN:
- 2160-1836
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
Abstract The diatom, Cyclotella cryptica, is a well-established model species for physiological studies and biotechnology applications of diatoms. To further facilitate its use as a model diatom, we report an improved reference genome assembly and annotation for C. cryptica strain CCMP332. We used a combination of long- and short-read sequencing to assemble a high-quality and contaminant-free genome. The genome is 171 Mb in size and consists of 662 scaffolds with a scaffold N50 of 494 kb. This represents a 176-fold decrease in scaffold number and 41-fold increase in scaffold N50 compared to the previous assembly. The genome contains 21,250 predicted genes, 75% of which were assigned putative functions. Repetitive DNA comprises 59% of the genome, and an improved classification of repetitive elements indicated that a historically steady accumulation of transposable elements has contributed to the relatively large size of the C. cryptica genome. The high-quality C. cryptica genome will serve as a valuable reference for ecological, genetic, and biotechnology studies of diatoms.more » « less
-
Abstract Penstemon is the most speciose flowering plant genus endemic to North America. Penstemon species’ diverse morphology and adaptation to various environments have made them a valuable model system for studying evolution. Here, we report the first full reference genome assembly and annotation for Penstemon davidsonii. Using PacBio long-read sequencing and Hi-C scaffolding technology, we constructed a de novo reference genome of 437,568,744 bases, with a contig N50 of 40 Mb and L50 of 5. The annotation includes 18,199 gene models, and both the genome and transcriptome assembly contain over 95% complete eudicot BUSCOs. This genome assembly will serve as a valuable reference for studying the evolutionary history and genetic diversity of the Penstemon genus.
-
Wheat, Christopher (Ed.)
Abstract Echinometra lucunter, the rock-boring sea urchin, is a widely distributed echinoid and a model for ecological studies of reproduction, responses to climate change, and speciation. We present a near chromosome-level genome assembly of E. lucunter, including 21 scaffolds larger than 10 Mb predicted to represent each of the chromosomes of the species. The 760.4 Mb assembly includes a scaffold N50 of 30.0 Mb and BUSCO (benchmarking universal single-copy orthologue) single copy and a duplicated score of 95.8% and 1.4%, respectively. Ab-initio gene model prediction and annotation with transcriptomic data constructed 33,989 gene models composing 50.4% of the assembly, including 37,036 transcripts. Repetitive elements make up approximately 39.6% of the assembly, and unresolved gap sequences are estimated to be 0.65%. Whole genome alignment with Echinometra sp. EZ revealed high synteny and conservation between the two species, further bolstering Echinometra as an emerging genus for comparative genomics studies. This genome assembly represents a high-quality genomic resource for future evolutionary and developmental studies of this species and more broadly of echinoderms.
-
Abstract The angiosperm genus Silene is a model system for several traits of ecological and evolutionary significance in plants, including breeding system and sex chromosome evolution, host-pathogen interactions, invasive species biology, heavy metal tolerance, and cytonuclear interactions. Despite its importance, genomic resources for this large genus of approximately 850 species are scarce, with only one published whole-genome sequence (from the dioecious species Silene latifolia). Here, we provide genomic and transcriptomic resources for a hermaphroditic representative of this genus (S. noctiflora), including a PacBio Iso-Seq transcriptome, which uses long-read, single-molecule sequencing technology to analyze full-length mRNA transcripts. Using these data, we have assembled and annotated high-quality full-length cDNA sequences for approximately 14,126 S. noctiflora genes and 25,317 isoforms. We demonstrated the utility of these data to distinguish between recent and highly similar gene duplicates by identifying novel paralogous genes in an essential protease complex. Furthermore, we provide a draft assembly for the approximately 2.7-Gb genome of this species, which is near the upper range of genome-size values reported for diploids in this genus and threefold larger than the 0.9-Gb genome of Silene conica, another species in the same subgenus. Karyotyping confirmed that S. noctiflora is a diploid, indicating that its large genome size is not due to polyploidization. These resources should facilitate further study and development of this genus as a model in plant ecology and evolution.more » « less
-
Wheat, Christopher (Ed.)
Abstract Paper wasps are a model system for the study of social evolution due to a high degree of inter- and intraspecific variation in cooperation, aggression, and visual signals of social status. Increasing the taxonomic coverage of genomic resources for this diverse clade will aid comparative genomic approaches for testing predictions about the molecular basis of social evolution. Here, we provide draft genome assemblies for two well-studied species of paper wasps, Polistes exclamans and Mischocyttarus mexicanus. The P. exclamans genome assembly is 221.5 Mb in length with a scaffold N50 of 4.11 Mb. The M. mexicanus genome assembly is 227 Mb in length with a scaffold N50 of 1.1 Mb. Genomes have low repeat content (9.54–10.75%) and low GC content (32.06–32.4%), typical of other social hymenopteran genomes. The DNA methyltransferase gene, Dnmt3 , was lost early in the evolution of Polistinae. We identified a second independent loss of Dnmt3 within hornets (genus: Vespa).