skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Annotated genome of the Atlantic dog whelk, Nucella lapillus
Abstract Nucella lapillus is an important player in rocky shore food chains and has been a focal organism of ecological and evolutionary studies for decades. Despite poor dispersal, they have a broad geographic range, which makes them an ideal species to examine isolation by distance and selection across environmental gradients. Here we present the fully annotated genome of N. lapillus generated with Oxford Nanopore Techonology sequencing at ∼37× coverage. The genome assembly is 2.32 Gbp and consists of 2,525 contigs, with an N50 length of 2 Mbp. Repeat annotation identified 2,491 families that cover 67.56% of the genome, which is similar to other gastropods. Despite its large size and high proportion of repeats, the genome is of high quality. Benchmarking Universal Single-Copy Ortholog (BUSCO) analysis revealed a score of 96.8%. Functional annotation of the genome produced 45,848 protein-coding genes with a 96.6% BUSCO score. Genomic resources for mollusks lag behind that of other phyla, perhaps because many of their innate characteristics complicate DNA extraction, sequencing, and assembly. This new N. lapillus genome will increase our genomic understanding of the second largest phylum (and the most diverse class within said phylum) and serve as a key resource to advance studies on the organismal biology and population genetics of this iconic species as well as the connection between genomic variation and community-level processes.  more » « less
Award ID(s):
2017626 1924145
PAR ID:
10642496
Author(s) / Creator(s):
; ;
Editor(s):
Vogel, E
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
G3: Genes, Genomes, Genetics
Volume:
15
Issue:
10
ISSN:
2160-1836
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Carya glabra(2n= 4x= 64), also known as pignut hickory, is a widely distributed species in the walnut family (Juglandaceae). Native to the central and eastern United States and southeastern Canada,C. glabraplays an important ecological role as a common upland forest species; it is closely related to several economically valuable nut trees, includingC. illinoinensis(pecan). A deeper understanding of the genetics ofC. glabrais essential for studying its evolutionary history and biology, with potential implications for agricultural improvement of pecan. Here, we present the first nuclear genome assembly and annotation ofC. glabra. The assembly is chromosome-level and phased, representing the first assembled polyploid genome in the genusCarya. A total of 64 pseudochromosomes were assembled and phased into four haplotypes. The haplotype A assembly spans 600.4 Mb, comprises 55.0% repetitive sequences, and contains 30,947 protein-coding genes, with a BUSCO completeness score of 97.7%. Functional annotation assigned 94.3% of haplotype A genes to gene families, and 79.7% and 86.3% of genes were annotated with Gene Ontology terms and protein domains, respectively; 635 putative plant disease resistance genes were found in haplotype A. The other three haplotypes exhibited similarly high-quality annotation metrics. Our genomic analyses also suggest thatC. glabrais an autotetraploid. Comparative genomic analyses revealed high collinearity among the four haplotypes ofC. glabraand the published genomes of three otherCaryaspecies, although structural variation among the genomes of these species was identified. In addition, we provide an improved chloroplast genome assembly and the first mitochondrial genome forC. glabra. Importantly, most members of the research team are undergraduate students; the sequenced individual is located in McCarty Woods, a Conservation Area on the University of Florida campus. This work highlights the value of genome assembly efforts as powerful tools for teaching genomics and supporting conservation initiatives. This first high-quality reference genome forC. glabraprovides a valuable resource for studyingCarya, a genus of significant ecological and economic importance. Article summaryCarya glabra(pignut hickory) is a common upland forest species in North America. This species is a member of the walnut family (Juglandaceae), which includes many economically important nut trees. Here, we present the first nuclear genome assembly and annotation ofC. glabra. The assembly is chromosome-level and phased. The haplotype A assembly contains 30,947 protein-coding genes, with a BUSCO completeness score of 97.7%. Our genomic analyses suggest thatC. glabrais an autopolyploid. We also provide chloroplast and mitochondrial genome assemblies. This nuclear genome provides a valuable resource for studyingCarya, a genus of significant ecological and economic importance. 
    more » « less
  2. Jewel wasps in the genus of Nasonia are parasitoids with haplodiploidy sex determination, rapid development and are easy to culture in the laboratory. They are excellent models for insect genetics, genomics, epigenetics, development, and evolution. Nasonia vitripennis ( Nv ) and N. giraulti ( Ng ) are closely-related species that can be intercrossed, particularly after removal of the intracellular bacterium Wolbachia , which serve as a powerful tool to map and positionally clone morphological, behavioral, expression and methylation phenotypes. The Nv reference genome was assembled using Sanger, PacBio and Nanopore approaches and annotated with extensive RNA-seq data. In contrast, Ng genome is only available through low coverage resequencing. Therefore, de novo Ng assembly is in urgent need to advance this system. In this study, we report a high-quality Ng assembly using 10X Genomics linked-reads with 670X sequencing depth. The current assembly has a genome size of 259,040,977 bp in 3,160 scaffolds with 38.05% G-C and a 98.6% BUSCO completeness score. 97% of the RNA reads are perfectly aligned to the genome, indicating high quality in contiguity and completeness. A total of 14,777 genes are annotated in the Ng genome, and 72% of the annotated genes have a one-to-one ortholog in the Nv genome. We reported 5 million Ng-Nv SNPs which will facility mapping and population genomic studies in Nasonia . In addition, 42 Ng -specific genes were identified by comparing with Nv genome and annotation. This is the first de novo assembly for this important species in the Nasonia model system, providing a useful new genomic toolkit. 
    more » « less
  3. Fraser, Bonnie (Ed.)
    Abstract Over 400 million years old, scorpions represent an ancient group of arachnids and one of the first animals to adapt to life on land. Presently, the lack of available genomes within scorpions hinders research on their evolution. This study leverages ultralong nanopore sequencing and Pore-C to generate the first chromosome-level assembly and annotation for the desert hairy scorpion, Hadrurus arizonensis. The assembled genome is 2.23 Gb in size with an N50 of 280 Mb. Pore-C scaffolding reoriented 99.6% of bases into nine chromosomes and BUSCO identified 998 (98.6%) complete arthropod single copy orthologs. Repetitive elements represent 54.69% of the assembled bases, including 872,874 (29.39%) LINE elements. A total of 18,996 protein-coding genes and 75,256 transcripts were predicted, and extracted protein sequences yielded a BUSCO score of 97.2%. This is the first genome assembled and annotated within the family Hadruridae, representing a crucial resource for closing gaps in genomic knowledge of scorpions, resolving arachnid phylogeny, and advancing studies in comparative and functional genomics. 
    more » « less
  4. Abstract PremisePectocarya recurvata(Boraginaceae, subfamily Cynoglossoideae), a species native to the Sonoran Desert (North America), has served as a model system for a suite of ecological and evolutionary studies. However, no reference genomes are currently available in Cynoglossoideae. A high‐quality reference genome forP. recurvatawould be valuable for addressing questions in this system and across broader taxonomic scales. MethodsUsing PacBio HiFi sequencing, we assembled a reference genome forP. recurvataand annotated coding regions with full‐length transcripts from an Iso‐Seq library. We assessed genome completeness with BUSCO andk‐mer analysis, and estimated the genome size of six individuals using flow cytometry. ResultsThe chromosome‐scale genome assembly forP. recurvatawas 216.0 Mbp long (N50 = 12.1 Mbp). Previous observations indicatedP. recurvatais 2n = 24. Our assembly included 12 primary contigs (158.3 Mbp) containing 30,655 genes with telomeres at 23 out of 24 ends. Flow cytometry measurements from the same population included two plants with 1C = 196.9 Mbp, the smallest measured for Boraginaceae, and four with 1C = 385.8 Mbp, which is consistent with tetraploidy in this population. DiscussionTheP. recurvatagenome assembly and annotation provide a high‐quality genomic resource in a sparsely represented area of the angiosperm phylogeny. This new reference genome will facilitate answering open questions in ecophysiology, biogeography, and systematics. 
    more » « less
  5. Vogel, K (Ed.)
    Abstract The Hunt bumble bee, Bombus huntii, is a widely distributed pollinator in western North America. The species produces large colony sizes in captive rearing conditions, experiences low parasite and pathogen loads, and has been demonstrated to be an effective pollinator of tomatoes grown in controlled environment agriculture systems. These desirable traits have galvanized producer efforts to develop commercial Bombus huntii colonies for growers to deliver pollination services to crops. To better understand Bombus huntii biology and support population genetic studies and breeding decisions, we sequenced and assembled the Bombus huntii genome from a single haploid male. High-fidelity sequencing of the entire genome using PacBio, along with HiC sequencing, led to a comprehensive contig assembly of high continuity. This assembly was further organized into a chromosomal arrangement, successfully identifying 18 chromosomes spread across the 317.4 Mb assembly with a BUSCO score indicating 97.6% completeness. Synteny analysis demonstrates shared chromosome number (n = 18) with Bombus terrestris, a species belonging to a different subgenus, matching the expectation that presence of 18 haploid chromosomes is an ancestral trait at least between the subgenera Pyrobombus and Bombus sensu stricto. In conclusion, the assembly outcome, alongside the minimal tissue sampled destructively, showcases efficient techniques for producing a comprehensive, highly contiguous genome. 
    more » « less