This content will become publicly available on August 11, 2026

Title: New reference genome assembly for the declining American Bumble Bee, Bombus pensylvanicus
Abstract We present the first chromosome-level genome assembly for Bombus pensylvanicus, a historically widespread native pollinator species that was distributed across eastern North America but has subsequently undergone declines in range area and local relative abundance. This species has been of significant interest as a model for understanding both patterns and possible causes of bumble bee decline in the region, including the role of genetic variation. Here we present a chromosome-level reference genome assembled using Pacific Biosciences single-molecule HiFi sequences and Hi-C data and annotated using evidence derived from RNA sequencing of multiple tissue types. The B. pensylvanicus genome has a total length of ∼352.6 Mb and was assembled into a total of 224 scaffolds, with 19 primary pseudomolecules representing putative chromosomes and an N50 = 14.872 Mb. Annotation with the Eukaryotic Genome Annotation Pipeline—External (EGAPx) identified 11,411 genes (10,263 protein coding), and BUSCO analysis of 5,991 Hymenoptera-specific BUSCO groups indicated a completeness for the proteins of 99.0% (98.6% single-copy, 0.5% duplicated) and for the genome of 98.5% (98.2% single-copy, 0.3% duplicated). We present synteny analyses with other recently assembled Bombus genomes representing different subgenera and examine the distribution of repetitive regions of the genome relative to the distribution of genes and noncoding RNAs.
Award ID(s):
2126418
PAR ID:
10649058
Author(s) / Creator(s):
; ; ; ; ; ; ;
Editor(s):
Vogel, K
Publisher / Repository:
G3: Genes, Genomes, Genetics
Date Published:
Journal Name:
G3: Genes, Genomes, Genetics
Volume:
15
Issue:
10
ISSN:
2160-1836
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract The Diaprepes root weevil (DRW), Diaprepes abbreviatus, is a broadly polyphagous invasive pest of agriculture in the southern United States and the Caribbean. Its genome was sequenced, assembled, and annotated to study genomic correlates of specialized plant-feeding and invasiveness and to facilitate the development of new methods for DRW control. The 1.69 Gb D. abbreviatus genome assembly was distributed across 653 contigs, with an N50 of 7.8 Mb and the largest contig of 62 Mb. Most of the genome was comprised of repetitive sequences, with 66.17% in transposable elements, 5.75% in macrosatellites, and 2.06% in microsatellites. Most expected orthologous genes were present and fully assembled, with 99.5% of BUSCO genes present and 1.5% duplicated. One hundred and nine contigs (27.19 Mb) were identified as putative fragments of the X and Y sex chromosomes, and homology assessment with other beetle X chromosomes indicated a possible sex chromosome turnover event. Genome annotation identified 18,412 genes, including 43 putative horizontally transferred (HT) loci. Notably, 258 genes were identified from gene families known to encode plant cell wall degrading enzymes and invertases, including carbohydrate esterases, polysaccharide lyases, and glycoside hydrolases (GH). GH genes were unusually numerous, with 239 putative genes representing 19 GH families. Interestingly, several other beetle species with large numbers of GH genes are (like D. abbreviatus) successful invasive pests of agriculture or forestry. 
  2. Fraser, Bonnie (Ed.)
    Abstract Over 400 million years old, scorpions represent an ancient group of arachnids and one of the first animals to adapt to life on land. Presently, the lack of available genomes within scorpions hinders research on their evolution. This study leverages ultralong nanopore sequencing and Pore-C to generate the first chromosome-level assembly and annotation for the desert hairy scorpion, Hadrurus arizonensis. The assembled genome is 2.23 Gb in size with an N50 of 280 Mb. Pore-C scaffolding reoriented 99.6% of bases into nine chromosomes and BUSCO identified 998 (98.6%) complete arthropod single copy orthologs. Repetitive elements represent 54.69% of the assembled bases, including 872,874 (29.39%) LINE elements. A total of 18,996 protein-coding genes and 75,256 transcripts were predicted, and extracted protein sequences yielded a BUSCO score of 97.2%. This is the first genome assembled and annotated within the family Hadruridae, representing a crucial resource for closing gaps in genomic knowledge of scorpions, resolving arachnid phylogeny, and advancing studies in comparative and functional genomics. 
  3. Lavrov, Dennis (Ed.)
    Abstract The painted lady butterfly, Vanessa cardui, has the longest migration routes, the widest hostplant diversity, and one of the most complex wing patterns of any insect. Due to minimal culturing requirements, easily characterized wing pattern elements, and technical feasibility of CRISPR/Cas9 genome editing, V. cardui is emerging as a functional genomics model for diverse research programs. Here, we report a high-quality, annotated genome assembly of the V. cardui genome, generated using 84× coverage of PacBio long-read data, which we assembled into 205 contigs with a total length of 425.4 Mb (N50 = 10.3 Mb). The genome was very complete (single-copy complete Benchmarking Universal Single-Copy Orthologs [BUSCO] 97%), with contigs assembled into presumptive chromosomes using synteny analyses. Our annotation used embryonic, larval, and pupal transcriptomes, and 20 transcriptomes across five different wing developmental stages. Gene annotations showed a high level of accuracy and completeness, with 14,437 predicted protein-coding genes. This annotated genome assembly constitutes an important resource for diverse functional genomic studies ranging from the developmental genetic basis of butterfly color pattern, to coevolution with diverse hostplants. 
  4. Abstract Genome-wide information has so far been unavailable for ribbon worms of the clade Hoplonemertea, the most species-rich class within the phylum Nemertea. While species within Pilidiophora, the sister clade of Hoplonemertea, possess a pilidium larval stage and lack stylets on their proboscis, Hoplonemertea species have a planuliform larva and are armed with stylets employed for the injection of toxins into their prey. To further compare these developmental, physiological, and behavioral differences from a genomic perspective, the availability of a reference genome for a Hoplonemertea species is crucial. Such data will be highly useful for future investigations toward a better understanding of molecular ecology, venom evolution, and regeneration not only in Nemertea but also in other marine invertebrate phyla. To this end, we herein present the annotated chromosome-level genome assembly for Emplectonema gracile (Nemertea; Hoplonemertea; Monostilifera; Emplectonematidae), an easily collected nemertean well suited for laboratory experimentation. The genome has an assembly size of 157.9 Mb. Hi-C scaffolding yielded chromosome-level scaffolds, with a scaffold N50 of 10.0 Mb and a score of 95.1% for complete BUSCO genes found as a single copy. Annotation predicted 20,684 protein-coding genes. The high-quality reference genome reaches an Earth BioGenome standard level of 7.C.Q50. 
  5. Wheat, Christopher (Ed.)
    Abstract The blackstripe livebearer Poeciliopsis prolifica is a live-bearing fish belonging to the family Poeciliidae with a high level of postfertilization maternal investment (matrotrophy). This viviparous matrotrophic species has evolved a structure similar to the mammalian placenta. Placentas have independently evolved multiple times in Poeciliidae from nonplacental ancestors, which provides an opportunity to study placental evolution. However, there is a lack of high-quality reference genomes for the placental species in Poeciliidae. In this study, we present a 674 Mb assembly of P. prolifica in 504 contigs with excellent continuity (contig N50 7.7 Mb) and completeness (97.2% Benchmarking Universal Single-Copy Orthologs [BUSCO] completeness score, including 92.6% single-copy and 4.6% duplicated BUSCO score). A total of 27,227 protein-coding genes were annotated from the merged datasets based on bioinformatic prediction, RNA sequencing and homology evidence. Phylogenomic analyses revealed that P. prolifica diverged from the guppy (Poecilia reticulata) ∼19 Ma. Our research provides the necessary resources and the genomic toolkit for investigating the genetic underpinning of placentation.