skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Unveiling the Genetic Blueprint of a Desert Scorpion: A Chromosome-level Genome of Hadrurus arizonensis Provides the First Reference for Parvorder Iurida
Abstract Over 400 million years old, scorpions represent an ancient group of arachnids and one of the first animals to adapt to life on land. Presently, the lack of available genomes within scorpions hinders research on their evolution. This study leverages ultralong nanopore sequencing and Pore-C to generate the first chromosome-level assembly and annotation for the desert hairy scorpion, Hadrurus arizonensis. The assembled genome is 2.23 Gb in size with an N50 of 280 Mb. Pore-C scaffolding reoriented 99.6% of bases into nine chromosomes and BUSCO identified 998 (98.6%) complete arthropod single copy orthologs. Repetitive elements represent 54.69% of the assembled bases, including 872,874 (29.39%) LINE elements. A total of 18,996 protein-coding genes and 75,256 transcripts were predicted, and extracted protein sequences yielded a BUSCO score of 97.2%. This is the first genome assembled and annotated within the family Hadruridae, representing a crucial resource for closing gaps in genomic knowledge of scorpions, resolving arachnid phylogeny, and advancing studies in comparative and functional genomics.  more » « less
Award ID(s):
2217100 1943371
PAR ID:
10534284
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ;
Editor(s):
Fraser, Bonnie
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Genome Biology and Evolution
Volume:
16
Issue:
5
ISSN:
1759-6653
Page Range / eLocation ID:
evae097
Subject(s) / Keyword(s):
scorpion arachnid Hadruridae reference genome pore-c nanopore
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Vogel, K (Ed.)
    Abstract We present the first chromosome-level genome assembly for Bombus pensylvanicus, a historically widespread native pollinator species that was distributed across eastern North America but has subsequently undergone declines in range area and local relative abundance. This species has been of significant interest as a model for understanding both patterns and possible causes of bumble bee decline in the region, including the role of genetic variation. Here we present a chromosome-level reference genome assembled using Pacific Biosciences singe-molecule HiFi sequences and Hi-C data and annotated using evidence derived from RNA sequencing of multiple tissue types. The B. pensylvanicus genome has a total length of ∼352.6 Mb and was assembled into a total of 224 scaffolds, with 19 primary pseudomolecules representing putative chromosomes and an N50 = 14.872 Mb. Annotation with the Eukaryotic Genome Annotation Pipeline—External (EGAPx) identified 11,411 genes (10,263 protein coding), and BUSCO analysis of 5,991 Hymenoptera-specific BUSCO groups indicated a completeness for the proteins of 99.0% (98.6% single-copy, 0.5% duplicated) and for the genome of 98.5% (98.2% single-copy, 0.3% duplicated). We present synteny analyses with other recently assembled Bombus genomes representing different subgenera and examine the distribution of repetitive regions of the genome relative to the distribution of genes and noncoding RNAs. 
    more » « less
  2. Abstract Raphidioptera (snakeflies) are a holometabolan order with the least species diversity but play a pivotal role in understanding the origin of complete metamorphosis. Here, we provide an annotated, chromosome-level reference genome assembly for an Asian endemic snakeflyMongoloraphidia duomilia(Yang, 1998) of the family Raphidiidae, assembled using PacBio HiFi and Hi-C data from female specimens. The resulting assembly is 653.56 Mb, of which 97.90% is anchored into 13 chromosomes. The scaffold N50 is 53.50 Mb, and BUSCO completeness is 97.80%. Repetitive elements comprise 64.31% of the genome (366.04 Mb). We identified 599 noncoding RNAs and predicted 11,141 protein-coding genes in the genome (97.70% BUSCO completeness). The new snakefly genome will facilitate comparison of genome architecture across Neuropterida and Holometabola and shed light on the ecological and evolutionary transitions between Neuropterida and Coleopterida. 
    more » « less
  3. Abstract Fall webworm (Hyphantria cunea) is a widespread, highly polyphagous moth in the family Erebidae, whose native range spans much of North America and invasive range includes Asia and Europe. The species uses over 600 plant species as a larval host, making it among the most generalized insect herbivores described. Its variable host use, wide range, and genetic diversity make fall webworm an attractive emerging model system for the study of diet breadth, but studies have been limited by the lack of a high-quality annotated reference genome. Here we report an annotated, chromosome-scale genome of much improved continuity and completeness over the previously available unannotated fall webworm reference genome. We used PacBioHiFi long reads and Omni-C proximity ligation sequencing technology to produce a de novo assembled genome. Our genome assembly, the first for any species in the genus and third in the family, contains 321 scaffolds spanning 0.572 gigabases with a N50 of 14.6 Mb and BUSCO complete score of 99.1%. This genome will represent a valuable resource for research into the ecology, evolution, and genetics of dietary generalism and diet breadth in insect herbivores. 
    more » « less
  4. Reinke, Valerie (Ed.)
    Abstract As an entomopathogenic nematode (EPN), Steinernema hermaphroditum parasitizes insect hosts and harbors symbiotic Xenorhabdus griffinae bacteria. In contrast to other Steinernematids, S. hermaphroditum has hermaphroditic genetics, offering the experimental scope found in Caenorhabditis elegans. To enable study of S. hermaphroditum, we have assembled and analyzed its reference genome. This genome assembly has five chromosomal scaffolds and 83 unassigned scaffolds totaling 90.7 Mb, with 19,426 protein-coding genes having a BUSCO completeness of 88.0%. Its autosomes show higher densities of strongly conserved genes in their centers, as in C. elegans, but repetitive elements are evenly distributed along all chromosomes, rather than with higher arm densities as in C. elegans. Either when comparing protein motif frequencies between nematode species or when analyzing gene family expansions during nematode evolution, we observed two categories of genes preferentially associated with the origin of Steinernema or S. hermaphroditum: orthologs of venom genes in S. carpocapsae or S. feltiae; and some types of chemosensory G protein-coupled receptors, despite the tendency of parasitic nematodes to have reduced numbers of chemosensory genes. Three-quarters of venom orthologs occurred in gene clusters, with the larger clusters comprising functionally diverse gene groups rather than paralogous repeats of a single venom gene. While assembling S. hermaphroditum, we coassembled bacterial genomes, finding sequence data for not only the known symbiont, X. griffinae, but also for eight other bacterial genera. All eight genera have previously been observed to be associated with Steinernema species or the EPN Heterorhabditis, and may constitute a second bacterial circle of EPNs. 
    more » « less
  5. Abstract We present the first long-read de novo assembly and annotation of the luna moth (Actias luna) and provide the full characterization of heavy chain fibroin (h-fibroin), a long and highly repetitive gene (>20 kb) essential in silk fiber production. There are >160,000 described species of moths and butterflies (Lepidoptera), but only within the last 5 years have we begun to recover high-quality annotated whole genomes across the order that capture h-fibroin. Using PacBio HiFi reads, we produce the first high-quality long-read reference genome for this species. The assembled genome has a length of 532 Mb, a contig N50 of 16.8 Mb, an L50 of 14 contigs, and 99.4% completeness (BUSCO). Our annotation using Bombyx mori protein and A. luna RNAseq evidence captured a total of 20,866 genes at 98.9% completeness with 10,267 functionally annotated proteins and a full-length h-fibroin annotation of 2,679 amino acid residues. 
    more » « less