skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Unveiling the Genetic Blueprint of a Desert Scorpion: A Chromosome-level Genome of Hadrurus arizonensis Provides the First Reference for Parvorder Iurida
Abstract Over 400 million years old, scorpions represent an ancient group of arachnids and one of the first animals to adapt to life on land. Presently, the lack of available genomes within scorpions hinders research on their evolution. This study leverages ultralong nanopore sequencing and Pore-C to generate the first chromosome-level assembly and annotation for the desert hairy scorpion, Hadrurus arizonensis. The assembled genome is 2.23 Gb in size with an N50 of 280 Mb. Pore-C scaffolding reoriented 99.6% of bases into nine chromosomes and BUSCO identified 998 (98.6%) complete arthropod single copy orthologs. Repetitive elements represent 54.69% of the assembled bases, including 872,874 (29.39%) LINE elements. A total of 18,996 protein-coding genes and 75,256 transcripts were predicted, and extracted protein sequences yielded a BUSCO score of 97.2%. This is the first genome assembled and annotated within the family Hadruridae, representing a crucial resource for closing gaps in genomic knowledge of scorpions, resolving arachnid phylogeny, and advancing studies in comparative and functional genomics.  more » « less
Award ID(s):
2217100 1943371
PAR ID:
10534284
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ;
Editor(s):
Fraser, Bonnie
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Genome Biology and Evolution
Volume:
16
Issue:
5
ISSN:
1759-6653
Page Range / eLocation ID:
evae097
Subject(s) / Keyword(s):
scorpion arachnid Hadruridae reference genome pore-c nanopore
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Vogel, K (Ed.)
    Abstract We present the first chromosome-level genome assembly for Bombus pensylvanicus, a historically widespread native pollinator species that was distributed across eastern North America but has subsequently undergone declines in range area and local relative abundance. This species has been of significant interest as a model for understanding both patterns and possible causes of bumble bee decline in the region, including the role of genetic variation. Here we present a chromosome-level reference genome assembled using Pacific Biosciences singe-molecule HiFi sequences and Hi-C data and annotated using evidence derived from RNA sequencing of multiple tissue types. The B. pensylvanicus genome has a total length of ∼352.6 Mb and was assembled into a total of 224 scaffolds, with 19 primary pseudomolecules representing putative chromosomes and an N50 = 14.872 Mb. Annotation with the Eukaryotic Genome Annotation Pipeline—External (EGAPx) identified 11,411 genes (10,263 protein coding), and BUSCO analysis of 5,991 Hymenoptera-specific BUSCO groups indicated a completeness for the proteins of 99.0% (98.6% single-copy, 0.5% duplicated) and for the genome of 98.5% (98.2% single-copy, 0.3% duplicated). We present synteny analyses with other recently assembled Bombus genomes representing different subgenera and examine the distribution of repetitive regions of the genome relative to the distribution of genes and noncoding RNAs. 
    more » « less
  2. Abstract Raphidioptera (snakeflies) are a holometabolan order with the least species diversity but play a pivotal role in understanding the origin of complete metamorphosis. Here, we provide an annotated, chromosome-level reference genome assembly for an Asian endemic snakeflyMongoloraphidia duomilia(Yang, 1998) of the family Raphidiidae, assembled using PacBio HiFi and Hi-C data from female specimens. The resulting assembly is 653.56 Mb, of which 97.90% is anchored into 13 chromosomes. The scaffold N50 is 53.50 Mb, and BUSCO completeness is 97.80%. Repetitive elements comprise 64.31% of the genome (366.04 Mb). We identified 599 noncoding RNAs and predicted 11,141 protein-coding genes in the genome (97.70% BUSCO completeness). The new snakefly genome will facilitate comparison of genome architecture across Neuropterida and Holometabola and shed light on the ecological and evolutionary transitions between Neuropterida and Coleopterida. 
    more » « less
  3. Abstract Carya glabra(2n= 4x= 64), also known as pignut hickory, is a widely distributed species in the walnut family (Juglandaceae). Native to the central and eastern United States and southeastern Canada,C. glabraplays an important ecological role as a common upland forest species; it is closely related to several economically valuable nut trees, includingC. illinoinensis(pecan). A deeper understanding of the genetics ofC. glabrais essential for studying its evolutionary history and biology, with potential implications for agricultural improvement of pecan. Here, we present the first nuclear genome assembly and annotation ofC. glabra. The assembly is chromosome-level and phased, representing the first assembled polyploid genome in the genusCarya. A total of 64 pseudochromosomes were assembled and phased into four haplotypes. The haplotype A assembly spans 600.4 Mb, comprises 55.0% repetitive sequences, and contains 30,947 protein-coding genes, with a BUSCO completeness score of 97.7%. Functional annotation assigned 94.3% of haplotype A genes to gene families, and 79.7% and 86.3% of genes were annotated with Gene Ontology terms and protein domains, respectively; 635 putative plant disease resistance genes were found in haplotype A. The other three haplotypes exhibited similarly high-quality annotation metrics. Our genomic analyses also suggest thatC. glabrais an autotetraploid. Comparative genomic analyses revealed high collinearity among the four haplotypes ofC. glabraand the published genomes of three otherCaryaspecies, although structural variation among the genomes of these species was identified. In addition, we provide an improved chloroplast genome assembly and the first mitochondrial genome forC. glabra. Importantly, most members of the research team are undergraduate students; the sequenced individual is located in McCarty Woods, a Conservation Area on the University of Florida campus. This work highlights the value of genome assembly efforts as powerful tools for teaching genomics and supporting conservation initiatives. This first high-quality reference genome forC. glabraprovides a valuable resource for studyingCarya, a genus of significant ecological and economic importance. Article summaryCarya glabra(pignut hickory) is a common upland forest species in North America. This species is a member of the walnut family (Juglandaceae), which includes many economically important nut trees. Here, we present the first nuclear genome assembly and annotation ofC. glabra. The assembly is chromosome-level and phased. The haplotype A assembly contains 30,947 protein-coding genes, with a BUSCO completeness score of 97.7%. Our genomic analyses suggest thatC. glabrais an autopolyploid. We also provide chloroplast and mitochondrial genome assemblies. This nuclear genome provides a valuable resource for studyingCarya, a genus of significant ecological and economic importance. 
    more » « less
  4. Tarvin, R (Ed.)
    Abstract Although the publication of high-quality reference genomes is steadily increasing, many clades remain chronically neglected. Skinks (order, Squamata; family, Scincidae) are one of the most diverse lizard families (1,785 species), yet there are currently just six published chromosome-level skink genomes. Here, we present the first telomere-to-telomere, chromosome-level reference genome for one of the most abundant lizards in the eastern United States, the common five-lined skink (Plestiodon fasciatus). Through the sequencing of RNA, long-read DNA, and Hi-C chromatin interactions, we produced an annotated reference genome (N50 = 227MB, L50 = 3) consisting of 6 macrochromosome pairs and 7 microchromosome pairs with 98% of BUSCO genes represented (lineage, Sauropsida; 7480 BUSCO markers), providing one of the most complete skink genomes to date: rPleFas1.1. Functional annotation predicts 32,520 protein-coding genes (16,100 unique, named genes) with an average gene length of 9,372 bp. Repeat annotations estimate that transposable elements comprise 46.7% of the genome, for which we show the amount and content is remarkably conserved across Scincidae. 
    more » « less
  5. Abstract Fall webworm (Hyphantria cunea) is a widespread, highly polyphagous moth in the family Erebidae, whose native range spans much of North America and invasive range includes Asia and Europe. The species uses over 600 plant species as a larval host, making it among the most generalized insect herbivores described. Its variable host use, wide range, and genetic diversity make fall webworm an attractive emerging model system for the study of diet breadth, but studies have been limited by the lack of a high-quality annotated reference genome. Here we report an annotated, chromosome-scale genome of much improved continuity and completeness over the previously available unannotated fall webworm reference genome. We used PacBioHiFi long reads and Omni-C proximity ligation sequencing technology to produce a de novo assembled genome. Our genome assembly, the first for any species in the genus and third in the family, contains 321 scaffolds spanning 0.572 gigabases with a N50 of 14.6 Mb and BUSCO complete score of 99.1%. This genome will represent a valuable resource for research into the ecology, evolution, and genetics of dietary generalism and diet breadth in insect herbivores. 
    more » « less