Title: Unveiling the Genetic Blueprint of a Desert Scorpion: A Chromosome-level Genome of Hadrurus arizonensis Provides the First Reference for Parvorder Iurida
Abstract Over 400 million years old, scorpions represent an ancient group of arachnids and one of the first animals to adapt to life on land. Presently, the lack of available genomes within scorpions hinders research on their evolution. This study leverages ultralong nanopore sequencing and Pore-C to generate the first chromosome-level assembly and annotation for the desert hairy scorpion, Hadrurus arizonensis. The assembled genome is 2.23 Gb in size with an N50 of 280 Mb. Pore-C scaffolding reoriented 99.6% of bases into nine chromosomes and BUSCO identified 998 (98.6%) complete arthropod single copy orthologs. Repetitive elements represent 54.69% of the assembled bases, including 872,874 (29.39%) LINE elements. A total of 18,996 protein-coding genes and 75,256 transcripts were predicted, and extracted protein sequences yielded a BUSCO score of 97.2%. This is the first genome assembled and annotated within the family Hadruridae, representing a crucial resource for closing gaps in genomic knowledge of scorpions, resolving arachnid phylogeny, and advancing studies in comparative and functional genomics.
Award ID(s):
2217100 1943371
PAR ID:
10534284
Editor(s):
Fraser, Bonnie
Publisher / Repository:
Oxford University Press
Journal Name:
Genome Biology and Evolution
Volume:
16
Issue:
5
ISSN:
1759-6653
Page Range / eLocation ID:
evae097
Subject(s) / Keyword(s):
scorpion arachnid Hadruridae reference genome pore-c nanopore
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Raphidioptera (snakeflies) are a holometabolan order with the least species diversity but play a pivotal role in understanding the origin of complete metamorphosis. Here, we provide an annotated, chromosome-level reference genome assembly for an Asian endemic snakefly Mongoloraphidia duomilia (Yang, 1998) of the family Raphidiidae, assembled using PacBio HiFi and Hi-C data from female specimens. The resulting assembly is 653.56 Mb, of which 97.90% is anchored into 13 chromosomes. The scaffold N50 is 53.50 Mb, and BUSCO completeness is 97.80%. Repetitive elements comprise 64.31% of the genome (366.04 Mb). We identified 599 noncoding RNAs and predicted 11,141 protein-coding genes in the genome (97.70% BUSCO completeness). The new snakefly genome will facilitate comparison of genome architecture across Neuropterida and Holometabola and shed light on the ecological and evolutionary transitions between Neuropterida and Coleopterida.
  2. Reinke, Valerie (Ed.)
    Abstract As an entomopathogenic nematode (EPN), Steinernema hermaphroditum parasitizes insect hosts and harbors symbiotic Xenorhabdus griffinae bacteria. In contrast to other Steinernematids, S. hermaphroditum has hermaphroditic genetics, offering the experimental scope found in Caenorhabditis elegans. To enable study of S. hermaphroditum, we have assembled and analyzed its reference genome. This genome assembly has five chromosomal scaffolds and 83 unassigned scaffolds totaling 90.7 Mb, with 19,426 protein-coding genes having a BUSCO completeness of 88.0%. Its autosomes show higher densities of strongly conserved genes in their centers, as in C. elegans, but repetitive elements are evenly distributed along all chromosomes, rather than with higher arm densities as in C. elegans. Either when comparing protein motif frequencies between nematode species or when analyzing gene family expansions during nematode evolution, we observed two categories of genes preferentially associated with the origin of Steinernema or S. hermaphroditum: orthologs of venom genes in S. carpocapsae or S. feltiae; and some types of chemosensory G protein-coupled receptors, despite the tendency of parasitic nematodes to have reduced numbers of chemosensory genes. Three-quarters of venom orthologs occurred in gene clusters, with the larger clusters comprising functionally diverse gene groups rather than paralogous repeats of a single venom gene. While assembling S. hermaphroditum, we coassembled bacterial genomes, finding sequence data for not only the known symbiont, X. griffinae, but also for eight other bacterial genera. All eight genera have previously been observed to be associated with Steinernema species or the EPN Heterorhabditis, and may constitute a second bacterial circle of EPNs. 
  3. Abstract We present the first long-read de novo assembly and annotation of the luna moth (Actias luna) and provide the full characterization of heavy chain fibroin (h-fibroin), a long and highly repetitive gene (>20 kb) essential in silk fiber production. There are >160,000 described species of moths and butterflies (Lepidoptera), but only within the last 5 years have we begun to recover high-quality annotated whole genomes across the order that capture h-fibroin. Using PacBio HiFi reads, we produce the first high-quality long-read reference genome for this species. The assembled genome has a length of 532 Mb, a contig N50 of 16.8 Mb, an L50 of 14 contigs, and 99.4% completeness (BUSCO). Our annotation using Bombyx mori protein and A. luna RNAseq evidence captured a total of 20,866 genes at 98.9% completeness with 10,267 functionally annotated proteins and a full-length h-fibroin annotation of 2,679 amino acid residues. 
  4. Lavrov, Dennis (Ed.)
    Abstract The painted lady butterfly, Vanessa cardui, has the longest migration routes, the widest hostplant diversity, and one of the most complex wing patterns of any insect. Due to minimal culturing requirements, easily characterized wing pattern elements, and technical feasibility of CRISPR/Cas9 genome editing, V. cardui is emerging as a functional genomics model for diverse research programs. Here, we report a high-quality, annotated genome assembly of the V. cardui genome, generated using 84× coverage of PacBio long-read data, which we assembled into 205 contigs with a total length of 425.4 Mb (N50 = 10.3 Mb). The genome was very complete (single-copy complete Benchmarking Universal Single-Copy Orthologs [BUSCO] 97%), with contigs assembled into presumptive chromosomes using synteny analyses. Our annotation used embryonic, larval, and pupal transcriptomes, and 20 transcriptomes across five different wing developmental stages. Gene annotations showed a high level of accuracy and completeness, with 14,437 predicted protein-coding genes. This annotated genome assembly constitutes an important resource for diverse functional genomic studies ranging from the developmental genetic basis of butterfly color pattern, to coevolution with diverse hostplants. 
  5. Abstract The first chromosome-scale reference genome of the rare narrow-endemic African moss Physcomitrellopsis africana (P. africana) is presented here. Assembled from 73 × Oxford Nanopore Technologies (ONT) long reads and 163 × Beijing Genomics Institute (BGI)-seq short reads, the 414 Mb reference comprises 26 chromosomes and 22,925 protein-coding genes [Benchmarking Universal Single-Copy Ortholog (BUSCO) scores: C:94.8% (D:13.9%)]. This genome holds 2 genes that withstood rigorous filtration of microbial contaminants, have no homolog in other land plants, and are thus interpreted as resulting from 2 unique horizontal gene transfers (HGTs) from microbes. Further, P. africana shares 176 of the 273 published HGT candidates identified in Physcomitrium patens (P. patens), but lacks 98 of these, highlighting that perhaps as many as 91 genes were acquired in P. patens in the last 40 million years following its divergence from its common ancestor with P. africana. These observations suggest rather continuous gene gains via HGT followed by potential losses during the diversification of the Funariaceae. Our findings showcase both dynamic flux in plant HGTs over evolutionarily “short” timescales, alongside enduring impacts of successful integrations, like those still functionally maintained in extant P. africana. Furthermore, this study describes the informatic processes employed to distinguish contaminants from candidate HGT events. 