skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: A chromosome-level genome of the giant vinegaroon Mastigoproctus giganteus exhibits the signature of pre-Silurian whole genome duplication
Within the arachnids, chromosome-level genome assemblies have greatly accelerated the understanding of gene family evolution and developmental genomics in key groups, such as spiders (Araneae), mites and ticks (Acariformes and Parasitiformes). Among other poorly studied arachnid orders that lack genome assemblies altogether are the clade Pedipalpi, which is comprised of three orders that form the sister group of spiders, which diverged over 400 Mya. We close this gap by generating the first chromosome-level assembly from a single specimen of the vinegaroon Mastigoproctus giganteus (Uropygi). We show that this highly complete genome retains plesiomorphic conditions for many gene families that have undergone lineage-specific derivations within the more diverse spiders. Consistent with the phylogenetic position of Uropygi, macrosynteny in the M. giganteus genome substantiates the signature of an ancient whole genome duplication.  more » « less
Award ID(s):
2016141
PAR ID:
10563128
Author(s) / Creator(s):
; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Journal of Heredity
ISSN:
0022-1503
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Gossmann, Toni (Ed.)
    Abstract Spiders (Araneae) have a diverse spectrum of morphologies, behaviors, and physiologies. Attempts to understand the genomic-basis of this diversity are often hindered by their large, heterozygous, and AT-rich genomes with high repeat content resulting in highly fragmented, poor-quality assemblies. As a result, the key attributes of spider genomes, including gene family evolution, repeat content, and gene function, remain poorly understood. Here, we used Illumina and Dovetail Chicago technologies to sequence the genome of the long-jawed spider Tetragnatha kauaiensis, producing an assembly distributed along 3,925 scaffolds with an N50 of ∼2 Mb. Using comparative genomics tools, we explore genome evolution across available spider assemblies. Our findings suggest that the previously reported and vast genome size variation in spiders is linked to the different representation and number of transposable elements. Using statistical tools to uncover gene-family level evolution, we find expansions associated with the sensory perception of taste, immunity, and metabolism. In addition, we report strikingly different histories of chemosensory, venom, and silk gene families, with the first two evolving much earlier, affected by the ancestral whole genome duplication in Arachnopulmonata (∼450 Ma) and exhibiting higher numbers. Together, our findings reveal that spider genomes are highly variable and that genomic novelty may have been driven by the burst of an ancient whole genome duplication, followed by gene family and transposable element expansion. 
    more » « less
  2. Abstract Genomic resources across squamate reptiles (lizards and snakes) have lagged behind other vertebrate systems and high-quality reference genomes remain scarce. Of the 23 chromosome-scale reference genomes across the order, only 12 of the ~60 squamate families are represented. Within geckos (infraorder Gekkota), a species-rich clade of lizards, chromosome-level genomes are exceptionally sparse representing only two of the seven extant families. Using the latest advances in genome sequencing and assembly methods, we generated one of the highest-quality squamate genomes to date for the leopard gecko, Eublepharis macularius (Eublepharidae). We compared this assembly to the previous, short-read only, E. macularius reference genome published in 2016 and examined potential factors within the assembly influencing contiguity of genome assemblies using PacBio HiFi data. Briefly, the read N50 of the PacBio HiFi reads generated for this study was equal to the contig N50 of the previous E. macularius reference genome at 20.4 kilobases. The HiFi reads were assembled into a total of 132 contigs, which was further scaffolded using HiC data into 75 total sequences representing all 19 chromosomes. We identified 9 of the 19 chromosomal scaffolds were assembled as a near-single contig, whereas the other 10 chromosomes were each scaffolded together from multiple contigs. We qualitatively identified that the percent repeat content within a chromosome broadly affects its assembly contiguity prior to scaffolding. This genome assembly signifies a new age for squamate genomics where high-quality reference genomes rivaling some of the best vertebrate genome assemblies can be generated for a fraction of previous cost estimates. This new E. macularius reference assembly is available on NCBI at JAOPLA010000000. 
    more » « less
  3. Suh, Alexander (Ed.)
    Abstract Although spiders are one of the most diverse groups of arthropods, the genetic architecture of their evolutionary adaptations is largely unknown. Specifically, ancient genome-wide duplication occurring during arachnid evolution ~450 mya resulted in a vast assembly of gene families, yet the extent to which selection has shaped this variation is understudied. To aid in comparative genome sequence analyses, we provide a chromosome-level genome of the Western black widow spider (Latrodectus hesperus)—a focus due to its silk properties, venom applications, and as a model for urban adaptation. We used long-read and Hi-C sequencing data, combined with transcriptomes, to assemble 14 chromosomes in a 1.46 Gb genome, with 38,393 genes annotated, and a BUSCO score of 95.3%. Our analyses identified high repetitive gene content and heterozygosity, consistent with other spider genomes, which has led to challenges in genome characterization. Our comparative evolutionary analyses of eight genomes available for species within the Araneoidea group (orb weavers and their descendants) identified 1,827 single-copy orthologs. Of these, 155 exhibit significant positive selection primarily associated with developmental genes, and with traits linked to sensory perception. These results support the hypothesis that several traits unique to spiders emerged from the adaptive evolution of ohnologs—or retained ancestrally duplicated genes—from ancient genome-wide duplication. These comparative spider genome analyses can serve as a model to understand how positive selection continually shapes ancestral duplications in generating novel traits today within and between diverse taxonomic groups. 
    more » « less
  4. Pyhäjärvi, T (Ed.)
    Abstract Blackberries (Rubus spp.) are the fourth most economically important berry crop worldwide. Genome assemblies and annotations have been developed for Rubus species in subgenus Idaeobatus, including black raspberry (R. occidentalis), red raspberry (R. idaeus), and R. chingii, but very few genomic resources exist for blackberries and their relatives in subgenus Rubus. Here we present a chromosome-length assembly and annotation of the diploid blackberry germplasm accession “Hillquist” (R. argutus). “Hillquist” is the only known source of primocane-fruiting (annual-fruiting) in tetraploid fresh-market blackberry breeding programs and is represented in the pedigree of many important cultivars worldwide. The “Hillquist” assembly, generated using Pacific Biosciences long reads scaffolded with high-throughput chromosome conformation capture sequencing, consisted of 298 Mb, of which 270 Mb (90%) was placed on 7 chromosome-length scaffolds with an average length of 38.6 Mb. Approximately 52.8% of the genome was composed of repetitive elements. The genome sequence was highly collinear with a novel maternal haplotype-resolved linkage map of the tetraploid blackberry selection A-2551TN and genome assemblies of R. chingii and red raspberry. A total of 38,503 protein-coding genes were predicted, of which 72% were functionally annotated. Eighteen flowering gene homologs within a previously mapped locus aligning to an 11.2 Mb region on chromosome Ra02 were identified as potential candidate genes for primocane-fruiting. The utility of the “Hillquist” genome has been demonstrated here by the development of the first genotyping-by-sequencing-based linkage map of tetraploid blackberry and the identification of possible candidate genes for primocane-fruiting. This chromosome-length assembly will facilitate future studies in Rubus biology, genetics, and genomics and strengthen applied breeding programs. 
    more » « less
  5. Synopsis The proliferation of genomic resources for Chelicerata in the past 10 years has revealed that the evolution of chelicerate genomes is more dynamic than previously thought, with multiple waves of ancient whole genome duplications affecting separate lineages. Such duplication events are fascinating from the perspective of evolutionary history because the burst of new gene copies associated with genome duplications facilitates the acquisition of new gene functions (neofunctionalization), which may in turn lead to morphological novelties and spur net diversification. While neofunctionalization has been invoked in several contexts with respect to the success and diversity of spiders, the overall impact of whole genome duplications on chelicerate evolution and development remains imperfectly understood. The purpose of this review is to examine critically the role of whole genome duplication on the diversification of the extant arachnid orders, as well as assess functional datasets for evidence of subfunctionalization or neofunctionalization in chelicerates. This examination focuses on functional data from two focal model taxa: the spider Parasteatoda tepidariorum, which exhibits evidence for an ancient duplication, and the harvestman Phalangium opilio, which exhibits an unduplicated genome. I show that there is no evidence that taxa with genome duplications are more successful than taxa with unduplicated genomes. I contend that evidence for sub- or neofunctionalization of duplicated developmental patterning genes in spiders is indirect or fragmentary at present, despite the appeal of this postulate for explaining the success of groups like spiders. Available expression data suggest that the condition of duplicated Hox modules may have played a role in promoting body plan disparity in the posterior tagma of some orders, such as spiders and scorpions, but functional data substantiating this postulate are critically missing. Spatiotemporal dynamics of duplicated transcription factors in spiders may represent cases of developmental system drift, rather than neofunctionalization. Developmental system drift may represent an important, but overlooked, null hypothesis for studies of paralogs in chelicerate developmental biology. To distinguish between subfunctionalization, neofunctionalization, and developmental system drift, concomitant establishment of comparative functional datasets from taxa exhibiting the genome duplication, as well as those that lack the paralogy, is sorely needed. 
    more » « less