skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Skimming genomes for systematics and DNA barcodes of corals
Abstract Numerous genomic methods developed over the past two decades have enabled the discovery and extraction of orthologous loci to help resolve phylogenetic relationships across various taxa and scales. Genome skimming (or low‐coverage genome sequencing) is a promising method to not only extract high‐copy loci but also 100s to 1000s of phylogenetically informative nuclear loci (e.g., ultraconserved elements [UCEs] and exons) from contemporary and museum samples. The subphylum Anthozoa, including important ecosystem engineers (e.g., stony corals, black corals, anemones, and octocorals) in the marine environment, is in critical need of phylogenetic resolution and thus might benefit from a genome‐skimming approach. We conducted genome skimming on 242 anthozoan corals collected from 1886 to 2022. Using existing target‐capture baitsets, we bioinformatically obtained UCEs and exons from the genome‐skimming data and incorporated them with data from previously published target‐capture studies. The mean number of UCE and exon loci extracted from the genome skimming data was 1837 ± 662 SD for octocorals and 1379 ± 476 SD loci for hexacorals. Phylogenetic relationships were well resolved within each class. A mean of 1422 ± 720 loci was obtained from the historical specimens, with 1253 loci recovered from the oldest specimen collected in 1886. We also obtained partial to whole mitogenomes and nuclear rRNA genes from >95% of samples. Bioinformatically pulling UCEs, exons, mitochondrial genomes, and nuclear rRNA genes from genome skimming data is a viable and low‐cost option for phylogenetic studies. This approach can be used to review and support taxonomic revisions and reconstruct evolutionary histories, including historical museum and type specimens.  more » « less
Award ID(s):
1929319
PAR ID:
10660010
Author(s) / Creator(s):
 ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  
Publisher / Repository:
Wiley
Date Published:
Journal Name:
Ecology and Evolution
Volume:
14
Issue:
5
ISSN:
2045-7758
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Wiegmann, Brian (Ed.)
    Abstract Ultraconserved genomic elements (UCEs) are generally treated as independent loci in phylogenetic analyses. The identification pipeline for UCE probes does not require prior knowledge of genetic identity, only selecting loci that are highly conserved, single copy, without repeats, and of a particular length. Here, we characterized UCEs from 11 phylogenomic studies across the animal tree of life, from birds to marine invertebrates. We found that within vertebrate lineages, UCEs are mostly intronic and intergenic, while in invertebrates, the majority are in exons. We then curated four different sets of UCE markers by genomic category from five different studies including: birds, mammals, fish, Hymenoptera (ants, wasps, and bees), and Coleoptera (beetles). Of genes captured by UCEs, we find that many are represented by two or more UCEs, corresponding to nonoverlapping segments of a single gene. We considered these UCEs to be nonindependent, merged all UCEs that belonged to a particular gene, constructed gene and species trees, and then evaluated the subsequent effect of merging cogenic UCEs on gene and species tree reconstruction. Average bootstrap support for merged UCE gene trees was significantly improved across all data sets apparently driven by the increase in loci length. Additionally, we conducted simulations and found that gene trees generated from merged UCEs were more accurate than those generated by unmerged UCEs. As loci length improves gene tree accuracy, this modest degree of UCE characterization and curation impacts downstream analyses and demonstrates the advantages of incorporating basic genomic characterizations into phylogenomic analyses. [Anchored hybrid enrichment; ants; ASTRAL; bait capture; carangimorph; Coleoptera; conserved nonexonic elements; exon capture; gene tree; Hymenoptera; mammal; phylogenomic markers; songbird; species tree; ultraconserved elements; weevils.] 
    more » « less
  2. Abstract Snake venoms are complex mixtures of toxic proteins that hold significant medical, pharmacological and evolutionary interest. To better understand the genetic diversity underlying snake venoms, we developed VenomCap, a novel exon‐capture probe set targeting toxin‐coding genes from a wide range of elapid snakes, with a particular focus on the ecologically diverse and medically important subfamily Hydrophiinae. We tested the capture success of VenomCap across 24 species, representing all major elapid lineages. We included snake phylogenomic probes in the VenomCap capture set, allowing us to compare capture performance between venom and phylogenomic loci and to infer elapid phylogenetic relationships. We demonstrated VenomCap's ability to recover exons from ~1500 target markers, representing a total of 24 known venom gene families, which includes the dominant gene families found in elapid venoms. We find that VenomCap's capture results are robust across all elapids sampled, and especially among hydrophiines, with respect to measures of target capture success (target loci matched, sensitivity, specificity and missing data). As a cost‐effective and efficient alternative to full genome sequencing, VenomCap can dramatically accelerate the sequencing and analysis of venom gene families. Overall, our tool offers a model for genomic studies on snake venom gene diversity and evolution that can be expanded for comprehensive comparisons across the other families of venomous snakes. 
    more » « less
  3. Abstract PremiseTo date, phylogenetic relationships within the monogeneric Brunelliaceae have been based on morphological evidence, which does not provide sufficient phylogenetic resolution. Here we use target‐enriched nuclear data to improve our understanding of phylogenetic relationships in the family. MethodsWe used the Angiosperms353 toolkit for targeted recovery of exonic regions and supercontigs (exons + introns) from low copy nuclear genes from 53 of 70 species inBrunellia, and several outgroup taxa. We removed loci that indicated biased inference of relationships and applied concatenated and coalescent methods to inferBrunelliaphylogeny. We identified conflicts among gene trees that may reflect hybridization or incomplete lineage sorting events and assessed their impact on phylogenetic inference. Finally, we performed ancestral‐state reconstructions of morphological traits and assessed the homology of character states used to define sections and subsections inBrunellia. ResultsBrunelliacomprises two major clades and several subclades. Most of these clades/subclades do not correspond to previous infrageneric taxa. There is high topological incongruence among the subclades across analyses. ConclusionsPhylogenetic reconstructions point to rapid species diversification in Brunelliaceae, reflected in very short branches between successive species splits. The removal of putatively biased loci slightly improves phylogenetic support for individual clades. Reticulate evolution due to hybridization and/or incomplete lineage sorting likely both contribute to gene‐tree discordance. Morphological characters used to define taxa in current classification schemes are homoplastic in the ancestral character‐state reconstructions. While target enrichment data allows us to broaden our understanding of diversification inBrunellia, the relationships among subclades remain incompletely understood. 
    more » « less
  4. Onychophora are cryptic, soil-dwelling invertebrates known for their biogeographic affinities, diversity of reproductive modes, close phylogenetic relationship to arthropods, and peculiar prey capture mechanism. The 216 valid species of Onychophora are grouped into two families – Peripatopsidae and Peripatidae – and apart from a few relationships among major lineages within these two families, a stable phylogenetic backbone for the phylum has yet to be resolved. This has hindered our understanding of onychophoran biogeographic patterns, evolutionary history, and systematics. Neopatida, the Neotropical clade of peripatids, has proved particularly difficult, with recalcitrant nodes and low resolution, potentially due to rapid radiation of the group during the Cretaceous. Previous studies have had to compromise between number of loci and number of taxa due to limitations of Sanger sequencing and phylotranscriptomics, respectively. Additionally, aspects of their genome size and structure have made molecular phylogenetics difficult and data matrices have been affected by missing data. To address these issues, we leveraged recent, published transcriptomes and the first high quality genome for the phylum and designed a high affinity ultraconserved element (UCE) probe set for Onychophora. This new probe set, consisting of ~ 20,000 probes that target 1,465 loci across both families, has high locus recovery and phylogenetic utility. Phylogenetic analyses recovered the monophyly of major clades of Onychophora and revealed a novel lineage from the Neotropics that challenges our current understanding of onychophoran biogeographic endemicity. This new resource could drastically increase the power of molecular datasets and potentially allow access to genomic scale data from archival museum specimens to further tackle the issues exasperating onychophoran systematics. 
    more » « less
  5. Over the past decade, museum genomics studies have focused on obtaining DNA of sufficient quality and quantity for sequencing from fluid-preserved natural history specimens, primarily to be used in systematic studies. While these studies have opened windows to evolutionary and biodiversity knowledge of many species worldwide, published works often focus on the success of these DNA sequencing efforts, which is undoubtedly less common than obtaining minimal or sometimes no DNA or unusable sequence data from specimens in natural history collections. Here, we attempt to obtain and sequence DNA extracts from 115 fresh and 41 degraded samples of homalopsid snakes, as well as from two degraded samples of a poorly known snake, Hydrablabes periops . Hydrablabes has been suggested to belong to at least two different families (Natricidae and Homalopsidae) and with no fresh tissues known to be available, intractable museum specimens currently provide the only opportunity to determine this snake’s taxonomic affinity. Although our aim was to generate a target-capture dataset for these samples, to be included in a broader phylogenetic study, results were less than ideal due to large amounts of missing data, especially using the same downstream methods as with standard, high-quality samples. However, rather than discount results entirely, we used mapping methods with references and pseudoreferences, along with phylogenetic analyses, to maximize any usable molecular data from our sequencing efforts, identify the taxonomic affinity of H. periops , and compare sequencing success between fresh and degraded tissue samples. This resulted in largely complete mitochondrial genomes for five specimens and hundreds to thousands of nuclear loci (ultra-conserved loci, anchored-hybrid enrichment loci, and a variety of loci frequently used in squamate phylogenetic studies) from fluid-preserved snakes, including a specimen of H. periops from the Field Museum of Natural History collection. We combined our H. periops data with previously published genomic and Sanger-sequenced datasets to confirm the familial designation of this taxon, reject previous taxonomic hypotheses, and make biogeographic inferences for Hydrablabes . A second H. periops specimen, despite being seemingly similar for initial raw sequencing results and after being put through the same protocols, resulted in little usable molecular data. We discuss the successes and failures of using different pipelines and methods to maximize the products from these data and provide expectations for others who are looking to use DNA sequencing efforts on specimens that likely have degraded DNA. Life Science Identifier ( Hydrablabes periops ) urn:lsid:zoobank.org :pub:F2AA44 E2-D2EF-4747-972A-652C34C2C09D. 
    more » « less