skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Metagenomic clustering reveals microbial contamination as an essential consideration in ultraconserved element design for phylogenomics with insect museum specimens
Abstract Phylogenomics via ultraconserved elements (UCEs) has led to improved phylogenetic reconstructions across the tree of life. However, inadvertently incorporating non‐targeted DNA into the UCE marker design will lead to misinformation being incorporated into subsequent analyses. To date, the effectiveness of basic metagenomic filtering strategies has not been assessed in arthropods. Designing markers from museum specimens requires careful consideration of methods due to the high levels of microbial contamination typically found in such specimens. We investigate if contaminant sequences are carried forward into a UCE marker set we developed from insect museum specimens using a standard bioinformatics pipeline. We find that the methods currently employed by most researchers do not exclude contamination from the final set of targets. Lastly, we highlight several paths forward for reducing contamination in UCE marker design.  more » « less
Award ID(s):
1856402
PAR ID:
10371073
Author(s) / Creator(s):
 ;  ;  ;  ;  
Publisher / Repository:
Wiley Blackwell (John Wiley & Sons)
Date Published:
Journal Name:
Ecology and Evolution
Volume:
12
Issue:
3
ISSN:
2045-7758
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Marvaldi, Adriana (Ed.)
    Abstract Tailoring ultraconserved element (UCE) probe set design to focal taxa has been demonstrated to improve locus recovery and phylogenomic inference. However, beyond conducting expensive in vitro testing, it remains unclear how best to determine whether an existing UCE probe set is likely to suffice for phylogenomic inference or whether tailored probe design will be desirable. Here we investigate the utility of 8 different UCE probe sets for the in silico phylogenomic inference of scarabaeoid beetles. Probe sets tested differed in terms of (i) how phylogenetically distant from Scarabaeoidea taxa those used during probe design are, (ii) breadth of phylogenetic inference probe set was designed for, and (iii) method of probe design. As part of this study, 2 new UCE probe sets are produced for the beetle family Scarabaeidae and superfamily Hydrophiloidea. We confirm that probe set utility decreases with increasing phylogenetic distance from target taxa. In addition, narrowing the phylogenetic breadth of probe design decreases the phylogenetic capture range. We also confirm previous findings regarding ways to optimize UCE probe design. Finally, we make suggestions regarding assessment of need for de novo probe design. 
    more » « less
  2. Abstract Next‐generation sequencing has greatly expanded the utility and value of museum collections by revealing specimens as genomic resources. As the field of museum genomics grows, so does the need for extraction methods that maximize DNA yields. For avian museum specimens, the established method of extracting DNA from toe pads works well for most specimens. However, for some specimens, especially those of birds that are very small or very large, toe pads can be a poor source of DNA. In this study, we apply two DNA extraction methods (phenol–chloroform and silica column) to three different sources of DNA (toe pad, skin punch and bone) from 10 historical avian museum specimens. We show that a modified phenol–chloroform protocol yielded significantly more DNA than a silica column protocol (e.g., Qiagen DNeasy Blood & Tissue Kit) across all tissue types. However, extractions using the silica column protocol contained longer fragments on average than those using the phenol–chloroform protocol, probably as a result of loss of small fragments through the silica column. While toe pads yielded more DNA than skin punches and bone fragments, skin punches proved to be a reliable alternative source of DNA and might be especially appealing when toe pad extractions are impractical. Overall, we found that historical bird museum specimens contain substantial amounts of DNA for genomic studies under most extraction scenarios, but that a phenol–chloroform protocol consistently provides the high quantities of DNA required for most current genomic protocols. 
    more » « less
  3. Abstract Marker selection has emerged as an important component of phylogenomic study design due to rising concerns of the effects of gene tree estimation error, model misspecification, and data-type differences. Researchers must balance various trade-offs associated with locus length and evolutionary rate among other factors. The most commonly used reduced representation data sets for phylogenomics are ultraconserved elements (UCEs) and Anchored Hybrid Enrichment (AHE). Here, we introduce Rapidly Evolving Long Exon Capture (RELEC), a new set of loci that targets single exons that are both rapidly evolving (evolutionary rate faster than RAG1) and relatively long in length (>1,500 bp), while at the same time avoiding paralogy issues across amniotes. We compare the RELEC data set to UCEs and AHE in squamate reptiles by aligning and analyzing orthologous sequences from 17 squamate genomes, composed of 10 snakes and 7 lizards. The RELEC data set (179 loci) outperforms AHE and UCEs by maximizing per-locus genetic variation while maintaining presence and orthology across a range of evolutionary scales. RELEC markers show higher phylogenetic informativeness than UCE and AHE loci, and RELEC gene trees show greater similarity to the species tree than AHE or UCE gene trees. Furthermore, with fewer loci, RELEC remains computationally tractable for full Bayesian coalescent species tree analyses. We contrast RELEC to and discuss important aspects of comparable methods, and demonstrate how RELEC may be the most effective set of loci for resolving difficult nodes and rapid radiations. We provide several resources for capturing or extracting RELEC loci from other amniote groups. 
    more » « less
  4. Onychophora are cryptic, soil-dwelling invertebrates known for their biogeographic affinities, diversity of reproductive modes, close phylogenetic relationship to arthropods, and peculiar prey capture mechanism. The 216 valid species of Onychophora are grouped into two families – Peripatopsidae and Peripatidae – and apart from a few relationships among major lineages within these two families, a stable phylogenetic backbone for the phylum has yet to be resolved. This has hindered our understanding of onychophoran biogeographic patterns, evolutionary history, and systematics. Neopatida, the Neotropical clade of peripatids, has proved particularly difficult, with recalcitrant nodes and low resolution, potentially due to rapid radiation of the group during the Cretaceous. Previous studies have had to compromise between number of loci and number of taxa due to limitations of Sanger sequencing and phylotranscriptomics, respectively. Additionally, aspects of their genome size and structure have made molecular phylogenetics difficult and data matrices have been affected by missing data. To address these issues, we leveraged recent, published transcriptomes and the first high quality genome for the phylum and designed a high affinity ultraconserved element (UCE) probe set for Onychophora. This new probe set, consisting of ~ 20,000 probes that target 1,465 loci across both families, has high locus recovery and phylogenetic utility. Phylogenetic analyses recovered the monophyly of major clades of Onychophora and revealed a novel lineage from the Neotropics that challenges our current understanding of onychophoran biogeographic endemicity. This new resource could drastically increase the power of molecular datasets and potentially allow access to genomic scale data from archival museum specimens to further tackle the issues exasperating onychophoran systematics. 
    more » « less
  5. Sharma, Prashant (Ed.)
    Pettalidae is a family of mite harvestmen that inhabits the former circum-Antarctic Gondwanan terranes, including southern South America, South Africa, Madagascar, Sri Lanka, Australia and New Zealand. Australia is home to two pettalid genera, Austropurcellia, in northern New South Wales and Queensland, and Karripurcellia, in Western Australia, until now showing a large distributional gap between these two parts of the Australian continent. Here we report specimens of a new pettalid from South Australia, Archaeopurcellia eureka, gen. et sp. nov., closing this distributional gap of Australian pettalids. Phylogenetic analyses using traditional Sanger markers as well as ultra-conserved elements (UCEs) reveal that the new genus is related to the Chilean Chileogovea, instead of any of the other East Gondwanan genera. This relationship of an Australian species to a South American clade can be explained by the Antarctic land bridge between these two terranes, a connection that was maintained with Australia until 45 Ma. The UCE dataset also shows the promise of using museum specimens to resolve relationships within Pettalidae and Cyphophthalmi. ZooBank: urn:lsid:zoobank.org:pub:9B57A054-30D8-4412-99A2-6191CBD3BD7E 
    more » « less