skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: To design, or not to design? Comparison of beetle ultraconserved element probe set utility based on phylogenetic distance, breadth, and method of probe design
Abstract Tailoring ultraconserved element (UCE) probe set design to focal taxa has been demonstrated to improve locus recovery and phylogenomic inference. However, beyond conducting expensive in vitro testing, it remains unclear how best to determine whether an existing UCE probe set is likely to suffice for phylogenomic inference or whether tailored probe design will be desirable. Here we investigate the utility of 8 different UCE probe sets for the in silico phylogenomic inference of scarabaeoid beetles. Probe sets tested differed in terms of (i) how phylogenetically distant from Scarabaeoidea taxa those used during probe design are, (ii) breadth of phylogenetic inference probe set was designed for, and (iii) method of probe design. As part of this study, 2 new UCE probe sets are produced for the beetle family Scarabaeidae and superfamily Hydrophiloidea. We confirm that probe set utility decreases with increasing phylogenetic distance from target taxa. In addition, narrowing the phylogenetic breadth of probe design decreases the phylogenetic capture range. We also confirm previous findings regarding ways to optimize UCE probe design. Finally, we make suggestions regarding assessment of need for de novo probe design.  more » « less
Award ID(s):
1942193
PAR ID:
10493725
Author(s) / Creator(s):
; ; ; ;
Editor(s):
Marvaldi, Adriana
Publisher / Repository:
Oxford Academics
Date Published:
Journal Name:
Insect Systematics and Diversity
Volume:
7
Issue:
4
ISSN:
2399-3421
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. This repository contains Materials and designed UCE probe sets for the manuscript entitled "To design or not to design? Comparison of beetle ultraconserved element probe set utility based on phylogenetic distance, breadth, and method of probe design". 
    more » « less
  2. Abstract PremiseRubiaceae is among the most species‐rich plant families, as well as one of the most morphologically and geographically diverse. Currently available phylogenies have mostly relied on few genomic and plastid loci, as opposed to large‐scale genomic data. Target enrichment provides the ability to generate sequence data for hundreds to thousands of phylogenetically informative, single‐copy loci, which often leads to improved phylogenetic resolution at both shallow and deep taxonomic scales; however, a publicly accessible Rubiaceae‐specific probe set that allows for comparable phylogenetic inference across clades is lacking. MethodsHere, we use publicly accessible genomic resources to identify putatively single‐copy nuclear loci for target enrichment in two Rubiaceae groups: tribe Hillieae (Cinchonoideae) and tribal complex Palicoureeae+Psychotrieae (Rubioideae). We sequenced 2270 exonic regions corresponding to 1059 loci in our target clades and generated in silico target enrichment sequences for other Rubiaceae taxa using our designed probe set. To test the utility of our probe set for phylogenetic inference across Rubiaceae, we performed a coalescent‐aware phylogenetic analysis using a subset of 27 Rubiaceae taxa from 10 different tribes and three subfamilies, and one outgroup in Apocynaceae. ResultsWe recovered an average of 75% and 84% of targeted exons and loci, respectively, per Rubiaceae sample. Probes designed using genomic resources from a particular subfamily were most efficient at targeting sequences from taxa in that subfamily. The number of paralogs recovered during assembly varied for each clade. Phylogenetic inference of Rubiaceae with our target regions resolves relationships at various scales. Relationships are largely consistent with previous studies of relationships in the family with high support (≥0.98 local posterior probability) at nearly all nodes and evidence of gene tree discordance. DiscussionOur probe set, which we call Rubiaceae2270x, was effective for targeting loci in species across and even outside of Rubiaceae. This probe set will facilitate phylogenomic studies in Rubiaceae and advance systematics and macroevolutionary studies in the family. 
    more » « less
  3. Abstract PremiseTarget sequence capture (Hyb‐Seq) is a cost‐effective sequencing strategy that employs RNA probes to enrich for specific genomic sequences. By targeting conserved low‐copy orthologs, Hyb‐Seq enables efficient phylogenomic investigations. Here, we present Asparagaceae1726—a Hyb‐Seq probe set targeting 1726 low‐copy nuclear genes for phylogenomics in the angiosperm family Asparagaceae—which will aid the often‐challenging delineation and resolution of evolutionary relationships within Asparagaceae. MethodsHere we describe and validate the Asparagaceae1726 probe set (https://github.com/bentzpc/Asparagaceae1726) in six of the seven subfamilies of Asparagaceae. We perform phylogenomic analyses with these 1726 loci and evaluate how inclusion of paralogs and bycatch plastome sequences can enhance phylogenomic inference with target‐enriched data sets. ResultsWe recovered at least 82% of target orthologs from all sampled taxa, and phylogenomic analyses resulted in strong support for all subfamilial relationships. Additionally, topology and branch support were congruent between analyses with and without inclusion of target paralogs, suggesting that paralogs had limited effect on phylogenomic inference. DiscussionAsparagaceae1726 is effective across the family and enables the generation of robust data sets for phylogenomics of any Asparagaceae taxon. Asparagaceae1726 establishes a standardized set of loci for phylogenomic analysis in Asparagaceae, which we hope will be widely used for extensible and reproducible investigations of diversification in the family. 
    more » « less
  4. Yoshizawa, Kazunori (Ed.)
    Abstract The order Psocodea includes the two historically recognized groups Psocoptera (free-living bark lice) and Phthiraptera (parasitic lice) that were once considered separate orders. Psocodea is divided in three suborders: Trogiomorpha, Troctomorpha, and Psocomorpha, the latter being the largest within the free-living groups. Despite the increasing number of transcriptomes and whole genome sequence (WGS) data available for this group, the relationships among the six known infraorders within Psocomorpha remain unclear. Here, we evaluated the utility of a bait set designed specifically for parasitic lice belonging to suborder Troctomorpha to extract UCE loci from transcriptome and WGS data of 55 bark louse species and explored the phylogenetic relationships within Psocomorpha using these UCE loci markers. Taxon sampling was heavily focused on the families Lachesillidae and Elipsocidae, whose relationships have been problematic in prior phylogenetic studies. We successfully recovered a total of 2,622 UCE loci, with a 40% completeness matrix containing 2,081 UCE loci and an 80% completeness matrix containing 178 UCE loci. The average number of UCE loci recovered for the 55 species was 1,401. The WGS data sets produced a larger number of UCE loci (1,495) on average than the transcriptome data sets (972). Phylogenetic relationships reconstructed with Maximum Likelihood and coalescent-based analysis were concordant regarding the paraphyly of Lachesillidae and Elipsocidae. Branch support values were generally lower in analyses that used a fewer number of loci, even though they had higher matrix completeness. 
    more » « less
  5. Onychophora are cryptic, soil-dwelling invertebrates known for their biogeographic affinities, diversity of reproductive modes, close phylogenetic relationship to arthropods, and peculiar prey capture mechanism. The 216 valid species of Onychophora are grouped into two families – Peripatopsidae and Peripatidae – and apart from a few relationships among major lineages within these two families, a stable phylogenetic backbone for the phylum has yet to be resolved. This has hindered our understanding of onychophoran biogeographic patterns, evolutionary history, and systematics. Neopatida, the Neotropical clade of peripatids, has proved particularly difficult, with recalcitrant nodes and low resolution, potentially due to rapid radiation of the group during the Cretaceous. Previous studies have had to compromise between number of loci and number of taxa due to limitations of Sanger sequencing and phylotranscriptomics, respectively. Additionally, aspects of their genome size and structure have made molecular phylogenetics difficult and data matrices have been affected by missing data. To address these issues, we leveraged recent, published transcriptomes and the first high quality genome for the phylum and designed a high affinity ultraconserved element (UCE) probe set for Onychophora. This new probe set, consisting of ~ 20,000 probes that target 1,465 loci across both families, has high locus recovery and phylogenetic utility. Phylogenetic analyses recovered the monophyly of major clades of Onychophora and revealed a novel lineage from the Neotropics that challenges our current understanding of onychophoran biogeographic endemicity. This new resource could drastically increase the power of molecular datasets and potentially allow access to genomic scale data from archival museum specimens to further tackle the issues exasperating onychophoran systematics. 
    more » « less