skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: A rodent anchored hybrid enrichment probe set for a range of phylogenetic utility: From order to species
Abstract Rodents are the largest order of mammals and contain several model organisms important to scientific research in a variety of fields, yet no large set of genomic markers have been designed for this group to date, hindering evolutionary studies into relationships of the group as a whole. Here we present a genomic probe set designed and optimized for rodents with a protocol that is easy to replicate with little laboratory investment. This design utilizes an anchored hybrid enrichment approach specifically targeting rodents to generate longer loci with a higher substitution rate than existing vertebrate probes to provide utility at various taxonomic levels. Using a test set of rodents from all five suborders, we successfully obtained alignments for 416 of the 418 target loci with an average of 1379 bp per locus and a total alignment of more than half a million base pairs. This genomic data set performed well in all phylogenetic analyses, especially in recent phylogenetic splits, with ample parsimony‐informative sites within genera and even within species, showing more than four times as many single nucleotide polymorphisms per locus than a recent vertebrate ultraconserved elements study. Additional support is provided in resolving deeper clades in Rodentia. By providing this probe design, we hope that more laboratories can easily generate data for answering questions in rodents from species delimitation to understanding relationships among families in rapid radiations.  more » « less
Award ID(s):
1754748
PAR ID:
10445857
Author(s) / Creator(s):
 ;  
Publisher / Repository:
Wiley-Blackwell
Date Published:
Journal Name:
Molecular Ecology Resources
Volume:
22
Issue:
4
ISSN:
1755-098X
Page Range / eLocation ID:
p. 1521-1528
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract PremiseRubiaceae is among the most species‐rich plant families, as well as one of the most morphologically and geographically diverse. Currently available phylogenies have mostly relied on few genomic and plastid loci, as opposed to large‐scale genomic data. Target enrichment provides the ability to generate sequence data for hundreds to thousands of phylogenetically informative, single‐copy loci, which often leads to improved phylogenetic resolution at both shallow and deep taxonomic scales; however, a publicly accessible Rubiaceae‐specific probe set that allows for comparable phylogenetic inference across clades is lacking. MethodsHere, we use publicly accessible genomic resources to identify putatively single‐copy nuclear loci for target enrichment in two Rubiaceae groups: tribe Hillieae (Cinchonoideae) and tribal complex Palicoureeae+Psychotrieae (Rubioideae). We sequenced 2270 exonic regions corresponding to 1059 loci in our target clades and generated in silico target enrichment sequences for other Rubiaceae taxa using our designed probe set. To test the utility of our probe set for phylogenetic inference across Rubiaceae, we performed a coalescent‐aware phylogenetic analysis using a subset of 27 Rubiaceae taxa from 10 different tribes and three subfamilies, and one outgroup in Apocynaceae. ResultsWe recovered an average of 75% and 84% of targeted exons and loci, respectively, per Rubiaceae sample. Probes designed using genomic resources from a particular subfamily were most efficient at targeting sequences from taxa in that subfamily. The number of paralogs recovered during assembly varied for each clade. Phylogenetic inference of Rubiaceae with our target regions resolves relationships at various scales. Relationships are largely consistent with previous studies of relationships in the family with high support (≥0.98 local posterior probability) at nearly all nodes and evidence of gene tree discordance. DiscussionOur probe set, which we call Rubiaceae2270x, was effective for targeting loci in species across and even outside of Rubiaceae. This probe set will facilitate phylogenomic studies in Rubiaceae and advance systematics and macroevolutionary studies in the family. 
    more » « less
  2. Abstract Anchored hybrid enrichment (AHE) has emerged as a powerful tool for uncovering the evolutionary relationships within many taxonomic groups. AHE probe sets have been developed for a variety of insect groups, though none have yet been shown to be capable of simultaneously resolving deep and very shallow (e.g., intraspecific) divergences. In this study, we present NOC1, a new AHE probe set (730 loci) for Lepidoptera specialized for tiger moths and assess its ability to deliver phylogenetic utility at all taxonomic levels. We test the NOC1 probe set with 142 individuals from 116 species sampled from all the major lineages of Arctiinae (Erebidae), one of the most diverse groups of noctuoids (>11 000 species) for which no well‐resolved, strongly supported phylogenetic hypothesis exists. Compared to previous methods, we generally recover much higher branch support (BS), resulting in the most well‐supported, well‐resolved phylogeny of Arctiinae to date. At the most shallow‐levels, NOC1 confidently resolves species‐level and intraspecific relationships and potentially uncovers cryptic species diversity within the genusHypoprepia. We also implement a ‘sensitivity analysis’ to explore different loci combinations and site sampling strategies to determine whether a reduced probe set can yield results similar to those of the full probe set. At both deep and shallow levels, only 50–175 of the 730 loci included in the complete NOC1 probe set were necessary to resolve most relationships with high confidence, though only when the more rapidly evolving sites within each locus are included. This demonstrates that AHE probe sets can be tailored to target fewer loci without a significant reduction in BS, allowing future studies to incorporate more taxa at a lower per‐sample sequencing cost. NOC1 shows great promise for resolving long‐standing taxonomic issues and evolutionary questions within arctiine lineages, one of the most speciose clades within Lepidoptera. 
    more » « less
  3. PremisePhylogenetic relationships within major angiosperm clades are increasingly well resolved, but largely informed by plastid data. Areas of poor resolution persist within the Dipsacales, including placement ofHeptacodiumandZabelia, and relationships within the Caprifolieae and Linnaeeae, hindering our interpretation of morphological evolution. Here, we sampled a significant number of nuclear loci using a Hyb‐Seq approach and used these data to infer the Dipsacales phylogeny and estimate divergence times. MethodsSampling all major clades within the Dipsacales, we applied the Angiosperms353 probe set to 96 species. Data were filtered based on locus completeness and taxon recovery per locus, and trees were inferred using RAxML and ASTRAL. Plastid loci were assembled from off‐target reads, and 10 fossils were used to calibrate dated trees. ResultsVarying numbers of targeted loci and off‐target plastomes were recovered from most taxa. Nuclear and plastid data confidently placeHeptacodiumwith Caprifolieae, implying homoplasy in calyx morphology, ovary development, and fruit type. Placement ofZabelia, and relationships within the Caprifolieae and Linnaeeae, remain uncertain. Dipsacales diversification began earlier than suggested by previous angiosperm‐wide dating analyses, but many major splitting events date to the Eocene. ConclusionsThe Angiosperms353 probe set facilitated the assembly of a large, single‐copy nuclear dataset for the Dipsacales. Nevertheless, many relationships remain unresolved, and resolution was poor for woody clades with low rates of molecular evolution. We favor expanding the Angiosperms353 probe set to include more variable loci and loci of special interest, such as developmental genes, within particular clades. 
    more » « less
  4. Onychophora are cryptic, soil-dwelling invertebrates known for their biogeographic affinities, diversity of reproductive modes, close phylogenetic relationship to arthropods, and peculiar prey capture mechanism. The 216 valid species of Onychophora are grouped into two families – Peripatopsidae and Peripatidae – and apart from a few relationships among major lineages within these two families, a stable phylogenetic backbone for the phylum has yet to be resolved. This has hindered our understanding of onychophoran biogeographic patterns, evolutionary history, and systematics. Neopatida, the Neotropical clade of peripatids, has proved particularly difficult, with recalcitrant nodes and low resolution, potentially due to rapid radiation of the group during the Cretaceous. Previous studies have had to compromise between number of loci and number of taxa due to limitations of Sanger sequencing and phylotranscriptomics, respectively. Additionally, aspects of their genome size and structure have made molecular phylogenetics difficult and data matrices have been affected by missing data. To address these issues, we leveraged recent, published transcriptomes and the first high quality genome for the phylum and designed a high affinity ultraconserved element (UCE) probe set for Onychophora. This new probe set, consisting of ~ 20,000 probes that target 1,465 loci across both families, has high locus recovery and phylogenetic utility. Phylogenetic analyses recovered the monophyly of major clades of Onychophora and revealed a novel lineage from the Neotropics that challenges our current understanding of onychophoran biogeographic endemicity. This new resource could drastically increase the power of molecular datasets and potentially allow access to genomic scale data from archival museum specimens to further tackle the issues exasperating onychophoran systematics. 
    more » « less
  5. Yoshizawa, Kazunori (Ed.)
    Abstract The order Psocodea includes the two historically recognized groups Psocoptera (free-living bark lice) and Phthiraptera (parasitic lice) that were once considered separate orders. Psocodea is divided in three suborders: Trogiomorpha, Troctomorpha, and Psocomorpha, the latter being the largest within the free-living groups. Despite the increasing number of transcriptomes and whole genome sequence (WGS) data available for this group, the relationships among the six known infraorders within Psocomorpha remain unclear. Here, we evaluated the utility of a bait set designed specifically for parasitic lice belonging to suborder Troctomorpha to extract UCE loci from transcriptome and WGS data of 55 bark louse species and explored the phylogenetic relationships within Psocomorpha using these UCE loci markers. Taxon sampling was heavily focused on the families Lachesillidae and Elipsocidae, whose relationships have been problematic in prior phylogenetic studies. We successfully recovered a total of 2,622 UCE loci, with a 40% completeness matrix containing 2,081 UCE loci and an 80% completeness matrix containing 178 UCE loci. The average number of UCE loci recovered for the 55 species was 1,401. The WGS data sets produced a larger number of UCE loci (1,495) on average than the transcriptome data sets (972). Phylogenetic relationships reconstructed with Maximum Likelihood and coalescent-based analysis were concordant regarding the paraphyly of Lachesillidae and Elipsocidae. Branch support values were generally lower in analyses that used a fewer number of loci, even though they had higher matrix completeness. 
    more » « less