skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Gene Flow Increases Phylogenetic Structure and Inflates Cryptic Species Estimations: A Case Study on Widespread Philippine Puddle Frogs (Occidozyga laevis)
In cryptic amphibian complexes, there is a growing trend to equate high levels of genetic structure with hidden cryptic species diversity. Typically, phylogenetic structure and distance-based approaches are used to demonstrate the distinctness of clades and justify the recognition of new cryptic species. However, this approach does not account for gene flow, spatial, and environmental processes that can obfuscate phylogenetic inference and bias species delimitation. As a case study, we sequenced genome-wide exons and introns to evince the processes that underlie the diversification of Philippine Puddle Frogs—a group that is widespread, phenotypically conserved, and exhibits high levels of geographically based genetic structure. We showed that widely adopted tree- and distance-based approaches inferred up to 20 species, compared to genomic analyses that inferred an optimal number of five distinct genetic groups. Using a suite of clustering, admixture, and phylogenetic network analyses, we demonstrate extensive admixture among the five groups and elucidate two specificways in which gene flowcan cause overestimations of species diversity: 1) admixed populations can be inferred as distinct lineages characterized by long branches in phylograms; and 2) admixed lineages can appear to be genetically divergent, even from their parental populations when simple measures of genetic distance are used. We demonstrate that the relationship between mitochondrial and genome-wide nuclear p-distances is decoupled in admixed clades, leading to erroneous estimates of genetic distances and, consequently, species diversity. Additionally, genetic distance was also biased by spatial and environmental processes. Overall, we showed that high levels of genetic diversity in Philippine Puddle Frogs predominantly comprise metapopulation lineages that arose through complex patterns of admixture, isolation-bydistance, and isolation-by-environment as opposed to species divergence. Our findings suggest that speciation may not be the major process underlying the high levels of hidden diversity observed in many taxonomic groups and that widely adopted tree- and distance-based methods overestimate species diversity in the presence of gene flow.  more » « less
Award ID(s):
1654388
PAR ID:
10295334
Author(s) / Creator(s):
Date Published:
Journal Name:
Systematic biology
ISSN:
1063-5157
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Burbrink, Frank (Ed.)
    Abstract In cryptic amphibian complexes, there is a growing trend to equate high levels of genetic structure with hidden cryptic species diversity. Typically, phylogenetic structure and distance-based approaches are used to demonstrate the distinctness of clades and justify the recognition of new cryptic species. However, this approach does not account for gene flow, spatial, and environmental processes that can obfuscate phylogenetic inference and bias species delimitation. As a case study, we sequenced genome-wide exons and introns to evince the processes that underlie the diversification of Philippine Puddle Frogs—a group that is widespread, phenotypically conserved, and exhibits high levels of geographically based genetic structure. We showed that widely adopted tree- and distance-based approaches inferred up to 20 species, compared to genomic analyses that inferred an optimal number of five distinct genetic groups. Using a suite of clustering, admixture, and phylogenetic network analyses, we demonstrate extensive admixture among the five groups and elucidate two specific ways in which gene flow can cause overestimations of species diversity: 1) admixed populations can be inferred as distinct lineages characterized by long branches in phylograms; and 2) admixed lineages can appear to be genetically divergent, even from their parental populations when simple measures of genetic distance are used. We demonstrate that the relationship between mitochondrial and genome-wide nuclear $$p$$-distances is decoupled in admixed clades, leading to erroneous estimates of genetic distances and, consequently, species diversity. Additionally, genetic distance was also biased by spatial and environmental processes. Overall, we showed that high levels of genetic diversity in Philippine Puddle Frogs predominantly comprise metapopulation lineages that arose through complex patterns of admixture, isolation-by-distance, and isolation-by-environment as opposed to species divergence. Our findings suggest that speciation may not be the major process underlying the high levels of hidden diversity observed in many taxonomic groups and that widely adopted tree- and distance-based methods overestimate species diversity in the presence of gene flow. [Cryptic species; gene flow; introgression; isolation-by-distance; isolation-by-environment; phylogenetic network; species delimitation.] 
    more » « less
  2. null (Ed.)
    In cryptic amphibian complexes, there is a growing trend to equate high levels of genetic structure with hidden cryptic species diversity. Typically, phylogenetic structure and distance-based approaches are used to demonstrate the distinctness of clades and justify the recognition of new cryptic species. However, this approach does not account for gene flow, spatial, and environmental processes that can obfuscate phylogenetic inference and bias species delimitation. As a case study, we sequenced genome-wide exons and introns to evince the processes that underlie the diversification of Philippine Puddle Frogs—a group that is widespread, phenotypically conserved, and exhibits high levels of geographically based genetic structure. We showed that widely adopted tree- and distance-based approaches inferred up to 20 species, compared to genomic analyses that inferred an optimal number of five distinct genetic groups. Using a suite of clustering, admixture, and phylogenetic network analyses, we demonstrate extensive admixture among the five groups and elucidate two specificways in which gene flowcan cause overestimations of species diversity: 1) admixed populations can be inferred as distinct lineages characterized by long branches in phylograms; and 2) admixed lineages can appear to be genetically divergent, even from their parental populations when simple measures of genetic distance are used. We demonstrate that the relationship between mitochondrial and genome-wide nuclear p-distances is decoupled in admixed clades, leading to erroneous estimates of genetic distances and, consequently, species diversity. Additionally, genetic distance was also biased by spatial and environmental processes. Overall, we showed that high levels of genetic diversity in Philippine Puddle Frogs predominantly comprise metapopulation lineages that arose through complex patterns of admixture, isolation-bydistance, and isolation-by-environment as opposed to species divergence. Our findings suggest that speciation may not be the major process underlying the high levels of hidden diversity observed in many taxonomic groups and that widely adopted tree- and distance-based methods overestimate species diversity in the presence of gene flow. 
    more » « less
  3. Abstract Most new cryptic species are described using conventional tree‐ and distance‐based species delimitation methods (SDMs), which rely on phylogenetic arrangements and measures of genetic divergence. However, although numerous factors such as population structure and gene flow are known to confound phylogenetic inference and species delimitation, the influence of these processes is not frequently evaluated. Using large numbers of exons, introns, and ultraconserved elements obtained using the FrogCap sequence‐capture protocol, we compared conventional SDMs with more robust genomic analyses that assess population structure and gene flow to characterize species boundaries in a Southeast Asian frog complex (Pulchrana picturata). Our results showed that gene flow and introgression can produce phylogenetic patterns and levels of divergence that resemble distinct species (up to 10% divergence in mitochondrial DNA). Hybrid populations were inferred as independent (singleton) clades that were highly divergent from adjacent populations (7%–10%) and unusually similar (<3%) to allopatric populations. Such anomalous patterns are not uncommon in Southeast Asian amphibians, which brings into question whether the high levels of cryptic diversity observed in other amphibian groups reflect distinct cryptic species—or, instead, highly admixed and structured metapopulation lineages. Our results also provide an alternative explanation to the conundrum of divergent (sometimes nonsister) sympatric lineages—a pattern that has been celebrated as indicative of true cryptic speciation. Based on these findings, we recommend that species delimitation of continuously distributed “cryptic” groups should not rely solely on conventional SDMs, but should necessarily examine population structure and gene flow to avoid taxonomic inflation. 
    more » « less
  4. Abstract Genomic‐scale datasets, sophisticated analytical techniques, and conceptual advances have disproportionately failed to resolve species boundaries in some groups relative to others. To understand the processes that underlie taxonomic intractability, we dissect the speciation history of an Australian lizard clade that arguably represents a “worst‐case” scenario for species delimitation within vertebrates: theCtenotus inornatusspecies group, a clade beset with decoupled genetic and phenotypic breaks, uncertain geographic ranges, and parallelism in purportedly diagnostic morphological characters. We sampled hundreds of localities to generate a genomic perspective on population divergence, structure, and admixture. Our results revealed rampant paraphyly of nominate taxa in the group, with lineages that are either morphologically cryptic or polytypic. Isolation‐by‐distance patterns reflect spatially continuous differentiation among certain pairs of putative species, yet genetic and geographic distances are decoupled in other pairs. Comparisons of mitochondrial and nuclear gene trees, tests of nuclear introgression, and historical demographic modelling identified gene flow between divergent candidate species. Levels of admixture are decoupled from phylogenetic relatedness; gene flow is often higher between sympatric species than between parapatric populations of the same species. Such idiosyncratic patterns of introgression contribute to species boundaries that are fuzzy while also varying in fuzziness. Our results suggest that “taxonomic disaster zones” like theC. inornatusspecies group result from spatial variation in the porosity of species boundaries and the resulting patterns of genetic and phenotypic variation. This study raises questions about the origin and persistence of hybridizing species and highlights the unique insights provided by taxa that have long eluded straightforward taxonomic categorization. 
    more » « less
  5. null (Ed.)
    One of the most urgent contemporary tasks for taxonomists and evolutionary biologists is to estimate the number of species on earth. Recording alpha diversity is crucial for protecting biodiversity, especially in areas of elevated species richness, which coincide geographically with increased anthropogenic environmental pressures - the world’s so-called biodiversity hotspots. Although the distribution of Puddle frogs of the genus Occidozyga in South and Southeast Asia includes five biodiversity hotspots, the available data on phylogeny, species diversity, and biogeography are surprisingly patchy. Samples analyzed in this study were collected throughout Southeast Asia, with a primary focus on Sundaland and the Philippines. A mitochondrial gene region comprising ~ 2000 bp of 12S and 16S rRNA with intervening tRNA Valine and three nuclear loci (BDNF, NTF3, POMC) were analyzed to obtain a robust, time-calibrated phylogenetic hypothesis. We found a surprisingly high level of genetic diversity within Occidozyga, based on uncorrected p-distance values corroborated by species delimitation analyses. This extensive genetic diversity revealed 29 evolutionary lineages, defined by the > 5% uncorrected p-distance criterion for the 16S rRNA gene, suggesting that species diversity in this clade of phenotypically homogeneous forms probably has been underestimated. The comparison with results of other anuran groups leads to the assumption that anuran species diversity could still be substantially underestimated in Southeast Asia in general. Many genetically divergent lineages of frogs are phenotypically similar, indicating a tendency towards extensive morphological conservatism. We present a biogeographic reconstruction of the colonization of Sundaland and nearby islands which, together with our temporal framework, suggests that lineage diversification centered on the landmasses of the northern Sunda Shelf. This remarkably genetically structured group of amphibians could represent an exceptional case for future studies of geographical structure and diversification in a widespread anuran clade spanning some of the most pronounced geographical barriers on the planet (e.g., Wallace’s Line). Studies considering gene flow, morphology, ecological and bioacoustic data are needed to answer these questions and to test whether observed diversity of Puddle frog lineages warrants taxonomic recognition. 
    more » « less