A challenge to understanding biological diversification is accounting for community-scale processes that cause multiple, co-distributed lineages to co-speciate. Such processes predict non-independent, temporally clustered divergences across taxa. Approximate-likelihood Bayesian computation (ABC) approaches to inferring such patterns from comparative genetic data are very sensitive to prior assumptions and often biased toward estimating shared divergences. We introduce a full-likelihood Bayesian approach, ecoevolity, which takes full advantage of information in genomic data. By analytically integrating over gene trees, we are able to directly calculate the likelihood of the population history from genomic data, and efficiently sample the model-averaged posterior via Markov chain Monte Carlo algorithms. Using simulations, we find that the new method is much more accurate and precise at estimating the number and timing of divergence events across pairs of populations than existing approximate-likelihood methods. Our full Bayesian approach also requires several orders of magnitude less computational time than existing ABC approaches. We find that despite assuming unlinked characters (e.g., unlinked single-nucleotide polymorphisms), the new method performs better if this assumption is violated in order to retain the constant characters of whole linked loci. In fact, retaining constant characters allows the new method to robustly estimate the correct number of divergence events with high posterior probability in the face of character-acquisition biases, which commonly plague loci assembled from reduced-representation genomic libraries. We apply our method to genomic data from four pairs of insular populations of Gekko lizards from the Philippines that are not expected to have co-diverged. Despite all four pairs diverging very recently, our method strongly supports that they diverged independently, and these results are robust to very disparate prior assumptions.
more »
« less
Generalizing Bayesian phylogenetics to infer shared evolutionary events
Many processes of biological diversification can simultaneously affect multiple evolutionary lineages. Examples include multiple members of a gene family diverging when a region of a chromosome is duplicated, multiple viral strains diverging at a “super-spreading” event, and a geological event fragmenting whole communities of species. It is difficult to test for patterns of shared divergences predicted by such processes because all phylogenetic methods assume that lineages diverge independently. We introduce a Bayesian phylogenetic approach to relax the assumption of independent, bifurcating divergences by expanding the space of topologies to include trees with shared and multifurcating divergences. This allows us to jointly infer phylogenetic relationships, divergence times, and patterns of divergences predicted by processes of diversification that affect multiple evolutionary lineages simultaneously or lead to more than two descendant lineages. Using simulations, we find that the method accurately infers shared and multifurcating divergence events when they occur and performs as well as current phylogenetic methods when divergences are independent and bifurcating. We apply our approach to genomic data from two genera of geckos from across the Philippines to test if past changes to the islands’ landscape caused bursts of speciation. Unlike previous analyses restricted to only pairs of gecko populations, we find evidence for patterns of shared divergences. By generalizing the space of phylogenetic trees in a way that is independent from the likelihood model, our approach opens many avenues for future research into processes of diversification across the life sciences.
more »
« less
- Award ID(s):
- 1654388
- PAR ID:
- 10427871
- Date Published:
- Journal Name:
- Proceedings of the National Academy of Sciences
- Volume:
- 119
- Issue:
- 29
- ISSN:
- 0027-8424
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Analyses of evolutionary dynamics depend on how phylogenetic data are time-scaled. Most analyses of extant taxa assume a purely bifurcating model, where nodes are calibrated using the daughter lineage with the older first occurrence in the fossil record. This contrasts with budding, where nodes are calibrated using the younger first occurrence. Here, we use the extensive fossil record of bivalve molluscs for a large-scale evaluation of how branching models affect macroevolutionary analyses. We time-calibrated 91% of nodes, ranging in age from 2.59 to 485 Ma, in a phylogeny of 97 extant bivalve families. Allowing budding-based calibrations minimizes conflict between the tree and observed fossil record, and reduces the summed duration of inferred ‘ghost lineages’ from 6.76 billion years (Gyr; bifurcating model) to 1.00 Gyr (budding). Adding 31 extinct paraphyletic families raises ghost lineage totals to 7.86 Gyr (bifurcating) and 1.92 Gyr (budding), but incorporates more information to date divergences between lineages. Macroevolutionary analyses under a bifurcating model conflict with other palaeontological evidence on the magnitude of the end-Palaeozoic extinction, and strongly reduce Cenozoic diversification. Consideration of different branching models is essential when node-calibrating phylogenies, and for a major clade with a robust fossil record, a budding model appears more appropriate.more » « less
-
Carstens, Bryan (Ed.)Gene flow between diverging lineages challenges the resolution of species boundaries and the understanding of evolutionary history in recent radiations. Here, we integrate phylogenetic and coalescent tools to resolve reticulate patterns of diversification and use a perspective focused on evolutionary mechanisms to distinguish interspecific and intraspecific taxonomic variation. We use this approach to resolve the systematics for one of the most intensively studied but difficult to understand groups of reptiles: the spotted whiptail lizards of the genus Aspidoscelis (A. gularis complex). Whiptails contain the largest number of unisexual species known within any vertebrate group and the spotted whiptail complex has played a key role in the generation of this diversity through hybrid speciation. Understanding lineage boundaries and the evolutionary history of divergence and reticulation within this group is therefore key to understanding the generation of unisexual diversity in whiptails. Despite this importance, long-standing confusion about their systematics has impeded understanding of which gonochoristic species have contributed to the formation of unisexual lineages. Using reduced representation genomic data, we resolve patterns of divergence and gene flow within the spotted whiptails and clarify patterns of hybrid speciation. We find evidence that biogeographically structured ecological and environmental variation has been important in morphological and genetic diversification, as well as the maintenance of species boundaries in this system. Our study elucidates how gene flow among lineages and the continuous nature of speciation can bias the practice of species delimitation and lead taxonomists operating under different frameworks to different conclusions (here we propose that a 2 species arrangement best reflects our current understanding). In doing so, this study provides conceptual and methodological insights into approaches to resolving diversification patterns and species boundaries in rapid radiations with complex histories, as well as long-standing taxonomic challenges in the field of systematic biology.more » « less
-
Colijn and Plazzotta (2018) [1] described a bijective scheme for associating the unlabeled bifurcating rooted trees with the positive integers. In mathematical and biological applications of unlabeled rooted trees, however, nodes of rooted trees are sometimes multifurcating rather than bifurcating. Building on the bijection between the unlabeled bifurcating rooted trees and the positive integers, we describe bijective schemes for associating the unlabeled multifurcating rooted trees with the positive integers. We devise bijections with the positive integers for a set of trees in which each non-leaf node has exactly k child nodes, and for a set of trees in which each non-leaf node has at most k child nodes. The calculations make use of Macaulay's binomial expansion formula. The generalization to multifurcating trees can assist with the use of unlabeled trees for applications in evolutionary biology, such as the measurement of phylogenetic patterns of genetic lineages in pathogens.more » « less
-
Abstract Simultaneous molecular dating of population and species divergences is essential in many biological investigations, including phylogeography, phylodynamics and species delimitation studies. In these investigations, multiple sequence alignments consist of both intra‐ and interspecies samples (mixed samples). As a result, the phylogenetic trees contain interspecies, interpopulation and within‐population divergences. Bayesian relaxed clock methods are often employed in these analyses, but they assume the same tree prior for both inter‐ and intraspecies branching processes and require specification of a clock model for branch rates (independent vs. autocorrelated rates models). We evaluated the impact of a single tree prior on Bayesian divergence time estimates by analysing computer‐simulated data sets. We also examined the effect of the assumption of independence of evolutionary rate variation among branches when the branch rates are autocorrelated. Bayesian approach with coalescent tree priors generally produced excellent molecular dates and highest posterior densities with high coverage probabilities. We also evaluated the performance of a non‐Bayesian method, RelTime, which does not require the specification of a tree prior or a clock model. RelTime's performance was similar to that of the Bayesian approach, suggesting that it is also suitable to analyse data sets containing both populations and species variation when its computational efficiency is needed.more » « less
An official website of the United States government

