skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on February 13, 2026

Title: Fossils improve extinction-rate estimates under state-dependent diversification models
The effect of traits on diversification rates is a major topic of study in the fields of evolutionary biology and palaeontology. Many researchers investigating these macroevolutionary questions currently make use of the extensive suite of state-dependent speciation and extinction (SSE) models. These models were developed for, and are almost exclusively used with, phylogenetic trees of extant species. However, analyses considering only extant taxa are limited in their power to estimate extinction rates. Furthermore, SSE models can erroneously detect associations between neutral traits and diversification rates when the true associated trait is not observed. In this study, we examined the impact of including fossil data on the accuracy of parameter estimates under the binary-state speciation and extinction (BiSSE) model. This was achieved by combining SSE models with the fossilized birth–death process. We show that the inclusion of fossils improves the accuracy of extinction-rate estimates for analyses applying the BiSSE model in a Bayesian inference framework, with no negative impact on speciation-rate and state transition-rate estimates when compared with estimates from trees of only extant taxa. However, even with the addition of fossil data, analyses under the BiSSE model continued to incorrectly identify correlations between diversification rates and neutral traits. This article is part of the theme issue ‘“A mathematical theory of evolution”: phylogenetic models dating back 100 years’.  more » « less
Award ID(s):
2346172
PAR ID:
10595212
Author(s) / Creator(s):
; ;
Publisher / Repository:
The Royal Society Publishing
Date Published:
Journal Name:
Philosophical Transactions of the Royal Society B: Biological Sciences
Volume:
380
Issue:
1919
ISSN:
0962-8436
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Ryan Folk (Ed.)
    Phylogenetic divergence-time estimation has been revolutionized by two recent developments: 1) total-evidence dating (or "tip-dating") approaches that allow for the incorporation of fossils as tips in the analysis, with their phylogenetic and temporal relationships to the extant taxa inferred from the data and 2) the fossilized birth-death (FBD) class of tree models that capture the processes that produce the tree (speciation, extinction, and fossilization) and thus provide a coherent and biologically interpretable tree prior. To explore the behavior of these methods, we apply them to marattialean ferns, a group that was dominant in Carboniferous landscapes prior to declining to its modest extant diversity of slightly over 100 species. We show that tree models have a dramatic influence on estimates of both divergence times and topological relationships. This influence is driven by the strong, counter-intuitive informativeness of the uniform tree prior, and the inherent nonidentifiability of divergence-time models. In contrast to the strong influence of the tree models, we find minor effects of differing the morphological transition model or the morphological clock model. We compare the performance of a large pool of candidate models using a combination of posterior-predictive simulation and Bayes factors. Notably, an FBD model with epoch-specific speciation and extinction rates was strongly favored by Bayes factors. Our best-fitting model infers stem and crown divergences for the Marattiales in the mid-Devonian and Late Cretaceous, respectively, with elevated speciation rates in the Mississippian and elevated extinction rates in the Cisuralian leading to a peak diversity of ∼2800 species at the end of the Carboniferous, representing the heyday of the Psaroniaceae. This peak is followed by the rapid decline and ultimate extinction of the Psaroniaceae, with their descendants, the Marattiaceae, persisting at approximately stable levels of diversity until the present. This general diversification pattern appears to be insensitive to potential biases in the fossil record; despite the preponderance of available fossils being from Pennsylvanian coal balls, incorporating fossilization-rate variation does not improve model fit. In addition, by incorporating temporal data directly within the model and allowing for the inference of the phylogenetic position of the fossils, our study makes the surprising inference that the clade of extant Marattiales is relatively young, younger than any of the fossils historically thought to be congeneric with extant species. This result is a dramatic demonstration of the dangers of node-based approaches to divergence-time estimation, where the assignment of fossils to particular clades is made a priori (earlier node-based studies that constrained the minimum ages of extant genera based on these fossils resulted in much older age estimates than in our study) and of the utility of explicit models of morphological evolution and lineage diversification. [Bayesian model comparison; Carboniferous; divergence-time estimation; fossil record; fossilized birth–death; lineage diversification; Marattiales; models of morphological evolution; Psaronius; RevBayes.] 
    more » « less
  2. Folk, Ryan (Ed.)
    Abstract Phylogenetic divergence-time estimation has been revolutionized by two recent developments: 1) total-evidence dating (or "tip-dating") approaches that allow for the incorporation of fossils as tips in the analysis, with their phylogenetic and temporal relationships to the extant taxa inferred from the data and 2) the fossilized birth-death (FBD) class of tree models that capture the processes that produce the tree (speciation, extinction, and fossilization) and thus provide a coherent and biologically interpretable tree prior. To explore the behavior of these methods, we apply them to marattialean ferns, a group that was dominant in Carboniferous landscapes prior to declining to its modest extant diversity of slightly over 100 species. We show that tree models have a dramatic influence on estimates of both divergence times and topological relationships. This influence is driven by the strong, counter-intuitive informativeness of the uniform tree prior, and the inherent nonidentifiability of divergence-time models. In contrast to the strong influence of the tree models, we find minor effects of differing the morphological transition model or the morphological clock model. We compare the performance of a large pool of candidate models using a combination of posterior-predictive simulation and Bayes factors. Notably, an FBD model with epoch-specific speciation and extinction rates was strongly favored by Bayes factors. Our best-fitting model infers stem and crown divergences for the Marattiales in the mid-Devonian and Late Cretaceous, respectively, with elevated speciation rates in the Mississippian and elevated extinction rates in the Cisuralian leading to a peak diversity of $${\sim}$$2800 species at the end of the Carboniferous, representing the heyday of the Psaroniaceae. This peak is followed by the rapid decline and ultimate extinction of the Psaroniaceae, with their descendants, the Marattiaceae, persisting at approximately stable levels of diversity until the present. This general diversification pattern appears to be insensitive to potential biases in the fossil record; despite the preponderance of available fossils being from Pennsylvanian coal balls, incorporating fossilization-rate variation does not improve model fit. In addition, by incorporating temporal data directly within the model and allowing for the inference of the phylogenetic position of the fossils, our study makes the surprising inference that the clade of extant Marattiales is relatively young, younger than any of the fossils historically thought to be congeneric with extant species. This result is a dramatic demonstration of the dangers of node-based approaches to divergence-time estimation, where the assignment of fossils to particular clades is made a priori (earlier node-based studies that constrained the minimum ages of extant genera based on these fossils resulted in much older age estimates than in our study) and of the utility of explicit models of morphological evolution and lineage diversification. [Bayesian model comparison; Carboniferous; divergence-time estimation; fossil record; fossilized birth–death; lineage diversification; Marattiales; models of morphological evolution; Psaronius; RevBayes.] 
    more » « less
  3. Abstract Traits that have arisen multiple times yet still remain rare present a curious paradox. A number of these rare traits show a distinct tippy pattern, where they appear widely dispersed across a phylogeny, are associated with short branches and differ between recently diverged sister species. This phylogenetic pattern has classically been attributed to the trait being an evolutionary dead end, where the trait arises due to some short‐term evolutionary advantage, but it ultimately leads species to extinction. While the higher extinction rate associated with a dead end trait could produce such a tippy pattern, a similar pattern could appear if lineages with the trait speciated slower than other lineages, or if the trait was lost more often that it was gained. In this study, we quantify the degree of tippiness of red flowers in the tomato family, Solanaceae, and investigate the macroevolutionary processes that could explain the sparse phylogenetic distribution of this trait. Using a suite of metrics, we confirm that red‐flowered lineages are significantly overdispersed across the tree and form smaller clades than expected under a null model. Next, we fit 22 alternative models using HiSSE(Hidden State Speciation and Extinction), which accommodates asymmetries in speciation, extinction and transition rates that depend on observed and unobserved (hidden) character states. Results of the model fitting indicated significant variation in diversification rates across the family, which is best explained by the inclusion of hidden states. Our best fitting model differs between the maximum clade credibility tree and when incorporating phylogenetic uncertainty, suggesting that the extreme tippiness and rarity of red Solanaceae flowers makes it difficult to distinguish among different underlying processes. However, both of the best models strongly support a bias towards the loss of red flowers. The best fitting HiSSEmodel when incorporating phylogenetic uncertainty lends some support to the hypothesis that lineages with red flowers exhibit reduced diversification rates due to elevated extinction rates. Future studies employing simulations or targeting population‐level processes may allow us to determine whether red flowers in Solanaceae or other angiosperms clades are rare and tippy due to a combination of processes, or asymmetrical transitions alone. 
    more » « less
  4. Abstract Identifying along which lineages shifts in diversification rates occur is a central goal of comparative phylogenetics; these shifts may coincide with key evolutionary events such as the development of novel morphological characters, the acquisition of adaptive traits, polyploidization or other structural genomic changes, or dispersal to a new habitat and subsequent increase in environmental niche space. However, while multiple methods now exist to estimate diversification rates and identify shifts using phylogenetic topologies, the appropriate use and accuracy of these methods are hotly debated. Here we test whether five Bayesian methods—Bayesian Analysis of Macroevolutionary Mixtures (BAMM), two implementations of the Lineage-Specific Birth–Death–Shift model (LSBDS and PESTO), the approximate Multi-Type Birth–Death model (MTBD; implemented in BEAST2), and the Cladogenetic Diversification Rate Shift model (ClaDS2)—produce comparable results. We apply each of these methods to a set of 65 empirical time-calibrated phylogenies and compare inferences of speciation rate, extinction rate, and net diversification rate. We find that the five methods often infer different speciation, extinction, and net-diversification rates. Consequently, these different estimates may lead to different interpretations of the macroevolutionary dynamics. The different estimates can be attributed to fundamental differences among the compared models. Therefore, the inference of shifts in diversification rates is strongly method dependent. We advise biologists to apply multiple methods to test the robustness of the conclusions or to carefully select the method based on the validity of the underlying model assumptions to their particular empirical system. 
    more » « less
  5. Abstract Estimating speciation and extinction rates is essential for understanding past and present biodiversity, but is challenging given the incompleteness of the rock and fossil records. Interest in this topic has led to a divergent suite of independent methods—paleontological estimates based on sampled stratigraphic ranges and phylogenetic estimates based on the observed branching times in a given phylogeny of living species. The fossilized birth–death (FBD) process is a model that explicitly recognizes that the branching events in a phylogenetic tree and sampled fossils were generated by the same underlying diversification process. A crucial advantage of this model is that it incorporates the possibility that some species may never be sampled. Here, we present an FBD model that estimates tree-wide diversification rates from stratigraphic range data when the underlying phylogeny of the fossil taxa may be unknown. The model can be applied when only occurrence data for taxonomically identified fossils are available, but still accounts for the incomplete phylogenetic structure of the data. We tested this new model using simulations and focused on how inferences are impacted by incomplete fossil recovery. We compared our approach with a phylogenetic model that does not incorporate incomplete species sampling and to three fossil-based alternatives for estimating diversification rates, including the widely implemented boundary-crosser and three-timer methods. The results of our simulations demonstrate that estimates under the FBD model are robust and more accurate than the alternative methods, particularly when fossil data are sparse, as the FBD model incorporates incomplete species sampling explicitly. 
    more » « less