skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on May 19, 2026

Title: When the Map Fails the Territory: Hidden State Models, Complex Traits and the Evolution of Bird Migration
Phylogenetic comparative methods often rely on simplifying complex biological traits into discrete categories, potentially obscuring evolutionary patterns and generally limiting inferences. This dissertation confronts this ``map versus territory" problem by developing and evaluating methodological approaches that integrate known and unknown trait complexity into macroevolutionary analyses. To establish the statistical power of discrete methods in detecting trait complexity, I first demonstrate the utility of structured hidden Markov models (SHMMs) for identifying underlying continuous architectures, like threshold traits, within simulated and empirical discrete datasets (Chapter ref{ch:1}). Taking bird migration as an example of a hard-to-measure complex trait, I then develop new continuous metrics of bird movement from large-scale community science (eBird) data, using entropy-based measures and phylogenetically aligned component analysis (PACA) to reveal a multi-dimensional structure of evolutionarily relevant combinations of traits, representing underlying movement behavior in North American birds (Chapter ref{ch:2}). Next, I fit SHMMs informed by this structure to global and North American bird phylogenies, testing hypotheses about how migration may have evolved, while accounting for classification ambiguity (Chapter ref{ch:3}). I show that models incorporating hidden states that imitate the structure from Chapter ref{ch:2} were often preferred over generalized hidden Markov models and standard Markov models, suggesting that migration both contains hidden complexity and evolves along specific pathways. Overall, this dissertation provides a methodological framework for integrating continuous data and theoretical knowledge into discrete trait analyses, demonstrating a more holistic treatment of how to treat complex discretized traits like avian migration in phylogenetic comparative methods.  more » « less
Award ID(s):
1942717
PAR ID:
10656883
Author(s) / Creator(s):
Publisher / Repository:
Virginia Tech
Date Published:
Format(s):
Medium: X
Institution:
Virginia Tech
Sponsoring Org:
National Science Foundation
More Like this
  1. The evolution of molecular and phenotypic traits is commonly modelled using Markov processes along a phylogeny. This phylogeny can be a tree, or a network if it includes reticulations, representing events such as hybridization or admixture. Computing the likelihood of data observed at the leaves is costly as the size and complexity of the phylogeny grows. Efficient algorithms exist for trees, but cannot be applied to networks. We show that a vast array of models for trait evolution along phylogenetic networks can be reformulated as graphical models, for which efficient belief propagation algorithms exist. We provide a brief review of belief propagation on general graphical models, then focus on linear Gaussian models for continuous traits. We show how belief propagation techniques can be applied for exact or approximate (but more scalable) likelihood and gradient calculations, and prove novel results for efficient parameter inference of some models. We highlight the possible fruitful interactions between graphical models and phylogenetic methods. For example, approximate likelihood approaches have the potential to greatly reduce computational costs for phylogenies with reticulations. This article is part of the theme issue ‘“A mathematical theory of evolution”: phylogenetic models dating back 100 years’. 
    more » « less
  2. Abstract Seasonal migration is performed by taxonomically diverse groups across the planet’s oceans and continents. Migration has been hypothesized to promote speciation through a variety of mechanisms that may initiate reproductive isolation and population divergence, such as temporal or spatial migratory divides, migration “falloffs,” or the colonization of new, geographically isolated breeding areas.  Migration has also been implicated in recent population divergence within a handful of bird species; however, it is unknown whether migration is generally associated with higher speciation rates. We sought to test this question in two large clades of New World birds with diverse migratory phenotypes, the suboscines and the Emberizoidea, employing three state-of-the-art comparative methods of trait-based diversification: estimates of tip speciation rates using 1) BAMM and 2) ClaDS, and 3) hidden-state speciation extinction models. Our results differed across methods and across taxonomic scales, suggesting an acute need to corroborate inferences across different frameworks and data sets prior to concluding that a given trait has, in fact, promoted diversification. Overall, and based upon the majority of results across different methods, we conclude that there is no methodologically consistent evidence of faster speciation in migratory lineages in these groups.  We discuss the biological implications of this finding, as well as the challenges of inference posed by current trait-based diversification methods. 
    more » « less
  3. Abstract Here we present the open-source and cross-platform BEAST X software that combines molecular phylogenetic reconstruction with complex trait evolution, divergence-time dating and coalescent demographics in an efficient statistical inference engine.BEAST Xsignificantly advances the flexibility and scalability of evolutionary models supported. Novel clock and substitution models leverage a large variety of evolutionary processes; discrete, continuous and mixed traits with missingness and measurement errors; and fast, gradient-informed integration techniques that rapidly traverse high-dimensional parameter spaces. 
    more » « less
  4. Abstract Just exactly which tree(s) should we assume when testing evolutionary hypotheses? This question has plagued comparative biologists for decades. Though all phylogenetic comparative methods require input trees, we seldom know with certainty whether even a perfectly estimated tree (if this is possible in practice) is appropriate for our studied traits. Yet, we also know that phylogenetic conflict is ubiquitous in modern comparative biology, and we are still learning about its dangers when testing evolutionary hypotheses. Here, we investigate the consequences of tree-trait mismatch for phylogenetic regression in the presence of gene tree–species tree conflict. Our simulation experiments reveal excessively high false positive rates for mismatched models with both small and large trees, simple and complex traits, and known and estimated phylogenies. In some cases, we find evidence of a directionality of error: assuming a species tree for traits that evolved according to a gene tree sometimes fares worse than the opposite. We also explored the impacts of tree choice using an expansive, cross-species gene expression dataset as an arguably “best-case” scenario in which one may have a better chance of matching tree with trait. Offering a potential path forward, we found promise in the application of a robust estimator as a potential, albeit imperfect, solution to some issues raised by tree mismatch. Collectively, our results emphasize the importance of careful study design for comparative methods, highlighting the need to fully appreciate the role of accurate and thoughtful phylogenetic modeling. 
    more » « less
  5. Abstract Understanding phenotypic disparity across the tree of life requires identifying where and when evolutionary rates change on phylogeny. A primary methodological challenge in macroevolution is therefore to develop methods for accurate inference of among-lineage variation in rates of phenotypic evolution. Here, we describe a method for inferring among-lineage evolutionary rate heterogeneity in both continuous and discrete traits. The method assumes that the present-day distribution of a trait is shaped by a variable-rate process arising from a mixture of constant-rate processes and uses a single-pass tree traversal algorithm to estimate branch-specific evolutionary rates. By employing dynamic programming optimization techniques and approximate maximum likelihood estimators where appropriate, our method permits rapid exploration of the tempo and mode of phenotypic evolution. Simulations indicate that the method reconstructs rates of trait evolution with high accuracy. Application of the method to data sets on squamate reptile reproduction and turtle body size recovers patterns of rate heterogeneity identified by previous studies but with computational costs reduced by many orders of magnitude. Our results expand the set of tools available for detecting macroevolutionary rate heterogeneity and point to the utility of fast, approximate methods for studying large-scale biodiversity dynamics. [Brownian motion; continuous characters; discrete characters; macroevolution; Markov process; rate heterogeneity.] 
    more » « less