Abstract It has become common in evolutionary biology to characterize phenotypes multivariately. However, visualizing macroevolutionary trends in multivariate datasets requires appropriate ordination methods.In this paper we describe phylogenetically aligned component analysis (PACA): a new ordination approach that aligns phenotypic data with phylogenetic signal. Unlike phylogenetic principal component analysis (Phy‐PCA), which finds an alignment of a principal eigenvector that is independent of phylogenetic signal, PACA maximizes variation in directions that describe phylogenetic signal, while simultaneously preserving the Euclidean distances among observations in the data space.We demonstrate with simulated and empirical examples that with PACA, it is possible to visualize the trend in phylogenetic signal in multivariate data spaces, irrespective of other signals in the data. In conjunction with Phy‐PCA, one can visualize both phylogenetic signal and trends in data independent of phylogenetic signal.Phylogenetically aligned component analysis can distinguish between weak phylogenetic signals and strong signals concentrated in only a portion of all data dimensions. We provide empirical examples that emphasize the difference. Use of PACA in studies focused on phylogenetic signal should enable much more precise description of the phylogenetic signal, as a result.Overall, PACA will return a projection that shows the most phylogenetic signal in the first few components, irrespective of other signals in the data. By comparing Phy‐PCA and PACA results, one may glean the relative importance of phylogenetic and other (ecological) signals in the data.
more »
« less
This content will become publicly available on July 6, 2025
Exploring Phylogenetic Signal in Multivariate Phenotypes by Maximizing Blomberg’s K
Abstract Due to the hierarchical structure of the tree of life, closely related species often resemble each other more than distantly related species; a pattern termed phylogenetic signal. Numerous univariate statistics have been proposed as measures of phylogenetic signal for single phenotypic traits, but the study of phylogenetic signal for multivariate data, as is common in modern biology, remains challenging. Here, we introduce a new method to explore phylogenetic signal in multivariate phenotypes. Our approach decomposes the data into linear combinations with maximal (or minimal) phylogenetic signal, as measured by Blomberg’s K. The loading vectors of these phylogenetic components or K-components can be biologically interpreted, and scatterplots of the scores can be used as a low-dimensional ordination of the data that maximally (or minimally) preserves phylogenetic signal. We present algebraic and statistical properties, along with 2 new summary statistics, KA and KG, of phylogenetic signal in multivariate data. Simulation studies showed that KA and KG have higher statistical power than the previously suggested statistic Kmult, especially if phylogenetic signal is low or concentrated in a few trait dimensions. In 2 empirical applications to vertebrate cranial shape (crocodyliforms and papionins), we found statistically significant phylogenetic signal concentrated in a few trait dimensions. The finding that phylogenetic signal can be highly variable across the dimensions of multivariate phenotypes has important implications for current maximum likelihood approaches to phylogenetic signal in multivariate data.
more »
« less
- Award ID(s):
- 2140720
- PAR ID:
- 10588767
- Publisher / Repository:
- Oxford University Press
- Date Published:
- Journal Name:
- Systematic Biology
- Volume:
- 74
- Issue:
- 2
- ISSN:
- 1063-5157
- Page Range / eLocation ID:
- 215 to 229
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Summary Species in the genusSphagnumcreate, maintain, and dominate boreal peatlands through ‘extended phenotypes’ that allow these organisms to engineer peatland ecosystems and thereby impact global biogeochemical cycles. One such phenotype is the production of peat, or incompletely decomposed biomass, that accumulates when rates of growth exceed decomposition. Interspecific variation in peat production is thought to be responsible for the establishment and maintenance of ecological gradients such as the microtopographic hummock‐hollow gradient, along which sympatric species sort within communities.This study investigated the mode and tempo of functional trait evolution across 15 species ofSphagnumusing data from the most extensive studies ofSphagnumfunctional traits to date and phylogenetic comparative methods.We found evidence for phylogenetic conservatism of the niche descriptor height‐above‐water‐table and of traits related to growth, decay and litter quality. However, we failed to detect the influence of phylogeny on interspecific variation in other traits such as shoot density and suggest that environmental context can obscure phylogenetic signal. Trait correlations indicate possible adaptive syndromes that may relate to niche and its construction.This study is the first to formally test the extent to which functional trait variation amongSphagnumspecies is a result of shared evolutionary history.more » « less
-
Zhou, Xuming (Ed.)Abstract Comparative genomics approaches seek to associate molecular evolution with the evolution of phenotypes across a phylogeny. Many of these methods lack the ability to analyze non-ordinal categorical traits with more than two categories. To address this limitation, we introduce an expansion to RERconverge that associates shifts in evolutionary rates with the convergent evolution of categorical traits. The categorical RERconverge expansion includes methods for performing categorical ancestral state reconstruction, statistical tests for associating relative evolutionary rates with categorical variables, and a new method for performing phylogeny-aware permutations, “permulations”, on categorical traits. We demonstrate our new method on a three-category diet phenotype, and we compare its performance to binary RERconverge analyses and two existing methods for comparative genomic analyses of categorical traits: phylogenetic simulations and a phylogenetic signal based method. We present an analysis of how the categorical permulations scale with the number of species and the number of categories included in the analysis. Our results show that our new categorical method outperforms phylogenetic simulations at identifying genes and enriched pathways significantly associated with the diet phenotypes and that the categorical ancestral state reconstruction drives an improvement in our ability to capture diet-related enriched pathways compared to binary RERconverge when implemented without user input on phenotype evolution. The categorical expansion to RERconverge will provide a strong foundation for applying the comparative method to categorical traits on larger data sets with more species and more complex trait evolution than have previously been analyzed.more » « less
-
Casadesús, Josep (Ed.)The evolutionary forces that determine genome size in bacteria and archaea have been the subject of intense debate over the last few decades. Although the preferential loss of genes observed in prokaryotes is explained through the deletional bias, factors promoting and preventing the fixation of such gene losses often remain unclear. Importantly, statistical analyses on this topic typically do not consider the potential bias introduced by the shared ancestry of many lineages, which is critical when using species as data points because of the potential dependence on residuals. In this study, we investigated the genome size distributions across a broad diversity of bacteria and archaea to evaluate if this trait is phylogenetically conserved at broad phylogenetic scales. After model fit, Pagel’s lambda indicated a strong phylogenetic signal in genome size data, suggesting that the diversification of this trait is influenced by shared evolutionary histories. We used a phylogenetic generalized least-squares analysis (PGLS) to test whether phylogeny influences the predictability of genome size from dN/dS ratios and 16S copy number, two variables that have been previously linked to genome size. These results confirm that failure to account for evolutionary history can lead to biased interpretations of genome size predictors. Overall, our results indicate that although bacteria and archaea can rapidly gain and lose genetic material through gene transfers and deletions, respectively, phylogenetic signal for genome size distributions can still be recovered at broad phylogenetic scales that should be taken into account when inferring the drivers of genome size evolution.more » « less
-
The New World sparrows (Passerellidae) are a large, diverse group of songbirds that vary in morphology, behavior, and ecology. Thus, they are excellent for studying trait evolution in a phylogenetic framework. We examined lability versus conservatism in morphological and behavioral traits in two related clades of sparrows ( Aimophila, Peucaea ), and assessed whether habitat has played an important role in trait evolution. We first inferred a multi-locus phylogeny which we used to reconstruct ancestral states, and then quantified phylogenetic signal among morphological and behavioral traits in these clades and in New World sparrows more broadly. Behavioral traits have a stronger phylogenetic signal than morphological traits. Specifically, vocal duets and song structure are the most highly conserved traits, and nesting behavior appears to be maintained within clades. Furthermore, we found a strong correlation between open habitat and unpatterned plumage, complex song, and ground nesting. However, even within lineages that share the same habitat type, species vary in nesting, plumage pattern, song complexity, and duetting. Our findings highlight trade-offs between behavior, morphology, and ecology in sparrow diversification.more » « less