To examine phylogenetic heterogeneity in turtle evolution, we collected thousands of high-confidence single-copy orthologs from 19 genome assemblies representative of extant turtle diversity and estimated a phylogeny with multispecies coalescent and concatenated partitioned methods. We also collected next-generation sequences from 26 turtle species and assembled millions of biallelic markers to reconstruct phylogenies based on annotated regions from the western painted turtle (Chrysemys picta bellii) genome (coding regions, introns, untranslated regions, intergenic, and others). We then measured gene tree-species tree discordance, as well as gene and site heterogeneity at each node in the inferred trees, and tested for temporal patterns in phylogenomic conflict across turtle evolution. We found strong and consistent support for all bifurcations in the inferred turtle species phylogenies. However, a number of genes, sites, and genomic features supported alternate relationships between turtle taxa. Our results suggest that gene tree-species tree discordance in these data sets is likely driven by population-level processes such as incomplete lineage sorting. We found very little effect of substitutional saturation on species tree topologies, and no clear phylogenetic patterns in codon usage bias and compositional heterogeneity. There was no correlation between gene and site concordance, node age, and DNA substitution rate across mostmore »
The páramo ecosystem, located above the timberline in the tropical Andes, has been the setting for some of the most dramatic plant radiations, and it is one of the world’s fastest evolving and most diverse high-altitude ecosystems. Today 144+ species of frailejones (subtribe Espeletiinae Cuatrec., Asteraceae) dominate the páramo. Frailejones have intrigued naturalists and botanists, not just for their appealing beauty and impressive morphological diversity, but also for their remarkable adaptations to the extremely harsh environmental conditions of the páramo. Previous attempts to reconstruct the evolutionary history of this group failed to resolve relationships among genera and species, and there is no agreement regarding the classification of the group. Thus, our goal was to reconstruct the phylogeny of the frailejones and to test the influence of the geography on it as a first step to understanding the patterns of radiation of these plants.
Field expeditions in 70 páramos of Colombia and Venezuela resulted in 555 collected samples from 110 species. Additional material was obtained from herbarium specimens. Sequence data included nrDNA (ITS and ETS) and cpDNA (rpl16), for an aligned total of 2,954 bp. Fragment analysis was performed with AFLP data using 28 primer combinations and yielding 1,665 fragments. Phylogenies more »
Phylogenies reconstructed suggest that most genera are paraphyletic, but the phylogenetic signal may be misled by hybridization and incomplete lineage sorting. A tree with all the available molecular data shows two large clades: one of primarily Venezuelan species that includes a few neighboring Colombian species; and a second clade of only Colombian species. Results from the Monte Carlo permutation test suggests a very strong influence of the geography on the phylogenetic relationships. Venezuelan páramos tend to hold taxa that are more distantly-related to each other than Colombian páramos, where taxa are more closely-related to each other.
Our data suggest the presence of two independent radiations: one in Venezuela and the other in Colombia. In addition, the current generic classification will need to be deeply revised. Analyses show a strong geographic structure in the phylogeny, with large clades grouped in hotspots of diversity at a regional scale, and in páramo localities at a local scale. Differences in the degrees of relatedness between sympatric species of Venezuelan and Colombian páramos may be explained because of the younger age of the latter páramos, and the lesser time for speciation of Espeletiinae in them.
- Publication Date:
- NSF-PAR ID:
- Journal Name:
- Page Range or eLocation-ID:
- Article No. e2968
- Sponsoring Org:
- National Science Foundation
More Like this
Core genome phylogenies are widely used to build the evolutionary history of individual prokaryote species. By using hundreds or thousands of shared genes, these approaches are the gold standard to reconstruct the relationships of large sets of strains. However, there is growing evidence that bacterial strains exchange DNA through homologous recombination at rates that vary widely across prokaryote species, indicating that core genome phylogenies might not be able to reconstruct true phylogenies when recombination rate is high. Few attempts have been made to evaluate the robustness of core genome phylogenies to recombination, but some analyses suggest that reconstructed trees are not always accurate.
In this study, we tested the robustness of core genome phylogenies to various levels of recombination rates. By analyzing simulated and empirical data, we observed that core genome phylogenies are relatively robust to recombination rates; nevertheless, our results suggest that many reconstructed trees are not completely accurate even when bootstrap supports are high. We found that some core genome phylogenies are highly robust to recombination whereas others are strongly impacted by it, and we identified that the robustness of core genome phylogenies to recombination is highly linked to the levels of selective pressures acting on amore »
Overall, these results have important implications for the application of core genome phylogenies in prokaryotes.
TopHap: rapid inference of key phylogenetic structures from common haplotypes in large genome collections with limited diversity
Building reliable phylogenies from very large collections of sequences with a limited number of phylogenetically informative sites is challenging because sequencing errors and recurrent/backward mutations interfere with the phylogenetic signal, confounding true evolutionary relationships. Massive global efforts of sequencing genomes and reconstructing the phylogeny of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) strains exemplify these difficulties since there are only hundreds of phylogenetically informative sites but millions of genomes. For such datasets, we set out to develop a method for building the phylogenetic tree of genomic haplotypes consisting of positions harboring common variants to improve the signal-to-noise ratio for more accurate and fast phylogenetic inference of resolvable phylogenetic features.
We present the TopHap approach that determines spatiotemporally common haplotypes of common variants and builds their phylogeny at a fraction of the computational time of traditional methods. We develop a bootstrap strategy that resamples genomes spatiotemporally to assess topological robustness. The application of TopHap to build a phylogeny of 68 057 SARS-CoV-2 genomes (68KG) from the first year of the pandemic produced an evolutionary tree of major SARS-CoV-2 haplotypes. This phylogeny is concordant with the mutation tree inferred using the co-occurrence pattern of mutations and recovers key phylogenetic relationships from moremore »
Availability and implementation
TopHap is available at https://github.com/SayakaMiura/TopHap.
Supplementary data are available at Bioinformatics online.
Ultraconserved elements resolve the phylogeny and corroborate patterns of molecular rate variation in herons (Aves: Ardeidae)
Thoroughly sampled and well-supported phylogenetic trees are essential to taxonomy and to guide studies of evolution and ecology. Despite extensive prior inquiry, a comprehensive tree of heron relationships (Aves: Ardeidae) has not yet been published. As a result, the classification of this family remains unstable, and their evolutionary history remains poorly studied. Here, we sample genome-wide ultraconserved elements (UCEs) and mitochondrial DNA sequences (mtDNA) of >90% of extant species to estimate heron phylogeny using a combination of maximum likelihood, coalescent, and Bayesian inference methods. The UCE and mtDNA trees are mostly concordant with one another, providing a topology that resolves relationships among the 5 heron subfamilies and indicates that the genera Gorsachius, Botaurus, Ardea, and Ixobrychus are not monophyletic. We also present the first genetic data from the Forest Bittern Zonerodius heliosylus, an enigmatic species of New Guinea; our results suggest that it is a member of the genus Ardeola and not the Tigrisomatinae (tiger herons), as previously thought. Finally, we compare molecular rates between heron clades in the UCE tree with those in previously constructed mtDNA and DNA–DNA hybridization trees. We show that rate variation in the UCE tree corroborates rate patterns in the previously constructed trees—that bitternsmore »
Ancient Rapid Radiation Explains Most Conflicts Among Gene Trees and Well-Supported Phylogenomic Trees of Nostocalean Cyanobacteria
Prokaryotic genomes are often considered to be mosaics of genes that do not necessarily share the same evolutionary history due to widespread horizontal gene transfers (HGTs). Consequently, representing evolutionary relationships of prokaryotes as bifurcating trees has long been controversial. However, studies reporting conflicts among gene trees derived from phylogenomic data sets have shown that these conflicts can be the result of artifacts or evolutionary processes other than HGT, such as incomplete lineage sorting, low phylogenetic signal, and systematic errors due to substitution model misspecification. Here, we present the results of an extensive exploration of phylogenetic conflicts in the cyanobacterial order Nostocales, for which previous studies have inferred strongly supported conflicting relationships when using different concatenated phylogenomic data sets. We found that most of these conflicts are concentrated in deep clusters of short internodes of the Nostocales phylogeny, where the great majority of individual genes have low resolving power. We then inferred phylogenetic networks to detect HGT events while also accounting for incomplete lineage sorting. Our results indicate that most conflicts among gene trees are likely due to incomplete lineage sorting linked to an ancient rapid radiation, rather than to HGTs. Moreover, the short internodes of this radiation fit themore »