Title: Computing the Bounds of the Number of Reticulations in a Tree-Child Network That Displays a Set of Trees
A phylogenetic network is an evolutionary model that uses a rooted directed acyclic graph (instead of a tree) to represent the evolutionary history of species in which reticulate events (e.g., hybrid speciation or horizontal gene transfer) have occurred. A tree-child network is a kind of phylogenetic network with structural constraints. Existing approaches for tree-child network reconstruction can be slow for large data. In this study, we present several computational approaches for bounding from below the number of reticulations in a tree-child network that displays a given set of rooted binary phylogenetic trees. We also present some theoretical results on bounding the number of reticulations from above. Through simulation, we demonstrate that the new lower bounds on the reticulation number for tree-child networks can be computed in practice for large tree data. The bounds can provide estimates of reticulation for relatively large data.
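To make the two central terms concrete, here is a minimal Python sketch. It is not the method of the paper: it only counts reticulations in a given network and checks the tree-child condition, assuming the usual definition (every non-leaf node has at least one child of indegree one). The function name `analyze_network`, the edge-list representation, and the two toy networks are hypothetical choices made for illustration.

```python
from collections import defaultdict


def analyze_network(edges):
    """Return (reticulation_number, is_tree_child) for a rooted DAG.

    `edges` is an iterable of (parent, child) pairs. This representation
    is an assumption of the sketch, not something taken from the paper.
    """
    children = defaultdict(list)
    indegree = defaultdict(int)
    nodes = set()
    for parent, child in edges:
        children[parent].append(child)
        indegree[child] += 1
        nodes.update((parent, child))

    # A reticulation node has indegree >= 2; it contributes (indegree - 1).
    reticulation_number = sum(
        indegree[v] - 1 for v in nodes if indegree[v] >= 2
    )

    # Tree-child condition: every non-leaf node has a child of indegree 1.
    is_tree_child = all(
        any(indegree[c] == 1 for c in children[v])
        for v in nodes
        if children[v]
    )
    return reticulation_number, is_tree_child


# Toy network with one reticulation (node h); it satisfies the condition.
tree_child_edges = [
    ("r", "a"), ("r", "b"),
    ("a", "x"), ("a", "h"),
    ("b", "h"), ("b", "y"),
    ("h", "z"),
]

# Toy network in which both children of a (and of b) are reticulations,
# so the tree-child condition fails.
non_tree_child_edges = [
    ("r", "a"), ("r", "b"),
    ("a", "h1"), ("a", "h2"),
    ("b", "h1"), ("b", "h2"),
    ("h1", "x"), ("h2", "y"),
]

print(analyze_network(tree_child_edges))
print(analyze_network(non_tree_child_edges))
```

The sketch does not check that the input is acyclic or has a single root; a real implementation would validate both before counting.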
Award ID(s):
1909425
PAR ID:
10540524
Author(s) / Creator(s):
;
Publisher / Repository:
Mary Ann Liebert
Date Published:
Journal Name:
Journal of Computational Biology
Volume:
31
Issue:
4
ISSN:
1557-8666
Page Range / eLocation ID:
345 to 359
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Phylogenetic networks extend phylogenetic trees to allow for modeling reticulate evolutionary processes such as hybridization. They take the shape of a rooted, directed, acyclic graph, and when parameterized with evolutionary parameters, such as divergence times and population sizes, they form a generative process of molecular sequence evolution. Early work on computational methods for phylogenetic network inference focused exclusively on reticulations and sought networks with the fewest number of reticulations to fit the data. As processes such as incomplete lineage sorting (ILS) could be at play concurrently with hybridization, work in the last decade has shifted to computational approaches for phylogenetic network inference in the presence of ILS. In such a short period, significant advances have been made on developing and implementing such computational approaches. In particular, parsimony, likelihood, and Bayesian methods have been devised for estimating phylogenetic networks and associated parameters using estimated gene trees as data. Use of those inference methods has been augmented with statistical tests for specific hypotheses of hybridization, like the D-statistic. Most recently, Bayesian approaches for inferring phylogenetic networks directly from sequence data were developed and implemented. In this chapter, we survey such advances and discuss model assumptions as well as methods’ strengths and limitations. We also discuss parallel efforts in the population genetics community aimed at inferring similar structures. Finally, we highlight major directions for future research in this area. 
  2. The reconstruction of phylogenetic networks is an important but challenging problem in phylogenetics and genome evolution, as the space of phylogenetic networks is vast and cannot be sampled well. One approach to the problem is to solve the minimum phylogenetic network problem, in which phylogenetic trees are first inferred, and then the smallest phylogenetic network that displays all the trees is computed. The approach takes advantage of the fact that the theory of phylogenetic trees is mature, and there are excellent tools available for inferring phylogenetic trees from a large number of biomolecular sequences. A tree–child network is a phylogenetic network satisfying the condition that every nonleaf node has at least one child that is of indegree one. Here, we develop a new method that infers the minimum tree–child network by aligning lineage taxon strings in the phylogenetic trees. This algorithmic innovation enables us to get around the limitations of the existing programs for phylogenetic network inference. Our new program, named ALTS, is fast enough to infer a tree–child network with a large number of reticulations for a set of up to 50 phylogenetic trees with 50 taxa that have only trivial common clusters in about a quarter of an hour on average. 
  3. In recent years, it has become widely accepted that interspecific gene flow is common across the Tree of Life. Questions remain about how species boundaries can be maintained in the face of high levels of gene flow and how phylogeneticists should account for reticulation in their analyses. The true lemurs of Madagascar (genus Eulemur, 12 species) provide a unique opportunity to explore these questions, as they form a recent radiation with at least five active hybrid zones. Here, we present new analyses of a mitochondrial dataset with hundreds of individuals in the genus Eulemur, as well as a nuclear dataset containing hundreds of genetic loci for a small number of individuals. Traditional coalescent-based phylogenetic analyses of both datasets reveal that not all recognized species are monophyletic. Using network-based approaches, we also find that a species tree containing between one and three ancient reticulations is supported by strong evidence. Together, these results suggest that hybridization has been a prominent feature of the genus Eulemur in both the past and present. We also recommend that greater taxonomic attention should be paid to this group so that geographic boundaries and conservation priorities can be better established. 
  4. Phylogenetic networks can represent evolutionary events that cannot be described by phylogenetic trees. These networks are able to incorporate reticulate evolutionary events such as hybridization, introgression, and lateral gene transfer. Recently, network-based Markov models of DNA sequence evolution have been introduced along with model-based methods for reconstructing phylogenetic networks. For these methods to be consistent, the network parameter needs to be identifiable from data generated under the model. Here, we show that the semi-directed network parameter of a triangle-free, level-1 network model with any fixed number of reticulation vertices is generically identifiable under the Jukes–Cantor, Kimura 2-parameter, or Kimura 3-parameter constraints. 
  5. A fundamental assumption of evolutionary biology is that phylogeny follows a bifurcating process. However, hybrid speciation and introgression are becoming more widely documented in many groups. Hybrid inference studies have historically been limited to small sets of taxa, while the prevalence and trends of reticulation at deep time scales remain unexplored. We study the evolutionary history of an adaptive radiation of 109 gemsnakes in Madagascar (Pseudoxyrhophiinae) to identify potential instances of introgression. Using several network inference methods, we find 12 reticulation events within the 22-million-year evolutionary history of gemsnakes, producing 28% of the diversity for the group, including one reticulation that resulted in the diversification of an 18-species radiation. These reticulations are found at nodes with high gene tree discordance and occurred among parental lineages distributed along a north-south axis that share similar ecologies. Younger hybrids occupy intermediate contact zones between the parent lineages, showing that post-speciation dispersal in this group has not eroded the spatial signatures of introgression. Reticulations accumulated consistently over time, despite drops in overall speciation rates during the Pleistocene. This suggests that while bifurcating speciation rates may decline as the result of species accumulation and environmental change, speciation by hybridization may be more robust to these processes. 