Using Robinson-Foulds supertrees in divide-and-conquer phylogeny estimation

Yu, Xilin; Le, Thien; Christensen, Sarah A.; Molloy, Erin K.; Warnow, Tandy (ORCID:0000000177173514)

doi:10.1186/s13015-021-00189-2

Citation Details

Using Robinson-Foulds supertrees in divide-and-conquer phylogeny estimation

Abstract

One of the Grand Challenges in Science is the construction of theTree of Life, an evolutionary tree containing several million species, spanning all life on earth. However, the construction of the Tree of Life is enormously computationally challenging, as all the current most accurate methods are either heuristics forNP-hard optimization problems or Bayesian MCMC methods that sample from tree space. One of the most promising approaches for improving scalability and accuracy for phylogeny estimation uses divide-and-conquer: a set of species is divided into overlapping subsets, trees are constructed on the subsets, and then merged together using a “supertree method”. Here, we present Exact-RFS-2, the first polynomial-time algorithm to find an optimal supertree of two trees, using the Robinson-Foulds Supertree (RFS) criterion (a major approach in supertree estimation that is related to maximum likelihood supertrees), and we prove that finding the RFS of three input trees isNP-hard. Exact-RFS-2 is available in open source form on Github athttps://github.com/yuxilin51/GreedyRFS.

NSF-PAR ID:: 10253453

Author(s) / Creator(s):: Yu, Xilin; Le, Thien; Christensen, Sarah A.; Molloy, Erin K.; Warnow, Tandy

Publisher / Repository:: Springer Science + Business Media

Date Published:: 2021-06-28

Journal Name:: Algorithms for Molecular Biology

Volume:: 16

Issue:: 1

ISSN:: 1748-7188

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Journal Article:
https://doi.org/10.1186/s13015-021-00189-2

More Like this