SODA: multi-locus species delimitation using quartet frequencies

Rabiee, Maryam; Mirarab, Siavash

doi:10.1093/bioinformatics/btaa1010

Abstract

Motivation: Species delimitation, the process of deciding how to group a set of organisms into units called species, is one of the most challenging problems in computational evolutionary biology. While many methods exist for species delimitation, most of them based on coalescent theory, few scale to very large datasets, and those that do scale tend to be inaccurate. Species delimitation is closely related to species tree inference from discordant gene trees, a problem that has seen rapid advances in recent years.

Results: In this article, we build on the accuracy and scalability of recent quartet-based methods for species tree estimation and propose a new method called SODA for species delimitation. SODA relies heavily on a recently developed method for testing zero branch length in species trees. In extensive simulations, we show that SODA can easily scale to very large datasets while maintaining high accuracy.

Availability and implementation: The code and data presented here are available at https://github.com/maryamrabiee/SODA.

Supplementary information: Supplementary data are available at Bioinformatics online.
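The abstract does not spell out how a zero-length branch is tested from quartet frequencies. As a rough illustration only, the sketch below assumes the standard multispecies-coalescent expectation that, around a branch of length zero, the three possible quartet topologies appear in gene trees with equal frequency (1/3 each), and applies a chi-square goodness-of-fit test to the three counts. The function name `zero_branch_p_value` and its inputs are hypothetical and are not taken from the SODA code base; how SODA itself uses such a test is not described here.

```python
import math


def zero_branch_p_value(n1, n2, n3):
    """Return a p-value for the null hypothesis that a branch has length zero.

    n1, n2, n3 are the (hypothetical) counts of gene trees supporting each of
    the three quartet topologies around the branch. Under the null, the
    assumed expectation is that all three are equally frequent.
    """
    total = n1 + n2 + n3
    if total <= 0:
        raise ValueError("need at least one quartet observation")

    expected = total / 3.0
    # Pearson chi-square statistic against the uniform (1/3, 1/3, 1/3) null.
    chi2 = sum((n - expected) ** 2 / expected for n in (n1, n2, n3))

    # Three categories give 2 degrees of freedom; for a chi-square
    # distribution with 2 degrees of freedom the survival function
    # is exactly exp(-x / 2), so no external library is needed.
    return math.exp(-chi2 / 2.0)


if __name__ == "__main__":
    # Balanced counts: no evidence against a zero-length branch.
    print(zero_branch_p_value(100, 100, 100))
    # One topology dominates: strong evidence of a positive branch length.
    print(zero_branch_p_value(200, 50, 50))
```

In a delimitation setting, a small p-value would suggest that the branch is real and may separate distinct groups, while failing to reject the null would be consistent with the two sides belonging to the same unit. This reading is an interpretation of the abstract, not a statement of the published procedure.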

Award ID(s): 1845967

PAR ID: 10310397

Author(s) / Creator(s): Rabiee, Maryam; Mirarab, Siavash

Editor(s): Ponty, Yann

Date Published: 2020-12-15

Journal Name: Bioinformatics

Volume: 36

Issue: 24

ISSN: 1367-4803

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article: https://doi.org/10.1093/bioinformatics/btaa1010
