Identifying model violations under the multispecies coalescent model using P2C2M.SNAPP

Duckett, Drew J.; Pelletier, Tara A.; Carstens, Bryan C.

doi:10.7717/peerj.8271

Citation Details

Identifying model violations under the multispecies coalescent model using P2C2M.SNAPP

Phylogenetic estimation under the multispecies coalescent model (MSCM) assumes all incongruence among loci is caused by incomplete lineage sorting. Therefore, applying the MSCM to datasets that contain incongruence that is caused by other processes, such as gene flow, can lead to biased phylogeny estimates. To identify possible bias when using the MSCM, we present P2C2M.SNAPP. P2C2M.SNAPP is an R package that identifies model violations using posterior predictive simulation. P2C2M.SNAPP uses the posterior distribution of species trees output by the software package SNAPP to simulate posterior predictive datasets under the MSCM, and then uses summary statistics to compare either the empirical data or the posterior distribution to the posterior predictive distribution to identify model violations. In simulation testing, P2C2M.SNAPP correctly classified up to 83% of datasets (depending on the summary statistic used) as to whether or not they violated the MSCM model. P2C2M.SNAPP represents a user-friendly way for researchers to perform posterior predictive model checks when using the popular SNAPP phylogenetic estimation program. It is freely available as an R package, along with additional program details and tutorials. more »

Award ID(s):: 1661029

PAR ID:: 10148262

Author(s) / Creator(s):: Duckett, Drew J.; Pelletier, Tara A.; Carstens, Bryan C.

Date Published:: 2020-01-01

Journal Name:: PeerJ

Volume:: 8

ISSN:: 2167-8359

Page Range / eLocation ID:: e8271

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.7717/peerj.8271

More Like this