Data‐driven guidelines for phylogenomic analyses using SNP data

Suissa, Jacob S; De_La_Cerda, Gisel Y; Graber, Leland C; Jelley, Chloe; Wickell, David; Phillips, Heather R; Grinage, Ayress D; Moreau, Corrie S; Specht, Chelsea D; Doyle, Jeff J; Landis, Jacob B

doi:10.1002/aps3.11611

Citation Details

Data‐driven guidelines for phylogenomic analyses using SNP data

There is a general lack of consensus on the best practices for filtering of single‐nucleotide polymorphisms (SNPs) and whether it is better to use SNPs or include flanking regions (full “locus”) in phylogenomic analyses and subsequent comparative methods. Using genotyping‐by‐sequencing data from 22Glycinespecies, we assessed the effects of SNP vs. locus usage and SNP retention stringency. We compared branch length, node support, and divergence time estimation across 16 datasets with varying amounts of missing data and total size. Our results revealed five aspects of phylogenomic data usage that may be generally applicable: (1) tree topology is largely congruent across analyses; (2) filtering strictly for SNP retention (e.g., 90–100%) reduces support and can alter some inferred relationships; (3) absolute branch lengths vary by two orders of magnitude between SNP and locus datasets; (4) data type and branch length variation have little effect on divergence time estimation; and (5) phylograms alter the estimation of ancestral states and rates of morphological evolution. Using SNP or locus datasets does not alter phylogenetic inference significantly, unless researchers want or need to use absolute branch lengths. We recommend against using excessive filtering thresholds for SNP retention to reduce the risk of producing inconsistent topologies and generating low support. more »

Award ID(s):: 1929318

PAR ID:: 10537769

Author(s) / Creator(s):: Suissa, Jacob S; De_La_Cerda, Gisel Y; Graber, Leland C; Jelley, Chloe; Wickell, David; Phillips, Heather R; Grinage, Ayress D; Moreau, Corrie S; Specht, Chelsea D; Doyle, Jeff J; Landis, Jacob B

Publisher / Repository:: Wiley Online Library

Date Published:: 2024-08-09

Journal Name:: Applications in Plant Sciences

ISSN:: 2168-0450

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1002/aps3.11611

More Like this