Exaggerated false positives by popular differential expression methods when analyzing human population samples

Li, Yumei; Ge, Xinzhou; Peng, Fanglue; Li, Wei; Li, Jingyi_Jessica (ORCID:0000000292885648)

doi:10.1186/s13059-022-02648-4

Citation Details

Exaggerated false positives by popular differential expression methods when analyzing human population samples

Abstract When identifying differentially expressed genes between two conditions using human population RNA-seq samples, we found a phenomenon by permutation analysis: two popular bioinformatics methods, DESeq2 and edgeR, have unexpectedly high false discovery rates. Expanding the analysis to limma-voom, NOISeq, dearseq, and Wilcoxon rank-sum test, we found that FDR control is often failed except for the Wilcoxon rank-sum test. Particularly, the actual FDRs of DESeq2 and edgeR sometimes exceed 20% when the target FDR is 5%. Based on these results, for population-level RNA-seq studies with large sample sizes, we recommend the Wilcoxon rank-sum test. more »

Award ID(s):: 1846216 2113754

PAR ID:: 10363895

Author(s) / Creator(s):: Li, Yumei; Ge, Xinzhou; Peng, Fanglue; Li, Wei; Li, Jingyi_Jessica

Publisher / Repository:: Springer Science + Business Media

Date Published:: 2022-03-15

Journal Name:: Genome Biology

Volume:: 23

Issue:: 1

ISSN:: 1474-760X

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Journal Article:
https://doi.org/10.1186/s13059-022-02648-4

More Like this