RFtest: A Robust and Flexible Community-Level Test for Microbiome Data Powerfully Detects Phylogenetically Clustered Signals

Zhang, Lujun; Wang, Yanshan; Chen, Jingwen; Chen, Jun

doi:10.3389/fgene.2021.749573

Citation Details

RFtest: A Robust and Flexible Community-Level Test for Microbiome Data Powerfully Detects Phylogenetically Clustered Signals

Random forest is considered as one of the most successful machine learning algorithms, which has been widely used to construct microbiome-based predictive models. However, its use as a statistical testing method has not been explored. In this study, we propose “Random Forest Test” (RFtest), a global (community-level) test based on random forest for high-dimensional and phylogenetically structured microbiome data. RFtest is a permutation test using the generalization error of random forest as the test statistic. Our simulations demonstrate that RFtest has controlled type I error rates, that its power is superior to competing methods for phylogenetically clustered signals, and that it is robust to outliers and adaptive to interaction effects and non-linear associations. Finally, we apply RFtest to two real microbiome datasets to ascertain whether microbial communities are associated or not with the outcome variables. more »

Award ID(s):: 2113360

PAR ID:: 10324492

Author(s) / Creator(s):: Zhang, Lujun; Wang, Yanshan; Chen, Jingwen; Chen, Jun

Date Published:: 2022-01-24

Journal Name:: Frontiers in Genetics

Volume:: 12

ISSN:: 1664-8021

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.3389/fgene.2021.749573

More Like this