

Title: Tree-weighting for multi-study ensemble learners
Multi-study learning uses multiple training studies, trains a classifier separately on each study, and then forms an ensemble with weights that reward members with better cross-study prediction ability. This article considers novel weighting approaches for constructing tree-based ensemble learners in this setting. Using Random Forests as the single-study learner, we compare two strategies: weighting each forest as a whole to form the ensemble, or extracting the individual trees trained by each Random Forest and weighting them directly. We consider weighting approaches that reward cross-study replicability within the training set. We find that incorporating multiple layers of ensembling in the training process increases the robustness of the resulting predictor. Furthermore, we explore how the ensembling weights correspond to the internal structure of the trees, shedding light on which features are important in determining the relationship between the Random Forests algorithm and the true outcome model. Finally, we apply our approach to genomic datasets and show that our method improves upon the basic multi-study learning paradigm.
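The cross-study weighting scheme described above can be sketched in a few lines. This is an illustration only: the stump learner stands in for a Random Forest, the synthetic "studies" are made up, and the weighting rule (mean accuracy on the other training studies) is just one of several replicability-rewarding rules, not necessarily the paper's.

```python
import numpy as np

rng = np.random.default_rng(0)

# Three hypothetical "studies": same signal, shifted feature distributions.
studies = []
for shift in (0.0, 0.3, 0.6):
    X = rng.normal(shift, 1.0, size=(200, 3))
    y = (X[:, 0] > shift).astype(int)
    studies.append((X, y))

class Stump:
    """Toy single-study learner (a stand-in for a Random Forest)."""
    def fit(self, X, y):
        # Threshold feature 0 at the midpoint of the class-conditional means.
        self.t = (X[y == 0, 0].mean() + X[y == 1, 0].mean()) / 2
        return self
    def predict(self, X):
        return (X[:, 0] > self.t).astype(int)

learners = [Stump().fit(X, y) for X, y in studies]

# Weight each learner by its mean accuracy on the *other* training studies,
# rewarding cross-study replicability rather than in-study fit.
raw = []
for i, m in enumerate(learners):
    accs = [(m.predict(X) == y).mean() for j, (X, y) in enumerate(studies) if j != i]
    raw.append(np.mean(accs))
weights = np.array(raw) / np.sum(raw)

def ensemble_predict(X):
    # Weighted vote across the study-specific learners.
    score = sum(w * m.predict(X) for w, m in zip(weights, learners))
    return (score > 0.5).astype(int)
```

The same weighting loop applies whether the ensemble members are whole forests or the individual trees extracted from them; only the `learners` list changes.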
Award ID(s):
1810829
PAR ID:
10105531
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Pacific Symposium on Biocomputing 2020
Page Range / eLocation ID:
451-462
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. ABSTRACT

    The aim of this paper is to systematically investigate merging and ensembling methods for spatially varying coefficient mixed effects models (SVCMEM) in order to carry out integrative learning of neuroimaging data obtained from multiple biomedical studies. The "merged" approach involves training a single learning model using a comprehensive dataset that encompasses information from all the studies. Conversely, the "ensemble" approach involves creating a weighted average of distinct learning models, each developed from an individual study. We systematically investigate the prediction accuracy of the merged and ensemble learners in the presence of different degrees of interstudy heterogeneity. Additionally, we establish asymptotic guidelines for deciding when to employ each model, along with deriving optimal weights for the ensemble learner. To validate our theoretical results, we perform extensive simulation studies. The proposed methodology is also applied to 3 large-scale neuroimaging studies.
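A minimal numerical sketch of the merged-versus-ensemble contrast, with ordinary least squares standing in for SVCMEM and equal ensemble weights rather than the optimal weights the paper derives; the study generator below is hypothetical:

```python
import numpy as np

rng = np.random.default_rng(1)

def fit_ols(X, y):
    # Least-squares coefficients (stand-in for a study-specific learner).
    return np.linalg.lstsq(X, y, rcond=None)[0]

# Hypothetical studies: a shared coefficient vector plus study-level
# perturbations, i.e. interstudy heterogeneity.
beta = np.array([1.0, -2.0])
studies = []
for _ in range(3):
    X = rng.normal(size=(100, 2))
    b = beta + rng.normal(0, 0.5, size=2)      # heterogeneous true coefficients
    y = X @ b + rng.normal(0, 0.1, size=100)
    studies.append((X, y))

# "Merged": one model trained on the pooled data from all studies.
X_all = np.vstack([X for X, _ in studies])
y_all = np.concatenate([y for _, y in studies])
beta_merged = fit_ols(X_all, y_all)

# "Ensemble": average of the study-specific fits (equal weights here;
# the paper derives optimal, generally unequal, weights).
beta_ensemble = np.mean([fit_ols(X, y) for X, y in studies], axis=0)
```

Which of the two generalizes better depends on the degree of heterogeneity in the study-level perturbations, which is exactly the trade-off the asymptotic guidelines address.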

     
  2. Ensemble-based change detection can improve map accuracies by combining information from multiple datasets. There is a growing literature investigating ensemble inputs and applications for forest disturbance detection and mapping. However, few studies have evaluated ensemble methods other than Random Forest classifiers, which rely on uninterpretable "black box" algorithms with hundreds of parameters. Additionally, most ensemble-based disturbance maps do not utilize independently and systematically collected field-based forest inventory measurements. Here, we compared three approaches for combining change detection results generated from multi-spectral Landsat time series with forest inventory measurements to map forest harvest events at an annual time step. We found that seven-parameter degenerate decision tree ensembles performed at least as well as 500-tree Random Forest ensembles trained and tested on the same LandTrendr segmentation results, and both supervised decision tree methods consistently outperformed the top-performing voting approach (majority vote). Comparisons with an existing national forest disturbance dataset indicated notable improvements in accuracy that demonstrate the value of developing locally calibrated, process-specific disturbance datasets like the harvest event maps developed in this study. Furthermore, by using multi-date forest inventory measurements, we are able to establish a lower bound of 30% basal area removal on detectable harvests, providing biophysical context for our harvest event maps. Our results suggest that simple interpretable decision trees applied to multi-spectral temporal segmentation outputs can be as effective as more complex machine learning approaches for characterizing forest harvest events ranging from partial clearing to clear cuts, with important implications for locally accurate mapping of forest harvests and other types of disturbances.
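The majority-vote baseline that the supervised decision tree methods outperform can be illustrated in a few lines; the per-pixel detector outputs below are made up:

```python
import numpy as np

# Hypothetical per-pixel change flags from three detectors (one row each).
votes = np.array([[1, 0, 1, 1],
                  [1, 0, 0, 1],
                  [0, 0, 1, 1]])

# Majority voting: flag a change wherever more than half the detectors agree.
majority = (votes.sum(axis=0) > votes.shape[0] / 2).astype(int)
# majority -> [1, 0, 1, 1]
```

A supervised combiner replaces this fixed rule with a learned one, e.g. a small decision tree trained on inventory-verified harvest labels.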
  3. Random forests use ensembles of decision trees to boost accuracy for machine learning tasks. However, large ensembles slow down inference on platforms that process each tree in an ensemble individually. We present Bolt, a platform that restructures whole random forests, not just individual trees, to speed up inference. Conceptually, Bolt maps every path in each tree to a lookup table which, if the cache were large enough, would allow inference with just one memory access. When the size of the lookup table exceeds cache capacity, Bolt employs a novel combination of lossless compression, parameter selection, and bloom filters to shrink the table while preserving fast inference. We compared inference speed in Bolt to three state-of-the-art platforms: Python Scikit-Learn, Ranger, and Forest Packing. We evaluated these platforms using datasets with vision, natural language processing, and categorical applications. We observed that on ensembles of shallow decision trees Bolt can run 2-14X faster than competing platforms and that Bolt's speedups persist as the number of decision trees in an ensemble increases.
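The path-to-lookup-table idea can be sketched conceptually. The tree encoding and key layout below are illustrative, not Bolt's actual representation: each root-to-leaf path is keyed by the sequence of branch decisions, so prediction reduces to building the key and doing one table lookup.

```python
# A tiny decision tree: (feature, threshold, left_child, right_child);
# leaves are class labels. This encoding is for illustration only.
tree = (0, 0.5,
        (1, 0.3, "A", "B"),
        "C")

def enumerate_paths(node, bits=()):
    """Yield (branch-decision tuple, leaf label) for every root-to-leaf path."""
    if not isinstance(node, tuple):
        yield bits, node
        return
    _, _, left, right = node
    yield from enumerate_paths(left, bits + (0,))
    yield from enumerate_paths(right, bits + (1,))

# Precompute the lookup table once, offline.
table = dict(enumerate_paths(tree))

def predict(x):
    # Evaluate the node tests along the path to build the key,
    # then answer with a single table lookup.
    node, bits = tree, ()
    while isinstance(node, tuple):
        f, t, left, right = node
        go_right = x[f] > t
        bits += (int(go_right),)
        node = right if go_right else left
    return table[bits]
```

In a real system the per-node tests are cheap bit operations and the table entry is fetched in one access when it fits in cache; the compression and bloom-filter machinery in the abstract handles the case when it does not.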
  4. It is increasingly common to encounter prediction tasks in the biomedical sciences for which multiple datasets are available for model training. Common approaches such as pooling datasets before model fitting can produce poor out‐of‐study prediction performance when datasets are heterogeneous. Theoretical and applied work has shown multistudy ensembling to be a viable alternative that leverages the variability across datasets in a manner that promotes model generalizability. Multistudy ensembling uses a two‐stage stacking strategy which fits study‐specific models and estimates ensemble weights separately. This approach ignores, however, the ensemble properties at the model‐fitting stage, potentially resulting in performance losses. Motivated by challenges in the estimation of COVID‐attributable mortality, we propose optimal ensemble construction, an approach to multistudy stacking whereby we jointly estimate ensemble weights and parameters associated with study‐specific models. We prove that limiting cases of our approach yield existing methods such as multistudy stacking and pooling datasets before model fitting. We propose an efficient block coordinate descent algorithm to optimize the loss function. We use our method to perform multicountry COVID‐19 baseline mortality prediction. We show that when little data is available for a country before the onset of the pandemic, leveraging data from other countries can substantially improve prediction accuracy. We further compare and characterize the method's performance in data‐driven simulations and other numerical experiments. Our method remains competitive with or outperforms multistudy stacking and other earlier methods in the COVID‐19 data application and in a range of simulation settings.
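A toy sketch of jointly estimating ensemble weights and model parameters by block coordinate descent, with linear least-squares models standing in for the study-specific learners. This illustrates the alternating block structure only; it is not the paper's algorithm, loss, or data:

```python
import numpy as np

rng = np.random.default_rng(2)

# Hypothetical studies: a shared linear signal plus study-specific perturbations.
studies = []
for _ in range(3):
    X = rng.normal(size=(80, 2))
    b = np.array([1.0, -1.0]) + rng.normal(0, 0.3, size=2)
    y = X @ b + rng.normal(0, 0.1, size=80)
    studies.append((X, y))

K = len(studies)
X_all = np.vstack([X for X, _ in studies])
y_all = np.concatenate([y for _, y in studies])

# Start from study-specific least-squares fits, as plain multistudy stacking would.
betas = np.array([np.linalg.lstsq(X, y, rcond=None)[0] for X, y in studies])
w = np.full(K, 1.0 / K)

def ensemble_loss():
    """Squared error of the weighted ensemble over all pooled observations."""
    pred = sum(w[k] * (X_all @ betas[k]) for k in range(K))
    return float(((pred - y_all) ** 2).sum())

loss_before = ensemble_loss()

# Block coordinate descent: alternate exact minimization over the weight
# block and over each coefficient block, holding the other blocks fixed.
for _ in range(20):
    # Weight block: least squares of y on the stacked per-model predictions.
    P = np.column_stack([X_all @ betas[k] for k in range(K)])
    w = np.linalg.lstsq(P, y_all, rcond=None)[0]
    # Coefficient blocks: refit model k against the residual left by the others.
    for k in range(K):
        if abs(w[k]) > 1e-8:
            resid = y_all - sum(w[j] * (X_all @ betas[j]) for j in range(K) if j != k)
            betas[k] = np.linalg.lstsq(w[k] * X_all, resid, rcond=None)[0]

loss_after = ensemble_loss()
```

Because each block update is an exact minimization given the other blocks, the ensemble loss is nonincreasing across iterations; freezing the coefficient blocks at their initial study-specific fits would recover ordinary multistudy stacking.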

     
  5. Administrative errors in unemployment insurance (UI) decisions give rise to a public values conflict between efficiency and efficacy. We analyze whether artificial intelligence (AI) – in particular, methods in machine learning (ML) – can be used to detect administrative errors in UI claims decisions, both in terms of accuracy and normative tradeoffs. We use 16 years of US Department of Labor audit and policy data on UI claims to analyze the accuracy of 7 different random forest and deep learning models. We further test weighting schemas and synthetic data approaches to correcting imbalances in the training data. A random forest model using gradient descent boosting is more accurate, along several measures, and preferable in terms of public values, than every deep learning model tested. Adjusting model weights produces significant recall improvements for low-n outcomes, at the expense of precision. Synthetic data produces attenuated improvements and drawbacks relative to weights. 
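One standard class-weighting schema of the kind tested above can be sketched as follows; the labels are made up and the inverse-frequency rule is illustrative, not necessarily the exact schema the study used:

```python
import numpy as np

# Toy imbalanced labels: the rare class would be swamped in unweighted training.
y = np.array([0] * 95 + [1] * 5)

# Inverse-frequency weighting: weight each class inversely to its frequency
# so that both classes contribute equal total mass to the training loss.
classes, counts = np.unique(y, return_counts=True)
class_weight = {c: len(y) / (len(classes) * n) for c, n in zip(classes, counts)}
sample_weight = np.array([class_weight[c] for c in y])
```

Upweighting the rare class this way is what drives the recall gains on low-n outcomes described above, and the precision cost follows directly: the model is pushed to flag more candidates from the rare class, including false positives.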