Meta-analysis of heterogeneous data: integrative sparse regression in high-dimensions

Maity, Subha; Sun, Yuekai; Banerjee, Moulinath

Citation Details

We consider the task of meta-analysis in high-dimensional settings in which the data sources are similar but non-identical. To borrow strength across such heterogeneous datasets, we introduce a global parameter that emphasizes interpretability and statistical efficiency in the presence of heterogeneity. We also propose a one-shot estimator of the global parameter that preserves the anonymity of the data sources and converges at a rate that depends on the size of the combined dataset. For high-dimensional linear model settings, we demonstrate the superiority of our identification restrictions in adapting to a previously seen data distribution as well as predicting for a new/unseen data distribution. Finally, we demonstrate the benefits of our approach on a large-scale drug treatment dataset involving several different cancer cell-lines. more »

Award ID(s):: 1916271

PAR ID:: 10483794

Author(s) / Creator(s):: Maity, Subha; Sun, Yuekai; Banerjee, Moulinath

Publisher / Repository:: Journal of Machine Learning Research

Date Published:: 2022-01-01

Journal Name:: Journal of machine learning research

ISSN:: 1533-7928

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
The DOI is not currently available.

More Like this