Iterative Feature Matching: Toward Provable Domain Generalization with Logarithmic Environments

Chen, Yining; Rosenfeld, Elan; Sellke, Mark; Ma, Tengyu; Risteski, Andrej

Citation Details

Domain generalization aims at performing well on unseen test environments with data from a limited number of training environments. Despite a proliferation of proposed algorithms for this task, assessing their performance both theoretically and empirically is still very challenging. Distributional matching algorithms such as (Conditional) Domain Adversarial Networks [12, 28] are popular and enjoy empirical success, but they lack formal guarantees. Other approaches such as Invariant Risk Minimization (IRM) require a prohibitively large number of training environments—linear in the dimension of the spurious feature space ds—even on simple data models like the one proposed by Rosenfeld et al. [37]. Under a variant of this model, we show that ERM and IRM can fail to fnd the optimal invariant predictor with o(ds) environments. We then present an iterative feature matching algorithm that is guaranteed with high probability to find the optimal invariant predictor after seeing only O(log ds) environments. Our results provide the first theoretical justification for distribution-matching algorithms widely used in practice under a concrete nontrivial data model. more »

Award ID(s):: 2211907

PAR ID:: 10450562

Author(s) / Creator(s):: Chen, Yining; Rosenfeld, Elan; Sellke, Mark; Ma, Tengyu; Risteski, Andrej

Date Published:: 2022-01-01

Journal Name:: Advances in neural information processing systems

ISSN:: 1049-5258

Page Range / eLocation ID:: 1725-1736

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this