Multi-Dimensional Domain Generalization with Low-Rank Structures

Li, Sai; Zhang, Linjun

doi:10.1080/01621459.2025.2471055

Citation Details

This content will become publicly available on April 11, 2026

Multi-Dimensional Domain Generalization with Low-Rank Structures

In conventional statistical and machine learning methods, it is typically assumed that the test data are identically distributed with the training data. However, this assumption does not always hold, especially in applications where the target population are not well-represented in the training data. This is a notable issue in health-related studies, where specific ethnic populations may be underrepresented, posing a significant challenge for researchers aiming to make statistical inferences about these minority groups. In this work, we present a novel approach to addressing this challenge in linear regression models. We organize the model parameters for all the sub-populations into a tensor. By studying a structured tensor completion problem, we can achieve robust domain generalization, that is, learning about sub-populations with limited or no available data. Our method novelly leverages the structure of group labels and it can produce more reliable and interpretable generalization results. We establish rigorous theoretical guarantees for the proposed method and demonstrate its minimax optimality. To validate the effectiveness of our approach, we conduct extensive numerical experiments and a real data study focused on diabetes prediction for multiple subgroups, comparing our results with those obtained using other existing methods. Supplementary materials for this article are available online, including a standardized description of the materials available for reproducing the work. more »

Award ID(s):: 2340241

PAR ID:: 10600831

Author(s) / Creator(s):: Li, Sai; Zhang, Linjun

Publisher / Repository:: JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION

Date Published:: 2025-04-11

Journal Name:: Journal of the American Statistical Association

ISSN:: 0162-1459

Page Range / eLocation ID:: 1 to 13

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on April 11, 2026
Journal Article:
https://doi.org/10.1080/01621459.2025.2471055

More Like this