Bayesian Multi-Group Gaussian Process Models for Heterogeneous Group-Structured Data

Li, Didong; Jones, Andrew; Banerjee, Sudipto; Engelhardt, Barbara E

Citation Details

Gaussian processes are pervasive in functional data analysis, machine learning, and spatial statistics for modeling complex dependencies. Scientific data are often heterogeneous in their inputs and contain multiple known discrete groups of samples; thus, it is desirable to leverage the similarity among groups while accounting for heterogeneity across groups. We propose multi-group Gaussian processes (MGGPs) defined over Rp×C , where C is a finite set representing the group label, by developing general classes of valid (positive definite) covariance functions on such domains. MGGPs are able to accurately recover relationships between the groups and efficiently share strength across samples from all groups during inference, while capturing distinct group-specific behaviors in the conditional posterior distributions. We demonstrate inference in MGGPs through simulation experiments, and we apply our proposed MGGP regression framework to gene expression data to illustrate the behavior and enhanced inferential capabilities of multi-group Gaussian processes by jointly modeling continuous and categorical variables. more »

Award ID(s):: 2113778

PAR ID:: 10611614

Author(s) / Creator(s):: Li, Didong; Jones, Andrew; Banerjee, Sudipto; Engelhardt, Barbara E

Publisher / Repository:: https://www.jmlr.org/papers/v26/23-0291.html

Date Published:: 2025-01-01

Journal Name:: Journal of machine learning research

Volume:: 26

Issue:: 30

ISSN:: 1532-4435

Page Range / eLocation ID:: 1-34

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
The DOI is not currently available.

More Like this