Splitting Gaussian processes for computationally-efficient regression

Terry, Nick; Choe, Youngjun

doi:10.1371/journal.pone.0256470

Citation Details

Splitting Gaussian processes for computationally-efficient regression

Gaussian processes offer a flexible kernel method for regression. While Gaussian processes have many useful theoretical properties and have proven practically useful, they suffer from poor scaling in the number of observations. In particular, the cubic time complexity of updating standard Gaussian process models can be a limiting factor in applications. We propose an algorithm for sequentially partitioning the input space and fitting a localized Gaussian process to each disjoint region. The algorithm is shown to have superior time and space complexity to existing methods, and its sequential nature allows the model to be updated efficiently. The algorithm constructs a model for which the time complexity of updating is tightly bounded above by a pre-specified parameter. To the best of our knowledge, the model is the first local Gaussian process regression model to achieve linear memory complexity. Theoretical continuity properties of the model are proven. We demonstrate the efficacy of the resulting model on several multi-dimensional regression tasks. more »

Award ID(s):: 1824681 1952781

PAR ID:: 10290270

Author(s) / Creator(s):: Terry, Nick; Choe, Youngjun

Editor(s):: Scalas, Enrico

Date Published:: 2021-08-24

Journal Name:: PLOS ONE

Volume:: 16

Issue:: 8

ISSN:: 1932-6203

Page Range / eLocation ID:: e0256470

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1371/journal.pone.0256470

More Like this