Distributed nearest-neighbor Gaussian processes

Grenier, Isabelle; Sansó, Bruno

doi:10.1080/03610918.2021.1921798

Citation Details

Distributed nearest-neighbor Gaussian processes

While many statistical approaches have tackled the problem of large spa- tial datasets, the issues arising from costly data movement and data stor- age have long been set aside. Having easy access to the data has been taken for granted and is now becoming an important bottleneck in the performance of statistical inference. As the availability of high resolution spatial data continues to grow, the need to develop efficient modeling techniques that leverage multi-processor and multi-storage capabilities is becoming a priority. To that end, the development of a distributed method to implement Nearest-Neighbor Gaussian Process (NNGP) models for spa- tial interpolation and inference for large datasets is of interest. The pro- posed framework retains the exact implementation of the NNGP while allowing for distributed or sequential computation of the posterior infer- ence. The method allows for any choice of grouping of the data whether it is at random or by region. As a result of this new method, the NNGP model can be implemented with an even split of the computation burden with minimum overload at the master node level. more »

Award ID(s):: 1953168 1952970

PAR ID:: 10251513

Author(s) / Creator(s):: Grenier, Isabelle; Sansó, Bruno

Date Published:: 2021-04-01

Journal Name:: Communications in Statistics - Simulation and Computation

ISSN:: 0361-0918

Page Range / eLocation ID:: 1 to 13

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1080/03610918.2021.1921798

More Like this