null
(Ed.)
While many statistical approaches have tackled the problem of large spa- tial datasets, the issues arising from costly data movement and data stor- age have long been set aside. Having easy access to the data has been taken for granted and is now becoming an important bottleneck in the performance of statistical inference. As the availability of high resolution spatial data continues to grow, the need to develop efficient modeling techniques that leverage multi-processor and multi-storage capabilities is becoming a priority. To that end, the development of a distributed method to implement Nearest-Neighbor Gaussian Process (NNGP) models for spa- tial interpolation and inference for large datasets is of interest. The pro- posed framework retains the exact implementation of the NNGP while allowing for distributed or sequential computation of the posterior infer- ence. The method allows for any choice of grouping of the data whether it is at random or by region. As a result of this new method, the NNGP model can be implemented with an even split of the computation burden with minimum overload at the master node level.
more »
« less