Spatial Data Decomposition and Load Balancing on HPC Platforms

Yang, Jie; Paudel, Anmol; Puri, Satish

doi:10.1145/3332186.3333266

Citation Details

Spatial Data Decomposition and Load Balancing on HPC Platforms

We are in the era of Spatial Big Data. Due to the developments of topographic techniques, clear satellite imagery, and various means for collecting information, geospatial datasets are growing in volume, complexity and heterogeneity. For example, OpenStreetMap data for the whole world is about 1 TB and NASA world climate datasets are about 17 TB. Spatial data volume and variety makes spatial computations both data-intensive and compute-intensive. Due to the irregular distribution of spatial data, domain decomposition becomes challenging. In this work, we present spatial data partitioning technique that takes into account spatial join cost. In addition, we present spatial join computation using Asynchronous Dynamic Load Balancing (ADLB) library. ADLB is a software library designed to help rapidly build scalable parallel programs using MPI. We evaluated the performance of ADLB-based MPI-GIS implementation. In our existing work, spatial data movement cost from ADLB server to worker MPI processes limited the scalability of MPI-GIS. more »

Award ID(s):: 1756000

PAR ID:: 10110446

Author(s) / Creator(s):: Yang, Jie; Paudel, Anmol; Puri, Satish

Date Published:: 2019-07-01

Journal Name:: Proceedings of the Practice and Experience in Advanced Research Computing on Rise of the Machines (learning)

Page Range / eLocation ID:: 1 to 4

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3332186.3333266

More Like this