Benchmarking Parallel K-Means Cloud Type Clustering from Satellite Data

Barajas, Carlos; Guo, Pei; Mukherjee, Lipi; Hoban, Susan; Wang, Jianwu; Jin, Daeho; Gangopadhyay, Aryya; Gobbert, Matthias K

doi:10.1007/978-3-030-32813-9_20

The study of clouds, i.e., where they occur and what their characteristics are, plays a key role in the understanding of climate change. Clustering is a common machine learning technique used in atmospheric science to classify cloud types. Many parallelism techniques, e.g., MPI, OpenMP, and Spark, could achieve efficient and scalable clustering of large-scale satellite observation data. To understand their differences, this paper studies and compares three approaches to parallel clustering of satellite observation data. Benchmarking experiments with k-means clustering are conducted with three parallelism techniques, namely OpenMP, OpenMP+MPI, and Spark, on an HPC cluster using up to 16 nodes.

Award ID(s): 1726023; 1730250

PAR ID: 10189026

Author(s) / Creator(s): Barajas, Carlos; Guo, Pei; Mukherjee, Lipi; Hoban, Susan; Wang, Jianwu; Jin, Daeho; Gangopadhyay, Aryya; Gobbert, Matthias K

Date Published: 2019-01-01

Journal Name: International Symposium on Benchmarking, Measuring and Optimization

Page Range / eLocation ID: 248-260

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper:
https://doi.org/10.1007/978-3-030-32813-9_20
