Data Integration Tasks on Heterogeneous Systems Using OpenCL

Faber, Clayton J.; Cabrera, Anthony M.; Booker, Orondé; Maayan, Gabe; Chamberlain, Roger D.

doi:10.1145/3318170.3318187

Citation Details

Data Integration Tasks on Heterogeneous Systems Using OpenCL

In the era of big data, many new algorithms are developed to try and find the most efficient way to perform computations with massive amounts of data. However, what is often overlooked is the preprocessing step for many of these applications. The Data Integration Benchmark Suite (DIBS) was designed to understand the characteristics of dataset transformations in a hardware agnostic way. While on the surface these applications have a high amount of data parallelism, there are caveats in their specification that can potentially affect this characteristic. Even still, OpenCL can be an effective deployment environment for these applications. In this work we take a subset of the data transformations from each category presented in DIBS and implement them in OpenCL to evaluate their performance for heterogeneous systems. For targeting heterogeneous systems, we take a common application and attempt to deploy it to three platforms targetable by OpenCL (CPU, GPU, and FPGA). The applications are evaluated by their average transformation data rate. We illustrate the advantages of each compute device in the data integration space along with different communications schemes allowed for host/device communication in the OpenCL platform. more »

Award ID(s):: 1763503 1527510

NSF-PAR ID:: 10108237

Author(s) / Creator(s):: Faber, Clayton J.; Cabrera, Anthony M.; Booker, Orondé; Maayan, Gabe; Chamberlain, Roger D.

Date Published:: 2019-05-14

Journal Name:: Proc. of 7th International Workshop on OpenCL

Page Range / eLocation ID:: 1 to 1

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3318170.3318187

More Like this