Understanding Behavior Trends of Big Data Frameworks in Ongoing Software-Defined Cyber-Infrastructure

Chen, Shouwei; Rodero, Ivan

doi:10.1145/3148055.3148079

Citation Details

Understanding Behavior Trends of Big Data Frameworks in Ongoing Software-Defined Cyber-Infrastructure

As data analytics applications become increasingly important in a wide range of domains, the ability to develop large-scale and sustainable platforms and software infrastructure to support these applications has significant potential to drive research and innovation in both science and business domains. This paper characterizes performance and power-related behavior trends and tradeoffs of the two predominant frameworks for Big Data analytics (i.e., Apache Hadoop and Spark) for a range of representative applications. It also evaluates system design knobs, such as storage and network technologies and power capping techniques. Experimental results from empirical executions provide meaningful data points for exploring the potential of software-defined infrastructure for Big Data processing systems through simulation. The results provide better understanding of the design space to build multi-criteria application-centric models as well as show significant advantages of software-defined infrastructure in terms of execution time, energy and cost. It motivates further research focused on in-memory processing formulations regarding systems with deeper memory hierarchies and software-defined infrastructure. more »

Award ID(s):: 1464317 1305375

PAR ID:: 10077381

Author(s) / Creator(s):: Chen, Shouwei; Rodero, Ivan

Date Published:: 2017-01-01

Journal Name:: BDCAT '17 Proceedings of the Fourth IEEE/ACM International Conference on Big Data Computing, Applications and Technologies

Page Range / eLocation ID:: 199 - 208

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3148055.3148079

More Like this