Semantic-aware Workflow Construction and Analysis for Distributed Data Analytics Systems
Logging is a universal approach to recording important events in the workflows of distributed systems. Current log analysis tools ignore the semantic knowledge that is key to workflow construction and analysis. In addition, they focus on infrastructure-level distributed systems; because of fundamental differences in log features, they are ineffective for distributed data analytics systems. This paper proposes IntelLog, a semantic-aware, non-intrusive workflow reconstruction tool for distributed data analytics systems. It builds hierarchical relationships between components and events from the logs generated by the targeted systems with little or even no domain knowledge. Leveraging natural language processing, IntelLog automatically extracts and formats the semantic information in each log message, including system events, identifiers, locality information, and metric values. It builds a graph that represents the hierarchical relationship of components in the targeted system via nomenclature conventions. We implement IntelLog for Hadoop MapReduce, Spark, and Tez. Evaluation results show that IntelLog provides a fine-grained view of system workflows with semantics, and that it outperforms existing tools in automatically detecting anomalies caused by real-world problems, misconfigurations, and system bugs. Users can query the formatted semantic knowledge to understand and further troubleshoot the systems.
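As a rough illustration of the kind of per-message semantic extraction described above (not IntelLog's actual implementation), the following Python sketch pulls event text, identifiers, locality, and metric values out of a Hadoop-style log line using simple patterns; the sample line, regular expressions, and function names are assumptions made for the example.

```python
import re

# Hypothetical Hadoop-style log line; the ID format and host name are made up for this example.
LOG = "Starting task attempt_1408312507644_0001_m_000003_0 on node01.cluster: split size 134217728 bytes"

# Rough patterns for the kinds of fields the abstract mentions:
# identifiers, locality (host names), and metric values with units.
ID_RE = re.compile(r"\b(?:attempt|task|job|application|container)_\w+")
HOST_RE = re.compile(r"\b[\w-]+\.(?:cluster|local|lan)\b")
METRIC_RE = re.compile(r"\b(\d+)\s*(bytes|ms|records|MB)\b")

def extract_semantics(message: str) -> dict:
    """Split one raw log message into event text, identifiers, locality, and metrics."""
    identifiers = ID_RE.findall(message)
    hosts = HOST_RE.findall(message)
    metrics = [(int(value), unit) for value, unit in METRIC_RE.findall(message)]
    # Whatever natural-language text remains approximates the "event" description.
    event = METRIC_RE.sub("", HOST_RE.sub("", ID_RE.sub("", message)))
    event = re.sub(r"[\s:]+", " ", event).strip()
    return {"event": event, "ids": identifiers, "locality": hosts, "metrics": metrics}

if __name__ == "__main__":
    print(extract_semantics(LOG))
    # {'event': 'Starting task on split size',
    #  'ids': ['attempt_1408312507644_0001_m_000003_0'],
    #  'locality': ['node01.cluster'],
    #  'metrics': [(134217728, 'bytes')]}
```

A real extractor would of course need broader grammar handling than these toy regexes, but the output shape (event text plus keyed fields) mirrors what the abstract describes.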
- Award ID(s): 1816850
- PAR ID: 10142543
- Date Published:
- Journal Name: HPDC '19: ACM Proceedings of the 28th International Symposium on High-Performance Parallel and Distributed Computing
- Page Range / eLocation ID: 255 to 266
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
- Workflow reconstruction through logs is crucial for troubleshooting distributed systems, yet it is challenging to extract enough information from logs while keeping a concise view, which makes manual log analysis impractical. Currently popular tools rely on identifier-based log parsing, leaving a large amount of workflow information unexploited. In this paper, we propose NLog, a log extraction approach that uses natural language processing to obtain the key information from log messages and to identify the same object across logs generated by different statements, without any domain knowledge. We propose keyed messages, a new storage structure for parsed logs (a minimal sketch of this idea appears after this list). We implement NLog and apply it to the distributed data analytics frameworks Spark and MapReduce. Evaluation results show that NLog can accurately identify the objects in log messages even without explicit identifiers. Using keyed messages, users get a concise as well as flexible view of the workflows.
- Troubleshooting a distributed system can be incredibly difficult. It is rarely feasible to expect a user to know the fine-grained interactions between their system and the environment configuration of each machine used in the system, so work can grind to a halt when a seemingly trivial detail changes. To address this, there is a plethora of state-of-the-art log analysis tools, debuggers, and visualization suites. However, a user may be executing in an open distributed system where the placement of their components is not known before runtime. This makes the process of tracking debug logs almost as difficult as troubleshooting the failures those logs have recorded, because the location of the logs is usually not transparent to the user (and, by association, to the troubleshooting tools they are using). We present TLQ, a framework designed from first principles for log discovery to enable troubleshooting of open distributed systems. TLQ consists of a querying client and a set of servers that track relevant debug logs spread across an open distributed system (a toy sketch of this client/tracker split appears after this list). Through a series of examples, we demonstrate how TLQ enables users to discover the locations of their system’s debug logs and, in turn, to apply well-defined troubleshooting tools to those logs in a distributed fashion. Both of these tasks were previously impractical in an open distributed system without significant a priori knowledge. We also concretely verify TLQ’s effectiveness on a production system: a biodiversity scientific workflow. We note the potential storage and performance overheads of TLQ compared to a centralized, closed-system approach.
- Over the past decade, there has been a significant increase in the development of visual analytics systems dedicated to addressing urban issues. These systems distill intricate urban analysis workflows into intuitive, interactive visual representations and interfaces, enabling users to explore, understand, and derive insights from large and complex data, including street-level imagery, street networks, and building geometries. Developing urban visual analytics systems, however, is a challenging endeavor that requires considerable programming expertise and interaction between various multidisciplinary stakeholders. This situation often leads to monolithic and isolated prototypes that are hard to reproduce, combine, or extend. Concurrently, there has been an increase in the availability of general and urban-specific toolkits, frameworks, and authoring tools that are open source and abstract away the need to implement low-level visual analytics functionalities. This paper provides a hierarchical taxonomy of urban visual analytics systems to contextualize how they are usually designed, implemented, and evaluated. We develop this taxonomy across three distinct levels (i.e., dimensions, categories, and tags), juxtaposing visualization with analytics, data, and system dimensions. We then assess the extent to which current open-source toolkits, frameworks, and authoring tools can effectively support the development of components tailored to urban visual analytics, identifying their strengths and limitations in addressing the unique challenges posed by urban data. In doing so, we offer a roadmap that can guide the effective employment of existing resources and chart a pathway for developing and refining future systems.
- Over the past decade, several urban visual analytics systems and tools have been proposed to tackle a host of challenges faced by cities, in areas as diverse as transportation, weather, and real estate. Many of these tools have been designed through collaborations with urban experts, aiming to distill intricate urban analysis workflows into interactive visualizations and interfaces. However, the design, implementation, and practical use of these tools still rely on siloed approaches, resulting in bespoke systems that are difficult to reproduce and extend. At the design level, these tools undervalue rich data workflows from urban experts, typically treating them only as data providers and evaluators. At the implementation level, they lack interoperability with other technical frameworks. At the practical use level, they tend to be narrowly focused on specific fields, inadvertently creating barriers to cross-domain collaboration. To address these gaps, we present Curio, a framework for collaborative urban visual analytics. Curio uses a dataflow model with multiple abstraction levels (code, grammar, GUI elements) to facilitate collaboration across the design and implementation of visual analytics components. The framework allows experts to intertwine data preprocessing, management, and visualization stages while tracking the provenance of code and visualizations. In collaboration with urban experts, we evaluate Curio through a diverse set of usage scenarios targeting urban accessibility, urban microclimate, and sunlight access. These scenarios use different types of data and domain methodologies to illustrate Curio’s flexibility in tackling pressing societal challenges. Curio is available at urbantk.org/curio.
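The NLog entry above stores parsed logs as keyed messages grouped by the object each message describes. The following is a minimal, hypothetical Python sketch of that idea; the class names, fields, and sample entries are invented for illustration and are not NLog's actual data model.

```python
from collections import defaultdict
from dataclasses import dataclass, field

@dataclass
class KeyedMessage:
    key: str                                   # the object the message talks about, e.g. a task ID
    event: str                                 # the natural-language part of the message
    attrs: dict = field(default_factory=dict)  # extracted values (hosts, metrics, ...)

class KeyedMessageStore:
    """Groups parsed log messages by the object they describe."""
    def __init__(self):
        self._by_key = defaultdict(list)

    def add(self, msg: KeyedMessage) -> None:
        self._by_key[msg.key].append(msg)

    def workflow(self, key: str) -> list:
        """All messages recorded for one object, in arrival order."""
        return self._by_key[key]

store = KeyedMessageStore()
store.add(KeyedMessage("task_0003", "task started", {"host": "node01"}))
store.add(KeyedMessage("task_0003", "task finished", {"duration_ms": 5120}))
print([m.event for m in store.workflow("task_0003")])  # ['task started', 'task finished']
```

Grouping by key is what allows a per-object workflow to be read back directly, even when the underlying messages came from different logging statements.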
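The TLQ entry above describes a querying client and a set of servers that track where debug logs land across an open distributed system. Below is a toy, in-process rendition of that client/tracker split; all class and method names are invented for illustration and do not reflect TLQ's real interfaces.

```python
class LogTracker:
    """Runs alongside one node and records the debug logs written there."""
    def __init__(self, node: str):
        self.node = node
        self._logs = []  # (component, path) pairs

    def register(self, component: str, path: str) -> None:
        self._logs.append((component, path))

    def query(self, component: str) -> list:
        return [path for c, path in self._logs if c == component]

class QueryClient:
    """Fans a log-discovery query out to all known trackers."""
    def __init__(self, trackers):
        self.trackers = trackers

    def locate(self, component: str) -> dict:
        found = {t.node: t.query(component) for t in self.trackers}
        return {node: paths for node, paths in found.items() if paths}

t1, t2 = LogTracker("worker-1"), LogTracker("worker-2")
t1.register("scheduler", "/var/log/wf/scheduler.debug")
t2.register("transfer", "/var/log/wf/transfer.debug")
print(QueryClient([t1, t2]).locate("transfer"))  # {'worker-2': ['/var/log/wf/transfer.debug']}
```

In a real deployment the trackers would be remote services rather than in-process objects, but the discovery step (ask every tracker, keep the nodes that report matches) is the same.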