It Can Understand the Logs, Literally
Workflow reconstruction through logs is crucial for troubleshooting distributed systems, yet extracting enough information from logs while keeping a concise view is difficult, which makes manual log analysis hard to practice. Moreover, currently popular tools rely on identifier-based log parsing, leaving a large amount of workflow information unexploited. In this paper, we propose NLog, a log extraction approach that uses natural language processing to obtain the key information from log messages and to identify the same object across logs generated by different statements, without any domain knowledge. We also propose keyed messages, a new log storage structure for the parsed logs. We implement NLog and apply it to the distributed data analytics frameworks Spark and MapReduce. Evaluation results show that NLog can accurately identify the objects in log messages even without explicit identifiers. Using keyed messages, users get a concise as well as flexible view of the workflows.
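As a rough illustration of the keyed-message idea, the sketch below uses simple regex heuristics in place of NLog's NLP pipeline; the function names, the `<*>` template placeholder, and the treatment of pure numbers as metric values are our assumptions, not the paper's implementation.

```python
import re
from collections import defaultdict

# Heuristic stand-in for NLog's NLP step: in data analytics logs, object
# identifiers tend to be tokens containing digits (task IDs, hosts, ...).
ID_TOKEN = re.compile(r"\b\w*\d[\w./]*\b")

def parse_log_line(line):
    """Split a log message into a constant template plus its variable parts,
    separating likely object identifiers from plain numeric metric values."""
    ids, metrics = [], []
    for tok in ID_TOKEN.findall(line):
        (metrics if tok.isdigit() else ids).append(tok)
    return ID_TOKEN.sub("<*>", line), ids, metrics

def build_keyed_messages(lines):
    """Group parsed messages under each object (key) they mention, so every
    event concerning one object can be read together -- a 'keyed message'."""
    keyed = defaultdict(list)
    for line in lines:
        template, ids, _ = parse_log_line(line)
        for obj in ids:
            keyed[obj].append(template)
    return keyed

logs = [
    "Starting task attempt_1_m_000001 on host node03",
    "Finished task attempt_1_m_000001 in 512 ms",
]
for key, events in build_keyed_messages(logs).items():
    print(key, "->", events)
```

Grouping by object rather than by log statement is what lets events emitted by different statements, even in different components, line up under the same key.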
- Award ID(s):
- 1816850
- PAR ID:
- 10142541
- Date Published:
- Journal Name:
- 2019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)
- Page Range / eLocation ID:
- 446 to 451
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
Logging is a universal approach to recording important events in the system workflows of distributed systems. Current log analysis tools ignore the semantic knowledge that is key to workflow construction and analysis, and they focus on infrastructure-level distributed systems; because of fundamental differences in log features, they are ineffective for distributed data analytics systems. This paper proposes IntelLog, a semantic-aware, non-intrusive workflow reconstruction tool for distributed data analytics systems. It builds hierarchical relationships between components and events from the logs generated by the targeted systems with little or even no domain knowledge. Leveraging natural language processing, IntelLog automatically extracts and formats the semantic information in each log message, including system events, identifiers, locality information, and metric values. It builds a graph representing the hierarchical relationships of components in the targeted system via nomenclature conventions. We implement IntelLog for Hadoop MapReduce, Spark, and Tez. Evaluation results show that IntelLog provides a fine-grained view of the system workflows with semantics, and that it outperforms existing tools in automatically detecting anomalies caused by real-world problems, misconfigurations, and system bugs. Users can query the formatted semantic knowledge to understand and further troubleshoot the systems.
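As a toy illustration of linking components through nomenclature conventions, the sketch below pairs Hadoop-style identifiers whose underscore-separated fields extend one another; the pairing heuristic and the example IDs are our assumptions rather than IntelLog's actual rules.

```python
from collections import defaultdict

def fields(identifier):
    """Split a Hadoop-style identifier into its type word and its fields,
    e.g. 'task_1553_0001_m_000000' -> ('task', ['1553','0001','m','000000'])."""
    parts = identifier.split("_")
    return parts[0], parts[1:]

def build_hierarchy(identifiers):
    """Link identifiers whose field lists extend one another: if B's fields
    are A's fields plus exactly one more, treat B as a child of A."""
    children = defaultdict(list)
    for a in identifiers:
        _, fa = fields(a)
        for b in identifiers:
            if a == b:
                continue
            _, fb = fields(b)
            if len(fb) == len(fa) + 1 and fb[:len(fa)] == fa:
                children[a].append(b)
    return children

ids = [
    "application_1553_0001",
    "appattempt_1553_0001_000001",
    "task_1553_0001_m_000000",
    "attempt_1553_0001_m_000000_0",
]
for parent, kids in build_hierarchy(ids).items():
    print(parent, "->", kids)
```

This captures only direct one-field extensions; real naming schemes (e.g., Hadoop container IDs) need per-system rules, which is exactly the kind of convention IntelLog mines from the logs.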
-
Troubleshooting a distributed system can be incredibly difficult. It is rarely feasible to expect a user to know the fine-grained interactions between their system and the environment configuration of each machine used in the system. Because of this, work can grind to a halt when a seemingly trivial detail changes. To address this, there is a plethora of state-of-the-art log analysis tools, debuggers, and visualization suites. However, a user may be executing in an open distributed system where the placement of their components is not known before runtime. This makes the process of tracking debug logs almost as difficult as troubleshooting the failures those logs have recorded, because the location of the logs is usually not transparent to the user (and, by association, to the troubleshooting tools they are using). We present TLQ, a framework designed from first principles for log discovery to enable troubleshooting of open distributed systems. TLQ consists of a querying client and a set of servers that track relevant debug logs spread across an open distributed system. Through a series of examples, we demonstrate how TLQ enables users to discover the locations of their system's debug logs and, in turn, apply well-defined troubleshooting tools to those logs in a distributed fashion. Both of these tasks were previously impractical in an open distributed system without significant a priori knowledge. We also concretely verify TLQ's effectiveness by way of a production system: a biodiversity scientific workflow. We note the potential storage and performance overheads of TLQ compared to a centralized, closed-system approach.
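The abstract does not spell out TLQ's protocol, so the sketch below is only a toy, single-process registry conveying the discovery idea: components register where their debug logs live at runtime, and a client later queries by component name instead of guessing hosts and paths. All class and method names are hypothetical.

```python
class LogRegistry:
    """Toy log-discovery index; TLQ distributes this role across servers."""

    def __init__(self):
        self._index = {}  # component name -> list of (host, log path)

    def register(self, component, host, path):
        """A component reports where one of its debug logs lives."""
        self._index.setdefault(component, []).append((host, path))

    def discover(self, component):
        """Return every known (host, path) holding logs for `component`."""
        return self._index.get(component, [])

registry = LogRegistry()
# Workers report log locations at runtime, since their placement is
# not known before the workflow starts.
registry.register("worker", "node07", "/tmp/wf/worker-123.debug")
registry.register("worker", "node12", "/tmp/wf/worker-456.debug")

# A troubleshooting tool can now be pointed at the discovered logs.
for host, path in registry.discover("worker"):
    print(f"fetch {host}:{path}")
```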
-
Recent advances in causality analysis have enabled investigators to trace multi-stage attacks using whole-system provenance graphs. Because they are based on system-layer audit logs (e.g., syscalls), these approaches omit vital sources of application context (e.g., email addresses, HTTP response codes) that can be found in higher layers of the system. Although this information is often essential to understanding attack behaviors, incorporating such evidence into causal analysis engines is difficult due to the semantic gap that exists between system layers. To address this shortcoming, we propose the notion of universal provenance, which encodes all forensically relevant causal dependencies regardless of their layer of origin. To transparently realize this vision on commodity systems, we present ωLOG ("Omega Log"), a provenance tracking mechanism that bridges the semantic gap between system- and application-level logging contexts. ωLOG analyzes program binaries to identify and model application-layer logging behaviors, enabling application events to be accurately reconciled with system-layer accesses. ωLOG then intercepts applications' runtime logging activities and grafts those events onto the system-layer provenance graph, allowing investigators to reason more precisely about the nature of attacks. We demonstrate that ωLOG is widely applicable to existing software projects and can transparently facilitate execution partitioning of dependency graphs without any training or developer intervention. Evaluation on real-world attack scenarios shows that universal provenance graphs are concise and rich with semantic information compared to the state-of-the-art, with 12% average runtime overhead.
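As a loose sketch of the grafting step (our assumptions throughout: ωLOG actually derives log templates from binary analysis and builds the graph from audit logs, neither of which is reproduced here), the snippet below matches an application log line against a known template and attaches the event to the emitting process's node in a toy provenance graph.

```python
import re

# Templates a tool like ωLOG would recover from the application's logging
# statements (hypothetical example; the real pipeline derives these from
# the program binary, not by hand).
APP_TEMPLATES = [re.compile(r"GET (?P<url>\S+) -> (?P<code>\d{3})")]

# A whole-system provenance graph: system-layer nodes (from audit logs)
# plus application-layer event nodes grafted onto the acting process.
provenance = {("process", 4242): []}

def graft(pid, log_line):
    """Reconcile one application log line with the system-layer graph by
    attaching the matched app event to the process that emitted it."""
    for template in APP_TEMPLATES:
        m = template.search(log_line)
        if m:
            provenance[("process", pid)].append(("app-event", m.groupdict()))
            return True
    return False  # unmatched lines stay at the system layer only

graft(4242, "127.0.0.1 - GET /index.html -> 200")
print(provenance)
```

The point of the grafted node is that an investigator walking the syscall-level graph now also sees the HTTP-level meaning of what the process did.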
-
Researchers have investigated a number of strategies for capturing and analyzing data analyst event logs in order to design better tools, identify failure points, and guide users. However, this remains challenging because individual- and session-level behavioral differences lead to an explosion of complexity, and there are few guarantees that log observations map to user cognition. In this paper we introduce a technique for segmenting sequential analyst event logs that combines data, interaction, and user features to create discrete blocks of goal-directed activity. Using measures of inter-dependency and comparisons between analysis states, these blocks identify patterns in interaction logs coupled with the current view that users are examining. Through an analysis of publicly available data and data from a lab study across a variety of analysis tasks, we validate that our segmentation approach aligns with users' changing goals and tasks. Finally, we identify several downstream applications for our approach.
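A minimal sketch of one plausible segmentation rule follows: score the feature overlap between consecutive events and start a new block when overlap drops below a threshold. The Jaccard measure, the flat feature sets, and the threshold value are illustrative assumptions, not the paper's actual model.

```python
def jaccard(a, b):
    """Overlap between two feature sets (0 = disjoint, 1 = identical)."""
    return len(a & b) / len(a | b) if a or b else 1.0

def segment(events, threshold=0.2):
    """Cut a sequence of analyst events into blocks of goal-directed activity:
    open a new block when an event shares too little with its predecessor."""
    blocks, current = [], [events[0]]
    for prev, cur in zip(events, events[1:]):
        if jaccard(prev["features"], cur["features"]) < threshold:
            blocks.append(current)
            current = []
        current.append(cur)
    blocks.append(current)
    return blocks

# Each event mixes data attributes touched with the interaction type --
# a stand-in for the paper's richer data/interaction/user feature set.
log = [
    {"id": 1, "features": {"col:price", "brush"}},
    {"id": 2, "features": {"col:price", "zoom"}},
    {"id": 3, "features": {"col:country", "filter"}},
]
print([[e["id"] for e in b] for b in segment(log)])  # -> [[1, 2], [3]]
```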