

Title: Multi-source Log Clustering in Distributed Systems
Distributed systems are seeing wider use as software grows more complex and cloud platforms gain popularity. Yet anomaly detection and other log analysis procedures for distributed systems have received little research attention. To this end, we propose a simple and generic method of clustering log statements from separate log files to support subsequent log analysis. We identify the variable components of log statements and find matches of these variables between the sources. After scoring the variables, we select the one with the highest score as the clustering basis. We performed a case study of our method on two open-source projects, in which the method proved successful, and we have released an open-source implementation, log-matcher.
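
As a rough illustration of the pipeline above, the following Python sketch implements one plausible reading of it: extract candidate variable tokens from each line, score each variable value by how many distinct sources it appears in, and cluster statements around the highest-scoring one. The token pattern, the scoring heuristic, and all names are illustrative assumptions of ours, not the published log-matcher implementation.

    # Minimal sketch of the clustering idea; the token pattern and the
    # cross-source scoring heuristic are illustrative assumptions, not
    # the published log-matcher code.
    import re
    from collections import defaultdict

    # Naive variable extraction: treat hex strings, IPv4-like tokens, and
    # plain integers as the variable components of a log statement.
    VARIABLE_PATTERN = re.compile(r"0x[0-9a-f]+|\d+\.\d+\.\d+\.\d+|\d+")

    def extract_variables(line):
        """Return the variable components found in one log line."""
        return VARIABLE_PATTERN.findall(line)

    def score_variables(sources):
        """Score each variable value by the number of distinct sources
        (log files) it appears in; sources maps name -> list of lines."""
        seen_in = defaultdict(set)
        for name, lines in sources.items():
            for line in lines:
                for var in extract_variables(line):
                    seen_in[var].add(name)
        return {var: len(srcs) for var, srcs in seen_in.items()}

    def cluster_by_best_variable(sources):
        """Group lines from all sources around the highest-scoring variable."""
        scores = score_variables(sources)
        best = max(scores, key=scores.get)
        cluster = [(name, line)
                   for name, lines in sources.items()
                   for line in lines
                   if best in extract_variables(line)]
        return best, cluster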
Award ID(s):
1854049
NSF-PAR ID:
10310334
Author(s) / Creator(s):
Date Published:
Journal Name:
Information Science and Applications. Lecture Notes in Electrical Engineering
Volume:
739
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1.
    In modern Machine Learning, model training is an iterative, experimental process that can consume enormous computation resources and developer time. To aid in that process, experienced model developers log and visualize program variables during training runs. Exhaustive logging of all variables is infeasible, so developers are left to choose between slowing down training via extensive conservative logging, or letting training run fast via minimalist optimistic logging that may omit key information. As a compromise, optimistic logging can be accompanied by program checkpoints; this allows developers to add log statements post hoc and "replay" desired log statements from a checkpoint, a process we refer to as hindsight logging. Unfortunately, hindsight logging raises tricky problems in data management and software engineering. Done poorly, hindsight logging can waste resources and generate technical debt embodied in multiple variants of training code. In this paper, we present methodologies for efficient and effective logging practices for model training, with a focus on techniques for hindsight logging. Our goal is for experienced model developers to learn and adopt these practices. To make this easier, we provide an open-source suite of tools for Fast Low-Overhead Recovery (flor) that embodies our design across three tasks: (i) efficient background logging in Python, (ii) adaptive periodic checkpointing, and (iii) an instrumentation library that codifies hindsight logging for efficient and automatic record-replay of model training. Model developers can use each flor tool separately as they see fit, or they can use flor in hands-free mode, entrusting it to instrument their code end-to-end for efficient record-replay. Our solutions leverage techniques from physiological transaction logs and recovery in database systems. Evaluations on modern ML benchmarks demonstrate that flor can produce fast checkpointing with small, user-specifiable overheads (e.g. 7%) while still providing hindsight log replay times orders of magnitude faster than restarting training from scratch.
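
    To make the record-replay idea concrete, here is a toy sketch of checkpointed training and post-hoc replay in plain Python. It is not flor's API: the fixed checkpoint interval, the file names, and the stand-in training step are all illustrative assumptions (flor's checkpointing is adaptive and its logging runs in the background).

        import pickle

        def train(epochs, checkpoint_every=10, replay_from=None, hindsight_log=None):
            """Toy training loop: optionally resume from a checkpointed state and
            replay a log statement that was added after the fact (hindsight logging)."""
            state = replay_from if replay_from is not None else {"epoch": 0, "loss": None}
            for epoch in range(state["epoch"], epochs):
                state = {"epoch": epoch + 1, "loss": 1.0 / (epoch + 1)}  # stand-in step
                if hindsight_log is not None:   # post-hoc log, replayed from a checkpoint
                    hindsight_log(state)
                if (epoch + 1) % checkpoint_every == 0:  # fixed here; adaptive in flor
                    with open(f"ckpt_{epoch + 1}.pkl", "wb") as f:
                        pickle.dump(state, f)
            return state

        # Fast optimistic run first, then replay only epochs 41-50 with extra logging:
        train(50)
        with open("ckpt_40.pkl", "rb") as f:
            train(50, replay_from=pickle.load(f), hindsight_log=print)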
  2. Logging is a significant programming practice. Due to the highly transactional nature of modern software applications, massive amounts of logs are generated every day, which may overwhelm developers. Logging information overload can be dangerous to software applications. Using log levels, developers can print useful information while hiding verbose logs during software runtime. As software evolves, the log levels of logging statements associated with the surrounding software feature implementation may also need to be altered. Maintaining log levels necessitates a significant amount of manual effort. In this paper, we demonstrate an automated approach that can rejuvenate feature log levels by matching the interest level of developers in the surrounding features. The approach is implemented as an open-source Eclipse plug-in, using two external plug-ins (JGit and Mylyn). It was tested on 18 open-source Java projects consisting of ~3 million lines of code and ~4K log statements. Our tool successfully analyzes 99.22% of logging statements, increases log level distributions by ~20%, and increases the focus of logs in bug fix contexts ~83% of the time. For further details, interested readers can watch our demonstration video (https://www.youtube.com/watch?v=qIULoAXoDv4).
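
    The paper's tool is a Java Eclipse plug-in built on JGit and Mylyn; purely to illustrate the core idea of tying log levels to developer interest, the small Python sketch below maps a normalized degree-of-interest (DOI) score for the surrounding feature to a log level. The thresholds and the DOI input are assumptions of ours, not the plug-in's algorithm.

        import logging

        logging.addLevelName(logging.DEBUG - 5, "TRACE")  # custom level for cold features
        logging.basicConfig(level=logging.DEBUG - 5)

        def rejuvenated_level(doi):
            """Map a degree-of-interest score in [0, 1] to a log level:
            hot features keep visible logs, cold features are demoted.
            Thresholds here are illustrative, not the plug-in's."""
            if doi >= 0.75:
                return logging.INFO        # actively developed feature
            if doi >= 0.40:
                return logging.DEBUG       # lukewarm: verbose-only
            return logging.DEBUG - 5       # cold: below DEBUG, hidden by default

        logger = logging.getLogger("feature.cache")
        logger.log(rejuvenated_level(0.9), "cache warmed with %d entries", 128)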
  3. SUMMARY

    We introduce a new finite-element (FE) based computational framework to solve forward and inverse elastic deformation problems for earthquake faulting via the adjoint method. Based on two advanced computational libraries, FEniCS and hIPPYlib for the forward and inverse problems, respectively, this framework is flexible, transparent and easily extensible. We represent a fault discontinuity through a mixed FE elasticity formulation, which approximates the stress with higher order accuracy and exposes the prescribed slip explicitly in the variational form without using conventional split node and decomposition discrete approaches. This also allows the first order optimality condition, that is the vanishing of the gradient, to be expressed in continuous form, which leads to consistent discretizations of all field variables, including the slip. We show comparisons with the standard, pure displacement formulation and a model containing an in-plane mode II crack, whose slip is prescribed via the split node technique. We demonstrate the potential of this new computational framework by performing a linear coseismic slip inversion through adjoint-based optimization methods, without requiring computation of elastic Green’s functions. Specifically, we consider a penalized least squares formulation, which in a Bayesian setting—under the assumption of Gaussian noise and prior—reflects the negative log of the posterior distribution. The comparison of the inversion results with a standard, linear inverse theory approach based on Okada’s solutions shows analogous results. Preliminary uncertainties are estimated via eigenvalue analysis of the Hessian of the penalized least squares objective function. Our implementation is fully open-source and Jupyter notebooks to reproduce our results are provided. The extension to a fully Bayesian framework for detailed uncertainty quantification and non-linear inversions, including for heterogeneous media earthquake problems, will be analysed in a forthcoming paper.
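
    For readers unfamiliar with the Bayesian reading of penalized least squares, the objective takes the generic form below; the notation (forward operator F, slip s, data d, and the covariance operators) is ours, written in LaTeX for clarity, not the paper's.

        % Generic penalized least-squares objective; under Gaussian noise and a
        % Gaussian prior, J(s) is the negative log posterior up to a constant.
        % The symbols F, s, d, s_0, \Gamma are illustrative, not the paper's.
        J(s) = \frac{1}{2}\,\big\| F(s) - d \big\|^{2}_{\Gamma_{\mathrm{noise}}^{-1}}
             + \frac{1}{2}\,\big\| s - s_{0} \big\|^{2}_{\Gamma_{\mathrm{prior}}^{-1}}

    Minimizing J(s) then coincides with maximizing the posterior, and the eigenvalue analysis of its Hessian mentioned above is what yields the preliminary uncertainty estimates.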

     
  4. Workflow reconstruction from logs is crucial for troubleshooting distributed systems. It is challenging, however, to extract enough information from logs while keeping a concise view, which makes manual log analysis hard to practice. Moreover, currently popular tools rely on identifier-based log parsing, leaving a large amount of workflow information unexploited. In this paper, we propose a log extraction approach, NLog, which utilizes natural language processing to obtain the key information from log messages and to identify the same object in logs generated by different statements, without any domain knowledge. We propose keyed messages, a new log storage structure, to store the parsed logs. We implement NLog and apply it to the distributed data analytics frameworks Spark and MapReduce. Evaluation results show that NLog can accurately identify the objects in log messages even without explicit identifiers. Using keyed messages, users get a concise yet flexible view of the workflows.
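
    As a rough sketch of what a keyed message store might look like, the Python snippet below buckets parsed lines under a key naming the object they describe, so that lines emitted by different statements about the same object land in one workflow view. The field layout and the toy extraction rule are our assumptions; NLog's actual NLP-based extraction and storage format are described in the paper.

        # Sketch of the "keyed message" idea as we read it; the key format and
        # the extraction rule below are assumptions, not NLog's implementation.
        import re
        from collections import defaultdict

        keyed_messages = defaultdict(list)

        def ingest(line):
            # Stand-in for NLog's NLP step: grab a "noun + id" pair as the object.
            m = re.search(r"\b(task|block|container)[ _]?(\w+)", line, re.IGNORECASE)
            key = f"{m.group(1).lower()}:{m.group(2)}" if m else "unkeyed"
            keyed_messages[key].append(line)

        ingest("Launching task 12 on executor 3")
        ingest("task 12 finished in 214 ms")
        # keyed_messages["task:12"] now groups both lines, emitted by two
        # different log statements, into one per-object workflow view.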
  5. Abstract

    The coronavirus pandemic has already caused severe problems for humanity and the economy. The exact impact of the COVID-19 pandemic is still unknown, and economists and financial advisers are exploring all possible scenarios to mitigate the risks arising from it. An intriguing question is whether, and to what extent, this pandemic and its impacts resemble other catastrophic events of the past, such as the 2009 Great Recession. This paper addresses that question by analyzing official public announcements and statements issued by federal authorities such as the Federal Reserve. More specifically, we measure the similarity of consecutive statements issued by the Federal Reserve during the Great Recession and the COVID-19 pandemic using natural language processing techniques. Furthermore, we explore the use of document embedding representations of the statements in a more complex task: clustering. Our analysis shows that, using an advanced NLP document embedding technique such as Doc2Vec, we can detect a difference of 10.8% in the similarities of Federal Open Market Committee (FOMC) statements issued during the Great Recession (2007–2009) and the COVID-19 pandemic. Finally, the results of our clustering exercise show that the document embedding representations of the statements are suitable for more complex tasks, providing a basis for future applications of state-of-the-art natural language processing techniques using the FOMC post-meeting statements as the dataset.
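
    A minimal version of this pipeline can be sketched with gensim's Doc2Vec and scikit-learn, as below; the two placeholder texts, the vector size, and the cluster count are illustrative stand-ins, not the paper's FOMC dataset or settings.

        # Embed documents with Doc2Vec, then cluster the embeddings.
        from gensim.models.doc2vec import Doc2Vec, TaggedDocument
        from sklearn.cluster import KMeans

        statements = [
            "the committee decided to lower the target range for the federal funds rate",
            "economic activity contracted sharply amid the public health crisis",
        ]  # placeholder texts; the real inputs are FOMC post-meeting statements
        corpus = [TaggedDocument(words=s.split(), tags=[i])
                  for i, s in enumerate(statements)]

        model = Doc2Vec(corpus, vector_size=50, min_count=1, epochs=40)
        vectors = [model.infer_vector(s.split()) for s in statements]

        # Cluster the document embeddings, as in the paper's clustering exercise.
        labels = KMeans(n_clusters=2, n_init=10).fit_predict(vectors)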

     