An automated, cross-layer instrumentation framework for diagnosing performance problems in distributed applications

Ates, Emre; Sturmann, Lily; Toslali, Mert; Krieger, Orran; Megginson, Richard; Coskun, Ayse K.; Sambasivan, Raja R.

doi:10.1145/3357223.3362704

Citation Details

An automated, cross-layer instrumentation framework for diagnosing performance problems in distributed applications

Diagnosing performance problems in distributed applications is extremely challenging. A significant reason is that it is hard to know where to place instrumentation a priori to help diagnose problems that may occur in the future. We present the vision of an automated instrumentation framework, Pythia, that runs alongside deployed distributed applications. In response to a newly-observed performance problem, Pythia searches the space of possible instrumentation choices to enable the instrumentation needed to help diagnose it. Our vision for Pythia builds on workflow-centric tracing, which records the order and timing of how requests are processed within and among a distributed application's nodes (i.e., records their workflows). It uses the key insight that localizing the sources high performance variation within the workflows of requests that are expected to perform similarly gives insight into where additional instrumentation is needed. more »

Award ID(s):: 1815323 2016178

PAR ID:: 10126344

Author(s) / Creator(s):: Ates, Emre; Sturmann, Lily; Toslali, Mert; Krieger, Orran; Megginson, Richard; Coskun, Ayse K.; Sambasivan, Raja R.

Date Published:: 2019-11-20

Journal Name:: Proceedings of the ACM Symposium on Cloud Computing

Page Range / eLocation ID:: 165 to 170

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3357223.3362704

More Like this