BayesPerf: minimizing performance monitoring errors using Bayesian statistics

Banerjee, Subho S.; Jha, Saurabh; Kalbarczyk, Zbigniew; Iyer, Ravishankar K.

doi:10.1145/3445814.3446739

Citation Details

BayesPerf: minimizing performance monitoring errors using Bayesian statistics

Hardware performance counters (HPCs) that measure low-level architectural and microarchitectural events provide dynamic contextual information about the state of the system. However, HPC measurements are error-prone due to non determinism (e.g., undercounting due to event multiplexing, or OS interrupt-handling behaviors). In this paper, we present BayesPerf, a system for quantifying uncertainty in HPC measurements by using a domain-driven Bayesian model that captures microarchitectural relationships between HPCs to jointly infer their values as probability distributions. We provide the design and implementation of an accelerator that allows for low-latency and low-power inference of the BayesPerf model for x86 and ppc64 CPUs. BayesPerf reduces the average error in HPC measurements from 40.1% to 7.6% when events are being multiplexed. The value of BayesPerf in real-time decision-making is illustrated with a simple example of scheduling of PCIe transfers. more »

Award ID(s):: 2029049 1337732 1624790

PAR ID:: 10292981

Author(s) / Creator(s):: Banerjee, Subho S.; Jha, Saurabh; Kalbarczyk, Zbigniew; Iyer, Ravishankar K.

Date Published:: 2021-04-17

Journal Name:: The 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems. (ASPLOS ‘21)

Page Range / eLocation ID:: 832 to 844

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3445814.3446739

More Like this