Predictive Modeling of HPC Job Queue Times: Improving User Decision-Making and Resource Utilization

Gaikwad, Bipin; Simakov, Nikolay A; Furlani, Thomas; White, Joseph Patrick; Patra, Abani

doi:10.1145/3708035.3736067

Citation Details

This content will become publicly available on July 18, 2026

Predictive Modeling of HPC Job Queue Times: Improving User Decision-Making and Resource Utilization

This work presents a framework for estimating job wait times in High-Performance Computing (HPC) scheduling queues, leverag- ing historical job scheduling data and real-time system metrics. Using machine learning techniques, specifically Random Forest and Multi-Layer Perceptron (MLP) models, we demonstrate high accuracy in predicting wait times, achieving 94.2% reliability within a 10-minute error margin. The framework incorporates key fea- tures such as requested resources, queue occupancy, and system utilization, with ablation studies revealing the significance of these features. Additionally, the framework offers users wait time esti- mates for different resource configurations, enabling them to select optimal resources, reduce delays, and accelerate computational workloads. Our approach provides valuable insights for both users and administrators to optimize job scheduling, contributing to more efficient resource management and faster time to scientific results. more »

Award ID(s):: 2137603

PAR ID:: 10625658

Author(s) / Creator(s):: Gaikwad, Bipin; Simakov, Nikolay A; Furlani, Thomas; White, Joseph Patrick; Patra, Abani

Publisher / Repository:: ACM

Date Published:: 2025-07-18

ISBN:: 9798400713989

Page Range / eLocation ID:: 1 to 4

Format(s):: Medium: X

Location:: Columbus Ohio USA

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on July 18, 2026
Conference Paper:
https://doi.org/10.1145/3708035.3736067

More Like this