HPC Andragogy: Automating Batch Scheduler Feedback

Tsoukalas, Kyriakos

doi:10.22369/issn.2153-4136/16/1/11

This content will become publicly available on March 1, 2026

This paper proposes a monitoring system that emails users feedback about their submitted jobs and can stop and resubmit jobs to a batch scheduler. The proposed system has been implemented for a small supercomputing environment with a mix of high-performance and high-throughput computing jobs. User feedback includes alerts for over- and under-utilization of CPU and physical memory. The paper also discusses how the predefined system thresholds were chosen and proposes three algorithms: one for the monitoring system itself and two for predicting CPU and physical memory utilization. The two prediction algorithms rely on the user supplying the identification string (job ID) of a similar job that finished execution without errors. Lastly, a git repository is shared to make the code accessible for review.

Award ID(s): 2346664

PAR ID: 10577944

Author(s) / Creator(s): Tsoukalas, Kyriakos

Publisher / Repository: The Journal of Computational Science Education

Date Published: 2025-03-01

Journal Name: The Journal of Computational Science Education

Volume: 16

Issue: 1

ISSN: 2153-4136

Page Range / eLocation ID: 57 to 61

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Journal Article:
https://doi.org/10.22369/issn.2153-4136/16/1/11
