skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Vendor-neutral and Production-grade Job Power Management in High Performance Computing
Power management and energy efficiency are critical research areas for exascale computing and beyond, necessitating reliable telemetry and control for distributed systems. Despite this need, existing approaches present several limitations precluding their adoption in production. These limitations include, but are not limited to, lack of portability due to vendor-specific and closed-source solutions, lack of support for non-MPI applications, and lack of user-level customization. We present a job-level power management framework based on Flux. We introduce flux-power-monitor and demonstrate its effectiveness on the Lassen (IBM Power AC922) and Tioga (HPE Cray EX235A) systems with a low average overhead of 0.4%. We also present flux-power-manager, where we discuss a proportional sharing policy and introduce a hierarchical FFT-based dynamic power management algorithm (FPP). We demonstrate that FPP reduces energy by 1% compared to proportional sharing, and by 20% compared to the default IBM static power capping policy.  more » « less
Award ID(s):
1942182
PAR ID:
10566577
Author(s) / Creator(s):
; ; ; ;
Publisher / Repository:
IEEE
Date Published:
ISBN:
979-8-3503-5554-3
Page Range / eLocation ID:
1845 to 1855
Format(s):
Medium: X
Location:
Atlanta, GA, USA
Sponsoring Org:
National Science Foundation
More Like this
  1. Optimal power management of battery energy storage systems (BESS) is crucial for their safe and efficient operation. Numerical optimization techniques are frequently utilized to solve the optimal power management problems. However, these techniques often fall short of delivering real-time solutions for large-scale BESS due to their computational complexity. To address this issue, this paper proposes a computationally efficient approach. We introduce a new set of decision variables called power-sharing ratios corresponding to each cell, indicating their allocated power share from the output power demand. We then formulate an optimal power management problem to minimize the system-wide power losses while ensuring compliance with safety, balancing, and power supply-demand match constraints. To efficiently solve this problem, a parameterized control policy is designed and leveraged to transform the optimal power management problem into a parameter estimation problem. We then implement the ensemble Kalman inversion to estimate the optimal parameter set. The proposed approach significantly reduces computational requirements due to 1) the much lower dimensionality of the decision parameters and 2) the estimation treatment of the optimal power management problem. Finally, we conduct extensive simulations to validate the effectiveness of the proposed approach. The results show promise in accuracy and computation time compared with explored numerical optimization techniques. 
    more » « less
  2. In this paper, we propose a droop-free distributed frequency control for the hybrid photovoltaic and battery energy storage (PV-BES) based microgrid. A distributed state of charge (SOC) balancing regulator achieves balanced SOC among the distributed generators (DGs) with BES utilizing a distributed average SOC estimator and the power sharing regulator ensures proportional power sharing among the PV-BES based DGs. These regulators generate two frequency correction terms which are then added to the microgrid rated frequency to generate references for the lower level controllers. The performance of the proposed distributed control is validated through real-time simulations in OPAL-RT, which demonstrates the effectiveness of the proposed control in achieving frequency regulation, SOC balancing, and active power sharing in the hybrid PV-BES units under both islanded and grid-connected operation modes. 
    more » « less
  3. Irfan Awan; Muhammad Younas; Jamal Bentahar; Salima Benbernou (Ed.)
    Multi-site clinical trial systems face security challenges when streamlining information sharing while protecting patient privacy. In addition, patient enrollment, transparency, traceability, data integrity, and reporting in clinical trial systems are all critical aspects of maintaining data compliance. A Blockchain-based clinical trial framework has been proposed by lots of researchers and industrial companies recently, but its limitations of lack of data governance, limited confidentiality, and high communication overhead made data-sharing systems insecure and not efficient. We propose 𝖲𝗈𝗍𝖾𝗋𝗂𝖺, a privacy-preserving smart contracts framework, to manage, share and analyze clinical trial data on fabric private chaincode (FPC). Compared to public Blockchain, fabric has fewer participants with an efficient consensus protocol. 𝖲𝗈𝗍𝖾𝗋𝗂𝖺 consists of several modules: patient consent and clinical trial approval management chaincode, secure execution for confidential data sharing, API Gateway, and decentralized data governance with adaptive threshold signature (ATS). We implemented two versions of 𝖲𝗈𝗍𝖾𝗋𝗂𝖺 with non-SGX deploys on AWS blockchain and SGX-based on a local data center. We evaluated the response time for all of the access endpoints on AWS Managed Blockchain, and demonstrated the utilization of SGX-based smart contracts for data sharing and analysis. 
    more » « less
  4. Abstract A primary challenge in advancing sustainability in rangelands and drylands is the lack of governance systems that are linked to information about highly variable ecosystem conditions. Here, we describe the national‐scale implementation of a resilience‐based management system in the rangelands of Mongolia. The system comprises several interacting elements. Land type‐specific information about rangeland conditions was captured in vegetation state‐and‐transition models (STMs) that allow interpretation of monitoring data and locally tailored restoration recommendations. Rangeland monitoring systems based on standardized protocols were developed and have been adopted by national government agencies, which provide annual, high‐quality data on rangeland conditions on which to base and adjust management decisions. Rangeland use agreements between local governments and herders' collective organizations, called Pasture Users' Groups, define their respective rights and responsibilities and introduce economic and policy incentives for management changes. Pasture Users' Groups also provide a platform for information sharing and collective action. Rangeland condition data and other indicators are linked to the Responsible Nomads product traceability system that provides consumers and industry a means to associate products with sustainable rangeland management practices. The collaboration between national agencies, international donors, scientists, and herders has been essential to initial success, but longer term support and monitoring will be needed to assess whether the adoption of resilience‐based management leads to positive social and ecological outcomes. We draw generalizations and lessons learned from this effort, which can lead to the successful implementation of new management systems across global rangelands. 
    more » « less
  5. null (Ed.)
    Many high-performance systems now include different types of memory devices within the same compute platform to meet strict performance and cost constraints. Such heterogeneous memory systems often include an upper-level tier with better performance, but limited capacity, and lower-level tiers with higher capacity, but less bandwidth and longer latencies for reads and writes. To utilize the different memory layers efficiently, current systems rely on hardware-directed, memory -side caching or they provide facilities in the operating system (OS) that allow applications to make their own data-tier assignments. Since these data management options each come with their own set of trade-offs, many systems also include mixed data management configurations that allow applications to employ hardware- and software-directed management simultaneously, but for different portions of their address space. Despite the opportunity to address limitations of stand-alone data management options, such mixed management modes are under-utilized in practice, and have not been evaluated in prior studies of complex memory hardware. In this work, we develop custom program profiling, configurations, and policies to study the potential of mixed data management modes to outperform hardware- or software-based management schemes alone. Our experiments, conducted on an Intel ® Knights Landing platform with high-bandwidth memory, demonstrate that the mixed data management mode achieves the same or better performance than the best stand-alone option for five memory intensive benchmark applications (run separately and in isolation), resulting in an average speedup compared to the best stand-alone policy of over 10 %, on average. 
    more » « less