NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

CHROME: Concurrency-Aware Holistic Cache Management Framework with Online Reinforcement Learning

https://doi.org/10.1109/HPCA57654.2024.00090

Lu, Xiaoyang; Najafi, Hamed; Liu, Jason; Sun, Xian-He (March 2024, 30th IEEE International Symposium on High-Performance Computer Architecture (HPCA))

Cache management is a critical aspect of computer architecture, encompassing techniques such as cache replacement, bypassing, and prefetching. Existing research has often focused on individual techniques, overlooking the potential benefits of joint optimization. Moreover, many of these approaches rely on static and intuition-driven policies, limiting their performance under complex and dynamic workloads. To address these challenges, this paper introduces CHROME, a novel concurrencyaware cache management framework. CHROME takes a holistic approach by seamlessly integrating intelligent cache replacement and bypassing with pattern-based prefetching. By leveraging online reinforcement learning, CHROME dynamically adapts cache decisions based on multiple program features and applies a reward for each decision that considers the accuracy of the action and the system-level feedback information. Our performance evaluation demonstrates that CHROME outperforms current state-of-the-art schemes, exhibiting significant improvements in cache management. Notably, CHROME achieves a remarkable performance boost of up to 13.7% over the traditional LRU method in multi-core systems with only modest overhead.
more » « less
Full Text Available
DeepSim: A Transformer Based Model For Fast Simulation And Exploring Computer System Design Space

https://doi.org/10.1145/3573900.3593634

Najafi, Hamed; Lu, Xiaoyang (June 2023, SIGSIM-PADS '23: Proceedings of the 2023 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation)

Full Text Available
The Memory-Bounded Speedup Model and Its Impacts in Computing

https://doi.org/10.1007/s11390-022-2911-1

Sun, Xian-He; Lu, Xiaoyang (February 2023, Journal of Computer Science and Technology)

Full Text Available
CARE: A Concurrency-Aware Enhanced Lightweight Cache Management Framework

https://doi.org/10.1109/HPCA56546.2023.10071125

Lu, Xiaoyang; Wang, Rujia; Sun, Xian-He (February 2023, 2023 IEEE International Symposium on High-Performance Computer Architecture (HPCA))

Full Text Available
A Generalized Model for Modern Hierarchical Memory System

https://doi.org/10.1109/WSC57314.2022.10015298

Najafi, Hamed; Liu, Jason; Lu, Xiaoyang; Sun, Xian-He (December 2022, 2022 Winter Simulation Conference (WSC))

Memory system is critical to architecture design which can significantly impact application performance. Concurrent Average Memory Access Time (C-AMAT) is a model for analyzing and optimizing memory system performance using a recursive definition of the memory access latency along the memory hierarchy. The original C-AMAT model, however, does not provide the necessary granularity and flexibility for handling modern memory architectures with heterogeneous memory technologies and diverse system topology. We propose to augment C-AMAT to take into consideration the idiosyncrasies of individual cache/memory components as well as their topological arrangement in the memory architecture design. Through trace-based simulation, we validate the augmented model and examine the memory system performance with insight unavailable using the original C-AMAT model.
more » « less
Full Text Available
Accelerating Graph Processing With Lightweight Learning-Based Data Reordering

https://doi.org/10.1109/LCA.2022.3151087

Zou, Mo; Zhang, Mingzhe; Wang, Rujia; Sun, Xian-He; Ye, Xiaochun; Fan, Dongrui; Tang, Zhimin (January 2022, IEEE Computer Architecture Letters)

Full Text Available
Premier: A Concurrency-Aware Pseudo-Partitioning Framework for Shared Last-Level Cache

https://doi.org/10.1109/ICCD53106.2021.00068

Lu, Xiaoyang; Wang, Rujia; Sun, Xian-He (October 2021, IEEE 39th International Conference on Computer Design (ICCD))

Full Text Available
CoPIM: A Concurrency-aware PIM Workload Offloading Architecture for Graph Applications

https://doi.org/10.1109/ISLPED52811.2021.9502483

Yan, Liang; Zhang, Mingzhe; Wang, Rujia; Chen, Xiaoming; Zou, Xingqi; Lu, Xiaoyang; Han, Yinhe; Sun, Xian-He (July 2021, Proceedings of the 2021 ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED202))

Full Text Available
A Study on Modeling and Optimization of Memory Systems

https://doi.org/10.1007/s11390-021-0771-8

Liu, Jason; Espina, Pedro; Sun, Xian-He (January 2021, Journal of Computer Science and Technology)
null (Ed.)
Full Text Available
Simulus: Easy Breezy Simulation in Python

https://doi.org/10.1109/WSC48552.2020.9383886

Liu, Jason (December 2020, 2020 Winter Simulation Conference)
null (Ed.)
This paper introduces Simulus, a full-fledged open-source discrete-event simulator, supporting both event-driven and process-oriented simulation world-views. Simulus is implemented in Python and aspires to be a part of the Python's ecosystem supporting scientific computing. Simulus also provides several advanced modeling constructs to ease common simulation tasks (e.g., complex queuing models, interprocess synchronizations, and message-passing communications). Simulus also provides organic support for simultaneously running a time-synchronized group of simulators, either sequentially or in parallel, thereby allowing composable simulation of individual simulators handling different aspects of a target system, and enabling large-scale simulation running on parallel computers. This paper describes the salient features of Simulus and examines its major design decisions.
more » « less
Full Text Available

« Prev Next »

Search for: All records