NSF PAR Search | NSF Public Access Repository

Flumen: Dynamic Processing in the Photonic Interconnect

https://doi.org/10.1145/3579371.3589110

Shiflett, Kyle; Karanth, Avinash; Bunescu, Razvan; Louri, Ahmed (June 2023, ACM/IEEE International Symposium on Computer Architecture (ISCA))

Slack-Aware Packet Approximation for Energy-Efficient Network-on-Chips

https://doi.org/10.1109/TSUSC.2022.3213469

Chen, Yuechen; Louri, Ahmed; Liu, Shanshan; Lombardi, Fabrizio (January 2023, IEEE Transactions on Sustainable Computing)

GShuttle: Optimizing Memory Access Efficiency for Graph Convolutional Neural Network Accelerators

https://doi.org/10.1007/s11390-023-2875-9

Li, Jiajun; Wang, Ke; Zheng, Hao; Louri, Ahmed (January 2023, Journal of Computer Science and Technology)

SPRINT: A High-Performance, Energy-Efficient, and Scalable Chiplet-Based Accelerator With Photonic Interconnects for CNN Inference

https://doi.org/10.1109/TPDS.2021.3139015

Li, Yuan; Louri, Ahmed; Karanth, Avinash (October 2022, IEEE Transactions on Parallel and Distributed Systems)

FSA: An Efficient Fault-tolerant Systolic Array-based DNN Accelerator Architecture

https://doi.org/10.1109/ICCD56317.2022.00086

Zhao, Yingnan; Wang, Ke; Louri, Ahmed (October 2022, IEEE International Conference on Computer Design (ICCD))

Ascend: A Scalable and Energy-Efficient Deep Neural Network Accelerator With Photonic Interconnects

https://doi.org/10.1109/TCSI.2022.3169953

Li, Yuan; Wang, Ke; Zheng, Hao; Louri, Ahmed; Karanth, Avinash (July 2022, IEEE Transactions on Circuits and Systems I: Regular Papers)

SPACX: Silicon Photonics-based Scalable Chiplet Accelerator for DNN Inference

https://doi.org/10.1109/HPCA53966.2022.00066

Li, Yuan; Louri, Ahmed; Karanth, Avinash (April 2022, IEEE International Symposium on High-Performance Computer Architecture (HPCA))

In pursuit of higher inference accuracy, deep neural network (DNN) models have significantly increased in complexity and size. To overcome the consequent computational challenges, scalable chiplet-based accelerators have been proposed. However, data communication over metallic interconnects in these chiplet-based DNN accelerators is becoming a primary obstacle to performance, energy efficiency, and scalability. Photonic interconnects can provide adequate data communication support owing to properties such as low latency, high bandwidth and energy efficiency, and ease of broadcast communication. In this paper, we propose SPACX: a Silicon Photonics-based Chiplet ACcelerator for DNN inference applications. Specifically, SPACX includes a photonic network design that enables seamless single-chiplet and cross-chiplet broadcast communications, and a tailored dataflow that promotes data broadcast and maximizes parallelism. Furthermore, we explore the broadcast granularities of the photonic network and their implications for system performance and energy efficiency. A flexible bandwidth allocation scheme is also proposed to dynamically adjust communication bandwidths for different types of data. Simulation results using several DNN models show that SPACX can achieve 78% and 75% reductions in execution time and energy, respectively, compared to other state-of-the-art chiplet-based DNN accelerators.
AGAPE: Anomaly Detection with Generative Adversarial Network for Improved Performance, Energy, and Security in Manycore Systems

https://doi.org/10.23919/DATE54114.2022.9774693

Wang, Ke; Zheng, Hao; Li, Yuan; Li, Jiajun; Louri, Ahmed (March 2022, Design, Automation & Test in Europe Conference & Exhibition (DATE))

The security of manycore systems has become increasingly critical. In system-on-chips (SoCs), Hardware Trojans (HTs) manipulate the functionalities of the routing components to saturate the on-chip network, degrade performance, and leak sensitive data. Existing HT detection techniques, including runtime monitoring and state-of-the-art learning-based methods, are unable to identify implanted HTs in a timely and accurate manner, due to the increasingly dynamic and complex nature of on-chip communication behaviors. We propose AGAPE, a novel Generative Adversarial Network (GAN)-based anomaly detection and mitigation method against HTs for secured on-chip communication. AGAPE learns the distribution of the multivariate time series of a number of NoC attributes captured by on-chip sensors under both HT-free and HT-infected working conditions. The proposed GAN can learn the potential latent interactions among different runtime attributes concurrently, accurately distinguish abnormal attacked situations from normal SoC behaviors, and identify the type and location of the implanted HTs. Using the detection results, we apply the most suitable protection technique to each type of detected HT instead of simply isolating the entire HT-infected router, with the aim of mitigating security threats while reducing performance loss. Simulation results show that AGAPE enhances HT detection accuracy by 19% and reduces network latency and power consumption by 39% and 30%, respectively, compared to state-of-the-art security designs.
Adapt-NoC: A Flexible Network-on-Chip Design for Heterogeneous Manycore Architectures

https://doi.org/10.1109/HPCA51647.2021.00066

Zheng, Hao; Wang, Ke; Louri, Ahmed (February 2021, IEEE International Symposium on High-Performance Computer Architecture (HPCA))
The increased computational capability in heterogeneous manycore architectures facilitates the concurrent execution of many applications. This requires, among other things, a flexible, high-performance, and energy-efficient communication fabric capable of handling the variety of traffic patterns that arise when multiple applications run at the same time. Such stringent requirements pose a major challenge for current Network-on-Chip (NoC) design. In this paper, we propose Adapt-NoC, a flexible NoC architecture, along with a reinforcement learning (RL)-based control policy, that can provide efficient communication support for concurrent application execution. Adapt-NoC can dynamically allocate several disjoint regions of the NoC, called subNoCs, with different sizes and locations for the concurrently running applications. Each dynamically allocated subNoC can adapt to a given topology such as a mesh, cmesh, torus, or tree, thus tailoring the topology to the application's needs in terms of performance and power consumption. Moreover, we explore the use of RL to design an efficient control policy that optimizes the subNoC topology selection for a given application. As such, Adapt-NoC can not only provide several topology choices for concurrently running applications, but can also select the most suitable topology for a given application with the aim of improving performance and energy efficiency. We evaluate Adapt-NoC using both GPU and CPU benchmark suites. Simulation results show that the proposed Adapt-NoC can achieve up to 34% latency reduction, 10% overall execution time reduction, and 53% NoC energy-efficiency improvement when compared to prior work.
GCNAX: A Flexible and Energy-efficient Accelerator for Graph Convolutional Neural Networks

https://doi.org/10.1109/HPCA51647.2021.00070

Li, Jiajun; Louri, Ahmed; Karanth, Avinash; Bunescu, Razvan (February 2021, IEEE International Symposium on High-Performance Computer Architecture (HPCA))
