NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Slack-Aware Packet Approximation for Energy-Efficient Network-on-Chips

https://doi.org/10.1109/TSUSC.2022.3213469

Chen, Yuechen; Louri, Ahmed; Liu, Shanshan; Lombardi, Fabrizio (January 2023, IEEE Transactions on Sustainable Computing)

Full Text Available
A Technique for Approximate Communication in Network-on-Chips for Image Classification

https://doi.org/10.1109/TETC.2022.3162165

Chen, Yuechen; Liu, Shanshan; Lombardi, Fabrizio; Louri, Ahmed (January 2023, IEEE Transactions on Emerging Topics in Computing)

Full Text Available
Approximate Network-on-Chips with Application to Image Classification

https://doi.org/10.1109/NAS55553.2022.9925540

Chen, Yuechen; Louri, Ahmed; Liu, Shanshan; Lombardi, Fabrizio (October 2022, IEEE International Conference on Networking, Architecture, and Storage (NAS))

Full Text Available
Low-Power Approximate RPR Scheme for Unsigned Integer Arithmetic Computation

https://doi.org/10.1109/OJNANO.2022.3153329

Chen, Ke; Liu, Weiqiang; Louri, Ahmed; Lombardi, Fabrizio (January 2022, IEEE Open Journal of Nanotechnology)

Full Text Available
Stochastic Dividers for Low Latency Neural Networks

https://doi.org/10.1109/TCSI.2021.3103926

Liu, Shanshan; Tang, Xiaochen; Niknia, Farzad; Reviriego, Pedro; Liu, Weiqiang; Louri, Ahmed; Lombardi, Fabrizio (October 2021, IEEE Transactions on Circuits and Systems I: Regular Papers)

Full Text Available
Learning-Based Quality Management for Approximate Communication in Network-on-Chips

https://doi.org/10.1109/TCAD.2020.3012235

Chen, Yuechen; Louri, Ahmed (November 2020, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems)
null (Ed.)
Full Text Available
TSA-NoC: Learning-Based T hreat Detection and Mitigation for S ecure Network-on-Chip A rchitecture

https://doi.org/10.1109/MM.2020.3003576

Wang, Ke; Zheng, Hao; Louri, Ahmed (September 2020, IEEE Micro)
null (Ed.)
Full Text Available
An Approximate Communication Framework for Network-on-Chips

https://doi.org/10.1109/TPDS.2020.2968068

Chen, Yuechen; Louri, Ahmed (June 2020, IEEE Transactions on Parallel and Distributed Systems)

Full Text Available
CURE: A High-Performance, Low-Power, and Reliable Network-on-Chip Design Using Reinforcement Learning

https://doi.org/10.1109/TPDS.2020.2986297

Wang, Ke; Louri, Ahmed (April 2020, IEEE Transactions on Parallel and Distributed Systems)

We propose CURE, a deep reinforcement learning (DRL)-based NoC design framework that simultaneously reduces network latency, improves energy-efficiency, and tolerates transient errors and permanent faults. CURE has several architectural innovations and a DRL-based hardware controller to manage design complexity and optimize trade-offs. First, in CURE, we propose reversible multi-function adaptive channels (RMCs) to reduce NoC power consumption and network latency. Second, we implement a new fault-secure adaptive error correction hardware in each router to enhance reliability for both transient errors and permanent faults. Third, we propose a router power-gating and bypass design that powers off NoC components to reduce power and extend chip lifespan. Further, for the complex dynamic interactions of these techniques, we propose using DRL to train a proactive control policy to provide improved fault-tolerance, reduced power consumption, and improved performance. Simulation using the PARSEC benchmark shows that CURE reduces end-to-end packet latency by 39%, improves energy efficiency by 92%, and lowers static and dynamic power consumption by 24% and 38%, respectively, over conventional solutions. Using mean-time-to-failure, we show that CURE is 7.7x more reliable than the conventional NoC design.
more » « less
Full Text Available
Reduced Precision Redundancy for Reliable Processing of Data

https://doi.org/10.1109/TETC.2019.2947617

Liu, Shanshan; Chen, Ke; Reviriego, Pedro; Liu, Weiqiang; Louri, Ahmed; Lombardi, Fabrizio (October 2019, IEEE Transactions on Emerging Topics in Computing)

Information is an integral part of the correct and reliable operation of today's computing systems. Data either stored or provided as input to computation processing modules must be tolerant to many externally and internally induced destructive phenomena such as soft errors and faults, often of a transient nature but also in large numbers, thus causing catastrophic system failures. Together with error tolerance, reliable operation must be provided by reducing the large overheads often encountered at system-level when employing redundancy. While information-based techniques can also be used in some of these schemes, the complexity and limited capabilities for implementing high order correction functions for decoding limit their application due to poor performance; therefore, N Modular Redundancy (NMR) is often employed. In NMR the correct output is given by majority voting among the N input copies of data. Reduced Precision Redundancy (RPR) has been advocated to reduce the redundancy, mostly for the case of N = 3; in a 3RPR scheme, one full precision (FP) input is needed while two inputs require reduced precision (RP) (usually by truncating some of the least significant bits (LSBs) in the input data). However, its decision logic is more complex than a 3MR scheme. This paper proposes a novel NRPR scheme with a simple comparison-based approach; the realistic case of N = 5 is considered as an example to explain in detail such proposed scheme; different arrangements for the redundancy (with three or four FP data copies) are considered. In addition to the design of the decision circuit, a probabilistic analysis is also pursued to determine the conditions by which RPR data is provided as output; it is shown that its probability is very small. Different applications of the proposed NRPR system are presented; in these applications, data is used either as memory output and/or for computing the discrete cosine transform. In both cases, the proposed 5RPR scheme shows considerable advantages in terms of redundancy management and reliable image processing.
more » « less
Full Text Available

« Prev Next »

Search for: All records