NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Efficient Error Estimation for High-Level Design Space Exploration of Approximate Computing Systems

https://doi.org/10.1109/TVLSI.2023.3273478

Vaeztourshizi, Marzieh; Pedram, Massoud (July 2023, IEEE Transactions on Very Large Scale Integration (VLSI) Systems)

Full Text Available
Sparse Periodic Systolic Dataflow for Lowering Latency and Power Dissipation of Convolutional Neural Network Accelerators

https://doi.org/10.1145/3531437.3539715

Heo, Jung Hwan; Fayyazi, Arash; Esmaili, Amirhossein; Pedram, Massoud (August 2022, ISLPED)

Full Text Available
An Energy-Efficient Inference Method in Convolutional Neural Networks Based on Dynamic Adjustment of the Pruning Level

https://doi.org/10.1145/3460972

Maleki, Mohammad-Ali; Nabipour-Meybodi, Alireza; Kamal, Mehdi; Afzali-Kusha, Ali; Pedram, Massoud (August 2021, ACM Transactions on Design Automation of Electronic Systems)

In this article, we present a low-energy inference method for convolutional neural networks in image classification applications. The lower energy consumption is achieved by using a highly pruned (lower-energy) network if the resulting network can provide a correct output. More specifically, the proposed inference method makes use of two pruned neural networks (NNs), namely mildly and aggressively pruned networks, which are both designed offline. In the system, a third NN makes use of the input data for the online selection of the appropriate pruned network. The third network, for its feature extraction, employs the same convolutional layers as those of the aggressively pruned NN, thereby reducing the overhead of the online management. There is some accuracy loss induced by the proposed method where, for a given level of accuracy, the energy gain of the proposed method is considerably larger than the case of employing any one pruning level. The proposed method is independent of both the pruning method and the network architecture. The efficacy of the proposed inference method is assessed on Eyeriss hardware accelerator platform for some of the state-of-the-art NN architectures. Our studies show that this method may provide, on average, 70% energy reduction compared to the original NN at the cost of about 3% accuracy loss on the CIFAR-10 dataset.
more » « less
Full Text Available
JointDNN: An Efficient Training and Inference Engine for Intelligent Mobile Cloud Computing Services

https://doi.org/10.1109/TMC.2019.2947893

Eshratifar, Amir Erfan; Abrishami, Mohammad Saeed; Pedram, Massoud (February 2021, IEEE Transactions on Mobile Computing)
null (Ed.)
Full Text Available
DNR: A Tunable Robust Pruning Framework Through Dynamic Network Rewiring of DNNs

https://doi.org/10.1145/3394885.3431542

Kundu, Souvik; Nazemi, Mahdi; Beerel, Peter A.; Pedram, Massoud (January 2021, the 26th Asia and South Pacific Design Automation Conference)
null (Ed.)
Full Text Available
An Energy-Efficient Inference Method in Convolutional Neural Networks Based on Dynamic Adjustment of the Pruning Level

M. A. Maleki, A. Nabipour-Meybodi (January 2021, ACM transactions on design automation of electronic systems)

Full Text Available
SynergicLearning: neural network-based feature extraction for highly-accurate hyperdimensional learning

https://doi.org/10.1145/3400302.3415696

Nazemi, Mahdi; Fayyazi, Arash; Esmaili, Amirhossein; Pedram, Massoud (November 2020, Int’l Conf. on Computer-Aided Design)
null (Ed.)
Full Text Available

Search for: All records