NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

SpikeSen: Low-Latency In-Sensor-Intelligence Design With Neuromorphic Spiking Neurons

https://doi.org/10.1109/TCSII.2023.3235888

Li, Ziru; Zheng, Qilin; Chen, Yiran; Li, Hai (June 2023, IEEE Transactions on Circuits and Systems II: Express Briefs)

Full Text Available
Photonic Bayesian Neural Network Using Programmed Optical Noises

https://doi.org/10.1109/JSTQE.2022.3217819

Wu, Changming; Yang, Xiaoxuan; Chen, Yiran; Li, Mo (March 2023, IEEE Journal of Selected Topics in Quantum Electronics)

Full Text Available
Approximate Computing and the Efficient Machine Learning Expedition

https://doi.org/10.1145/3508352.3561105

Henkel, Jörg; Li, Hai; Raghunathan, Anand; Tahoori, Mehdi B.; Venkataramani, Swagath; Yang, Xiaoxuan; Zervakis, Georgios (October 2022, the 41st IEEE/ACM International Conference on Computer-Aided Design)

Full Text Available
Processing-in-Memory Technology for Machine Learning: From Basic to ASIC

https://doi.org/10.1109/TCSII.2022.3168404

Taylor, Brady; Zheng, Qilin; Li, Ziru; Li, Shiyu; Chen, Yiran (June 2022, IEEE Transactions on Circuits and Systems II: Express Briefs)

Full Text Available
BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization

Yang, H.; Duan, L.; Chen, Y.; Li, H. (May 2021, International Conference on Learning Representations)
null (Ed.)
Full Text Available
An Efficient 3D ReRAM Convolution Processor Design for Binarized Weight Networks

https://doi.org/10.1109/TCSII.2021.3067840

Kim, Bokyung; Hanson, Edward; Li, Hai (May 2021, IEEE Transactions on Circuits and Systems II: Express Briefs)
null (Ed.)
Full Text Available
Exploring Applications of STT-RAM in GPU Architectures

https://doi.org/10.1109/TCSI.2020.3031895

Liu, Xiaoxiao; Mao, Mengjie; Bi, Xiuyuan; Li, Hai; Chen, Yiran (January 2021, IEEE Transactions on Circuits and Systems I: Regular Papers)
null (Ed.)
Full Text Available
ReTransformer: ReRAM-based processing-in-memory architecture for transformer acceleration

https://doi.org/10.1145/3400302.3415640

Yang, Xiaoxuan; Yan, Bonan; Li, Hai; Chen, Yiran (November 2020, IEEE/ACM International Conference on Computer-Aided Design (ICCAD),)
null (Ed.)
Full Text Available
Leveraging 3D Vertical RRAM to Developing Neuromorphic Architecture for Pattern Classification

https://doi.org/10.1109/ISVLSI49217.2020.00054

Kim, Bokyung; Li, Hai (July 2020, IEEE Computer Society Annual Symposium on VLSI (ISVLSI))
null (Ed.)
Full Text Available
Learning Low-rank Deep Neural Networks via Singular Vector Orthogonality Regularization and Singular Value Sparsification

Yang, Huanrui; Tang, Minxue; Yan, Feng; Hu, Daniel; Li, Ang; Li, Hai; Chen, Yiran (June 2020, Joint Workshop on Efficient Deep Learning in Computer Vision)

Modern deep neural networks (DNNs) often require high memory consumption and large computational loads. In order to deploy DNN algorithms efficiently on edge or mobile devices, a series of DNN compression algorithms have been explored, including factorization methods. Factorization methods approximate the weight matrix of a DNN layer with the multiplication of two or multiple low-rank matrices. However, it is hard to measure the ranks of DNN layers during the training process. Previous works mainly induce low-rank through implicit approximations or via costly singular value decomposition (SVD) process on every training step. The former approach usually induces a high accuracy loss while the latter has a low efficiency. In this work, we propose SVD training, the first method to explicitly achieve low-rank DNNs during training without applying SVD on every step. SVD training first decomposes each layer into the form of its full-rank SVD, then performs training directly on the decomposed weights. We add orthogonality regularization to the singular vectors, which ensure the valid form of SVD and avoid gradient vanishing/exploding. Low-rank is encouraged by applying sparsity-inducing regularizers on the singular values of each layer. Singular value pruning is applied at the end to explicitly reach a low-rank model. We empirically show that SVD training can significantly reduce the rank of DNN layers and achieve higher reduction on computation load under the same accuracy, comparing to not only previous factorization methods but also state-of-the-art filter pruning methods.
more » « less
Full Text Available

« Prev Next »

Search for: All records