NSF PAR Search | NSF Public Access Repository

Towards Compact Neural Networks via End-to-End Training: A Bayesian Tensor Approach with Automatic Rank Determination

https://doi.org/10.1137/21M1391444

Hawkins, Cole; Liu, Xing; Zhang, Zheng (March 2022, SIAM Journal on Mathematics of Data Science)

Full Text Available
General-Purpose Bayesian Tensor Learning With Automatic Rank Determination and Uncertainty Quantification

https://doi.org/10.3389/frai.2021.668353

Zhang, Kaiqi; Hawkins, Cole; Zhang, Zheng (January 2022, Frontiers in Artificial Intelligence)

A major challenge in many machine learning tasks is that a model's expressive power depends on its size. Low-rank tensor methods are an efficient tool for handling the curse of dimensionality in many large-scale machine learning models. The major challenges in training a tensor learning model include how to process high-volume data, how to determine the tensor rank automatically, and how to estimate the uncertainty of the results. While existing tensor learning methods each focus on a specific task, this paper proposes a generic Bayesian framework that can be employed to solve a broad class of tensor learning problems such as tensor completion, tensor regression, and tensorized neural networks. We develop a low-rank tensor prior for automatic rank determination in nonlinear problems. Our method is implemented with both stochastic gradient Hamiltonian Monte Carlo (SGHMC) and Stein variational gradient descent (SVGD). We compare the automatic rank determination and uncertainty quantification of these two solvers. We demonstrate that our proposed method can determine the tensor rank automatically and can quantify the uncertainty of the obtained results. We validate our framework on tensor completion tasks and tensorized neural network training tasks.
Full Text Available
Bayesian tensorized neural networks with automatic rank selection

https://doi.org/10.1016/j.neucom.2021.04.117

Hawkins, Cole; Zhang, Zheng (September 2021, Neurocomputing)

Tensor decomposition is an effective approach to compress over-parameterized neural networks and to enable their deployment on resource-constrained hardware platforms. However, directly applying tensor compression in the training process is challenging because it is difficult to choose a proper tensor rank. To address this challenge, this paper proposes a low-rank Bayesian tensorized neural network. Our Bayesian method performs automatic model compression via adaptive tensor rank determination. We also present approaches for posterior density calculation and maximum a posteriori (MAP) estimation for the end-to-end training of our tensorized neural network. We provide experimental validation on a two-layer fully connected neural network, a 6-layer CNN, and a 110-layer residual neural network, where our work produces 7.4× to 137× more compact neural networks directly from training while achieving high prediction accuracy.
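To make the link between tensor rank and model size concrete, the sketch below counts the parameters of one fully connected layer stored densely and stored in a tensor-train (TT) matrix format. The abstract does not say which tensor format is used, so the TT-matrix layout, the 1024 × 1024 layer, its factorization into four modes, and the ranks are all illustrative assumptions. It is not the paper's method and does not reproduce its results.

```python
from math import prod


def dense_params(in_dims, out_dims):
    """Parameter count of a dense weight matrix of shape prod(in_dims) x prod(out_dims)."""
    return prod(in_dims) * prod(out_dims)


def tt_matrix_params(in_dims, out_dims, ranks):
    """Parameter count of a tensor-train (TT) matrix factorization.

    Core k has shape (ranks[k], in_dims[k], out_dims[k], ranks[k + 1]),
    and the boundary ranks must both equal 1.
    """
    if len(in_dims) != len(out_dims):
        raise ValueError("in_dims and out_dims must have the same length")
    if len(ranks) != len(in_dims) + 1:
        raise ValueError("ranks must have one more entry than in_dims")
    if ranks[0] != 1 or ranks[-1] != 1:
        raise ValueError("boundary ranks must be 1")
    return sum(
        ranks[k] * in_dims[k] * out_dims[k] * ranks[k + 1]
        for k in range(len(in_dims))
    )


# Hypothetical layer: 1024 inputs and 1024 outputs, each factored as 4 * 8 * 8 * 4.
in_dims = (4, 8, 8, 4)
out_dims = (4, 8, 8, 4)
ranks = (1, 8, 8, 8, 1)  # assumed TT ranks; smaller ranks mean fewer parameters

dense = dense_params(in_dims, out_dims)
tt = tt_matrix_params(in_dims, out_dims, ranks)
print(f"dense: {dense}, TT: {tt}, compression ratio: {dense / tt:.1f}x")
```

Because each core's size scales with the product of its two neighboring ranks, lowering a rank shrinks the model quickly, which is why choosing the rank automatically during training matters for the final compression ratio.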
Full Text Available
Sparse Tucker Tensor Decomposition on a Hybrid FPGA–CPU Platform

https://doi.org/10.1109/TCAD.2020.3032626

Jiang, Weiyun; Zhang, Kaiqi; Lin, Colin Yu; Xing, Feng; Zhang, Zheng (September 2021, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems)

Full Text Available
3U-EdgeAI: Ultra-Low Memory Training, Ultra-Low Bitwidth Quantization, and Ultra-Low Latency Acceleration

https://doi.org/10.1145/3453688.3461738

Chen, Yao; Hawkins, Cole; Zhang, Kaiqi; Zhang, Zheng; Hao, Cong (June 2021, Great Lakes Symposium on VLSI)

Full Text Available
On-FPGA training with ultra memory reduction: A low-precision tensor method

Zhang, Kaiqi; Hawkins, Cole; Zhang, Xiyuan; Hao, Cong; Zhang, Zheng (May 2021, ICLR Workshop on Hardware Aware Efficient Training)

Various hardware accelerators have been developed for energy-efficient and real-time inference of neural networks on edge devices. However, most training is done on high-performance GPUs or servers, and the huge memory and computing costs prevent training neural networks on edge devices. This paper proposes a novel tensor-based training framework, which offers orders-of-magnitude memory reduction in the training process. We propose a novel rank-adaptive tensorized neural network model and design a hardware-friendly low-precision algorithm to train this model. We present an FPGA accelerator to demonstrate the benefits of this training method on edge devices. Our preliminary FPGA implementation achieves a 59× speedup and a 123× energy reduction compared to an embedded CPU, and a 292× memory reduction over standard full-size training.
Full Text Available
Fast Search of the Optimal Contraction Sequence in Tensor Networks

https://doi.org/10.1109/JSTSP.2021.3051231

Liang, Ling; Xu, Jianyu; Deng, Lei; Yan, Mingyu; Hu, Xing; Zhang, Zheng; Li, Guoqi; Xie, Yuan (April 2021, IEEE Journal of Selected Topics in Signal Processing)

Full Text Available
Hardware-Enabled Efficient Data Processing with Tensor-Train Decomposition

https://doi.org/10.1109/TCAD.2021.3058317

Qu, Zheng; Deng, Lei; Wang, Bangyan; Chen, Hengnu; Lin, Jilan; Liang, Ling; Li, Guoqi; Zhang, Zheng; Xie, Yuan (February 2021, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems)

Full Text Available
NeuroMeter: An Integrated Power, Area, and Timing Modeling Framework for Machine Learning Accelerators Industry Track Paper

https://doi.org/10.1109/HPCA51647.2021.00075

Tang, Tianqi; Li, Sheng; Nai, Lifeng; Jouppi, Norm; Xie, Yuan (February 2021, 2021 IEEE International Symposium on High-Performance Computer Architecture (HPCA))

Full Text Available
fuseGNN: Accelerating Graph Convolutional Neural Network Training on GPGPU

https://doi.org/10.1145/3400302

Chen, Zhaodong and (November 2020, 2020 IEEE/ACM International Conference On Computer Aided Design (ICCAD))
Full Text Available
