- Home
- Search Results
- Page 1 of 1
Search for: All records
-
Total Resources3
- Resource Type
-
0002100000000000
- More
- Availability
-
30
- Author / Contributor
- Filter by Author / Creator
-
-
Ding, Caiwen (3)
-
Wen, Wujie (3)
-
Xu, Xiaolin (3)
-
Duan, Shijin (2)
-
Luo, Yukui (2)
-
Peng, Hongwu (2)
-
Ran, Ran (2)
-
Wang, Chenghong (2)
-
Xu, Nuo (2)
-
Geng, Tong (1)
-
Liu, Tao (1)
-
Luo, Xinwei (1)
-
Mahmood, Kaleel (1)
-
Quan, Gang (1)
-
Wang, Wei (1)
-
Zhao, Jiahui (1)
-
Zhou, Shanglin (1)
-
#Tyler Phillips, Kenneth E. (0)
-
#Willis, Ciara (0)
-
& Abreu-Ramos, E. D. (0)
-
- Filter by Editor
-
-
& Spizer, S. M. (0)
-
& . Spizer, S. (0)
-
& Ahn, J. (0)
-
& Bateiha, S. (0)
-
& Bosch, N. (0)
-
& Brennan K. (0)
-
& Brennan, K. (0)
-
& Chen, B. (0)
-
& Chen, Bodong (0)
-
& Drown, S. (0)
-
& Ferretti, F. (0)
-
& Higgins, A. (0)
-
& J. Peters (0)
-
& Kali, Y. (0)
-
& Ruiz-Arias, P.M. (0)
-
& S. Spitzer (0)
-
& Sahin. I. (0)
-
& Spitzer, S. (0)
-
& Spitzer, S.M. (0)
-
(submitted - in Review for IEEE ICASSP-2024) (0)
-
-
Have feedback or suggestions for a way to improve these results?
!
Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Peng, Hongwu; Zhou, Shanglin; Luo, Yukui; Xu, Nuo; Duan, Shijin; Ran, Ran; Zhao, Jiahui; Wang, Chenghong; Geng, Tong; Wen, Wujie; et al (, 2023 60th ACM/IEEE Design Automation Conference (DAC))
-
Ran, Ran; Luo, Xinwei; Wang, Wei; Liu, Tao; Quan, Gang; Xu, Xiaolin; Ding, Caiwen; Wen, Wujie (, Proceedings of the 40th International Conference on Machine Learning, PMLR)Homomorphic Encryption (HE) is a promising technology to protect clients’ data privacy for Machine Learning as a Service (MLaaS) on public clouds. However, HE operations can be orders of magnitude slower than their counterparts for plaintexts and thus result in prohibitively high inference latency, seriously hindering the practicality of HE. In this paper, we propose a HE-based fast neural network (NN) inference framework–SpENCNN built upon the co-design of HE operation-aware model sparsity and the single-instruction-multiple-data (SIMD)-friendly data packing, to improve NN inference latency. In particular, we first develop an encryption-aware HE-group convolution technique that can partition channels among different groups based on the data size and ciphertext size, and then encode them into the same ciphertext by novel group-interleaved encoding, so as to dramatically reduce the number of bottlenecked operations in HE convolution. We further tailor a HE-friendly sub-block weight pruning to reduce the costly HE-based convolution operation. Our experiments show that SpENCNN can achieve overall speedups of 8.37×, 12.11×, 19.26×, and 1.87× for LeNet, VGG-5, HEFNet, and ResNet-20 respectively, with negligible accuracy loss. Our code is publicly available at https://github.com/ranran0523/SPECNN.more » « less
An official website of the United States government

Full Text Available