Search for: All records

Award ID contains: 1955246

Total Resources: 12

Resource Type
Conference Paper: 6
Conference Proceeding: 0
Dataset: 0
Journal Article: 6
Workshop Report: 0

Availability
Full Text / Resource Available: 11
Citation Only: 1

SpikeSen: Low-Latency In-Sensor-Intelligence Design With Neuromorphic Spiking Neurons

https://doi.org/10.1109/TCSII.2023.3235888

Li, Ziru; Zheng, Qilin; Chen, Yiran; Li, Hai (June 2023, IEEE Transactions on Circuits and Systems II: Express Briefs)

Free, publicly-accessible full text available June 1, 2024

DefT: Boosting Scalability of Deformable Convolution Operations on GPUs

https://doi.org/10.1145/3582016.3582017

Hanson, Edward; Horton, Mark; Li, Hai; Chen, Yiran (March 2023, The 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3 (ASPLOS ’23))

Deformable Convolutional Networks (DCN) have been proposed as a powerful tool to boost the representation power of Convolutional Neural Networks (CNN) in computer vision tasks via adaptive sampling of the input feature map. Much like vision transformers, DCNs utilize a more flexible inductive bias than standard CNNs and have also been shown to improve performance of particular models. For example, drop-in DCN layers were shown to increase the AP score of Mask RCNN by 10.6 points while introducing only 1% additional parameters and FLOPs, improving the state-of-the-art model at the time of publication. However, despite evidence that more DCN layers placed earlier in the network can further improve performance, we have not seen this trend continue with further scaling of deformations in CNNs, unlike for vision transformers. Benchmarking experiments show that a realistically sized DCN layer (64H×64W, 64 in-out channel) incurs a 4× slowdown on a GPU platform, discouraging the more ubiquitous use of deformations in CNNs. These slowdowns are caused by the irregular input-dependent access patterns of the bilinear interpolation operator, which has a disproportionately low arithmetic intensity (AI) compared to the rest of the DCN. To address the disproportionate slowdown of DCNs and enable their expanded use in CNNs, we propose DefT, a series of workload-aware optimizations for DCN kernels. DefT identifies performance bottlenecks in DCNs and fuses specific operators that are observed to limit DCN AI. Our approach also uses statistical information of DCN workloads to adapt the workload tiling to the DCN layer dimensions, minimizing costly out-of-boundary input accesses. Experimental results show that DefT mitigates up to half of DCN slowdown over the current-art PyTorch implementation. This translates to a layerwise speedup of up to 134% and a reduction of normalized training time of 46% on a fully DCN-enabled ResNet model.
Full Text Available

DyNNamic: Dynamically Reshaping, High Data-Reuse Accelerator for Compact DNNs

https://doi.org/10.1109/TC.2022.3184272

Hanson, Edward; Li, Shiyu; Qian, Xuehai; Li, Hai Helen; Chen, Yiran (March 2023, IEEE Transactions on Computers)

Full Text Available

Photonic Bayesian Neural Network Using Programmed Optical Noises

https://doi.org/10.1109/JSTQE.2022.3217819

Wu, Changming; Yang, Xiaoxuan; Chen, Yiran; Li, Mo (March 2023, IEEE Journal of Selected Topics in Quantum Electronics)

Full Text Available

Improving the Robustness and Efficiency of PIM-Based Architecture by SW/HW Co-Design

https://doi.org/10.1145/3566097.3568358

Yang, Xiaoxuan; Li, Shiyu; Zheng, Qilin; Chen, Yiran (January 2023, 2023 28th Asia and South Pacific Design Automation Conference (ASP-DAC))

Processing-in-memory (PIM) based architecture shows great potential to process several emerging artificial intelligence workloads, including vision and language models. Cross-layer optimizations could bridge the gap between computing density and the available resources by reducing the computation and memory cost of the model and improving the model’s robustness against non-ideal hardware effects. We first introduce several hardware-aware training methods to improve the model robustness to the PIM device’s non-ideal effects, including stuck-at-fault, process variation, and thermal noise. Then, we further demonstrate a software/hardware (SW/HW) co-design methodology to efficiently process the state-of-the-art attention-based model on PIM-based architecture by performing sparsity exploration for the attention-based model and circuit architecture co-design to support the sparse processing.
Full Text Available

On Building Efficient and Robust Neural Network Designs

https://doi.org/10.1109/IEEECONF56349.2022.10051891

Yang, Xiaoxuan; Yang, Huanrui; Zhang, Jingchi; Li, Hai Helen; Chen, Yiran (October 2022, The 56th Asilomar Conference on Signals, Systems, and Computers)

Neural network models have demonstrated outstanding performance in a variety of applications, from image classification to natural language processing. However, deploying the models to hardware raises efficiency and reliability issues. From the efficiency perspective, the storage, computation, and communication cost of neural network processing is considerably large because the neural network models have a large number of parameters and operations. From the standpoint of robustness, the perturbation in hardware is unavoidable and thus the performance of neural networks can be degraded. As a result, this paper investigates effective learning and optimization approaches as well as advanced hardware designs in order to build efficient and robust neural network designs.
Full Text Available

HERO: hessian-enhanced robust optimization for unifying and improving generalization and quantization performance

https://doi.org/10.1145/3489517.3530678

Yang, Huanrui; Yang, Xiaoxuan; Gong, Neil Zhenqiang; Chen, Yiran (July 2022, The 59th ACM/IEEE Design Automation Conference)

With the recent demand of deploying neural network models on mobile and edge devices, it is desired to improve the model's generalizability on unseen testing data, as well as enhance the model's robustness under fixed-point quantization for efficient deployment. Minimizing the training loss, however, provides few guarantees on the generalization and quantization performance. In this work, we fulfill the need of improving generalization and quantization performance simultaneously by theoretically unifying them under the framework of improving the model's robustness against bounded weight perturbation and minimizing the eigenvalues of the Hessian matrix with respect to model weights. We therefore propose HERO, a Hessian-enhanced robust optimization method, to minimize the Hessian eigenvalues through a gradient-based training process, simultaneously improving the generalization and quantization performance. HERO enables up to a 3.8% gain on test accuracy, up to 30% higher accuracy under 80% training label perturbation, and the best post-training quantization accuracy across a wide range of precision, including a > 10% accuracy improvement over SGD-trained models for common model architectures on various datasets.
Full Text Available

Cascading structured pruning: enabling high data reuse for sparse DNN accelerators

https://doi.org/10.1145/3470496.3527419

Hanson, Edward; Li, Shiyu; Li, Hai 'Helen'; Chen, Yiran (June 2022, International Symposium on Computer Architecture (ISCA))

Full Text Available

Processing-in-Memory Technology for Machine Learning: From Basic to ASIC

https://doi.org/10.1109/TCSII.2022.3168404

Taylor, Brady; Zheng, Qilin; Li, Ziru; Li, Shiyu; Chen, Yiran (June 2022, IEEE Transactions on Circuits and Systems II: Express Briefs)

Full Text Available

Harnessing optoelectronic noises in a photonic generative network

https://doi.org/10.1126/sciadv.abm2956

Wu, Changming; Yang, Xiaoxuan; Yu, Heshan; Peng, Ruoming; Takeuchi, Ichiro; Chen, Yiran; Li, Mo (January 2022, Science Advances)

A photonic generative adversarial network that harnesses optoelectronic noises to generate handwritten numbers is demonstrated.
Full Text Available
