NSF PAR Search | NSF Public Access Repository

Contrastive quant: quantization makes stronger contrastive learning

https://doi.org/10.1145/3489517.3530419

Fu, Yonggan; Yu, Qixuan; Li, Meng; Ouyang, Xu; Chandra, Vikas; Lin, Yingyan (July 2022, DAC '22: Proceedings of the 59th ACM/IEEE Design Automation Conference)

Contrastive learning learns visual representations by enforcing feature consistency under different augmented views. In this work, we explore contrastive learning from a new perspective. Interestingly, we find that quantization, when properly engineered, can enhance the effectiveness of contrastive learning. To this end, we propose a novel contrastive learning framework, dubbed Contrastive Quant, to encourage feature consistency under both differently augmented inputs via various data transformations and differently augmented weights/activations via various quantization levels. Extensive experiments, built on top of two state-of-the-art contrastive learning methods, SimCLR and BYOL, show that Contrastive Quant consistently improves the learned visual representation.
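The idea of enforcing consistency across both input views and quantization levels can be illustrated with a minimal sketch. This is not the authors' implementation: the toy linear encoder, the uniform symmetric quantizer, the BYOL-style loss (2 minus twice the cosine similarity), and all function names below are assumptions chosen only to make the idea concrete.

```python
import math

def quantize(values, bits):
    """Uniform symmetric quantization of a list of floats (assumes bits >= 2)."""
    max_abs = max(abs(v) for v in values) or 1.0
    levels = 2 ** (bits - 1) - 1
    scale = max_abs / levels
    return [round(v / scale) * scale for v in values]

def encode(x, weights):
    """Toy linear encoder (hypothetical): one output feature per weight row."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]

def cosine_similarity(a, b):
    dot = sum(p * q for p, q in zip(a, b))
    norm = math.sqrt(sum(p * p for p in a)) * math.sqrt(sum(q * q for q in b))
    return dot / norm if norm else 0.0

def consistency_loss(view_a, view_b, weights, bits_a, bits_b):
    """Compare features of two input views, each encoded at its own precision."""
    weights_a = [quantize(row, bits_a) for row in weights]
    weights_b = [quantize(row, bits_b) for row in weights]
    features_a = encode(view_a, weights_a)
    features_b = encode(view_b, weights_b)
    # BYOL-style distance: 0 when the features align, 4 when they are opposite.
    return 2.0 - 2.0 * cosine_similarity(features_a, features_b)

weights = [[0.3, -0.7, 0.2], [0.9, 0.1, -0.4]]
view_a = [1.0, 2.0, 3.0]   # stand-in for one augmented input
view_b = [1.1, 1.9, 3.2]   # stand-in for a second augmentation of the same input
loss = consistency_loss(view_a, view_b, weights, bits_a=8, bits_b=4)
```

In a training loop, the two bit-widths would be resampled at each step so that the encoder sees the quantization level as one more source of augmentation.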
Full Text Available
Early-Bird GCNs: Graph-Network Co-optimization towards More Efficient GCN Training and Inference via Drawing Early-Bird Lottery Tickets

https://doi.org/10.1609/aaai.v36i8.20873

You, Haoran; Lu, Zhihan; Zhou, Zijian; Fu, Yonggan; Lin, Yingyan (June 2022, Proceedings of the AAAI Conference on Artificial Intelligence)

Graph Convolutional Networks (GCNs) have emerged as the state-of-the-art deep learning model for representation learning on graphs. However, it remains notoriously challenging to train and run inference on GCNs over large graph datasets, limiting their application to large real-world graphs and hindering the exploration of deeper and more sophisticated GCN graphs. This is because as the graph size grows, the sheer number of node features and the large adjacency matrix can easily explode the required memory and data movements. To tackle the aforementioned challenges, we explore the possibility of drawing lottery tickets when sparsifying GCN graphs, i.e., subgraphs that largely shrink the adjacency matrix yet are capable of achieving accuracy comparable to or even better than their full graphs. Specifically, we discover, for the first time, the existence of graph early-bird (GEB) tickets that emerge at the very early stage when sparsifying GCN graphs, and propose a simple yet effective detector to automatically identify the emergence of such GEB tickets. Furthermore, we advocate graph-model co-optimization and develop a generic efficient GCN early-bird training framework dubbed GEBT that can significantly boost the efficiency of GCN training by (1) drawing joint early-bird tickets between the GCN graphs and models and (2) enabling simultaneous sparsification of both the GCN graphs and models. Experiments on various GCN models and datasets consistently validate our GEB finding and the effectiveness of our GEBT, e.g., our GEBT achieves up to 80.2% ~ 85.6% and 84.6% ~ 87.5% savings of GCN training and inference costs while offering comparable or even better accuracy compared to state-of-the-art methods. Our source code and supplementary appendix are available at https://github.com/RICE-EIC/Early-Bird-GCN.
Full Text Available
EyeCoD: eye tracking system acceleration via flatcam-based algorithm & accelerator co-design

https://doi.org/10.1145/3470496.3527443

You, Haoran; Wan, Cheng; Zhao, Yang; Yu, Zhongzhi; Fu, Yonggan; Yuan, Jiayi; Wu, Shang; Zhang, Shunyao; Zhang, Yongan; Li, Chaojian; et al (June 2022, ISCA '22: Proceedings of the 49th Annual International Symposium on Computer Architecture)

Eye tracking has become an essential human-machine interaction modality for providing immersive experience in numerous virtual and augmented reality (VR/AR) applications desiring high throughput (e.g., 240 FPS), small form-factor, and enhanced visual privacy. However, existing eye tracking systems are still limited by their: (1) large form-factor largely due to the adopted bulky lens-based cameras; (2) high communication cost required between the camera and backend processor; and (3) potentially low visual privacy, thus prohibiting their more extensive applications. To this end, we propose, develop, and validate a lensless FlatCam-based eye tracking algorithm and accelerator co-design framework dubbed EyeCoD to enable eye tracking systems with a much reduced form-factor and boosted system efficiency without sacrificing the tracking accuracy, paving the way for next-generation eye tracking solutions. On the system level, we advocate the use of lensless FlatCams instead of lens-based cameras to facilitate the small form-factor need in mobile eye tracking systems, which also leaves room for a dedicated sensing-processor co-design to reduce the required camera-processor communication latency. On the algorithm level, EyeCoD integrates a predict-then-focus pipeline that first predicts the region-of-interest (ROI) via segmentation and then only focuses on the ROI parts to estimate gaze directions, greatly reducing redundant computations and data movements. On the hardware level, we further develop a dedicated accelerator that (1) integrates a novel workload orchestration between the aforementioned segmentation and gaze estimation models, (2) leverages intra-channel reuse opportunities for depth-wise layers, (3) utilizes input feature-wise partition to save activation memory size, and (4) develops a sequential-write-parallel-read input buffer to alleviate the bandwidth requirement for the activation global buffer.
On-silicon measurement and extensive experiments validate that our EyeCoD consistently reduces both the communication and computation costs, leading to an overall system speedup of 10.95×, 3.21×, and 12.85× over general computing platforms including CPUs and GPUs, and a prior-art eye tracking processor called CIS-GEP, respectively, while maintaining the tracking accuracy. Codes are available at https://github.com/RICE-EIC/EyeCoD.
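The predict-then-focus pipeline described above can be sketched in a few lines. This is an illustrative outline only, not the EyeCoD code: the `segment` and `estimate_gaze` callables are hypothetical stand-ins for the segmentation and gaze estimation models, and the bounding-box crop is an assumed way of selecting the ROI.

```python
def roi_from_mask(mask):
    """Return (top, left, bottom, right) bounding the nonzero mask pixels, or None."""
    if not mask:
        return None
    rows = [r for r, row in enumerate(mask) if any(row)]
    cols = [c for c in range(len(mask[0])) if any(row[c] for row in mask)]
    if not rows or not cols:
        return None
    return rows[0], cols[0], rows[-1] + 1, cols[-1] + 1

def crop(image, roi):
    """Keep only the pixels inside the ROI bounding box."""
    top, left, bottom, right = roi
    return [row[left:right] for row in image[top:bottom]]

def predict_then_focus(image, segment, estimate_gaze):
    """Predict the ROI first, then run gaze estimation on the ROI pixels only."""
    roi = roi_from_mask(segment(image))
    if roi is None:
        return None  # nothing detected, so the second stage is skipped
    return estimate_gaze(crop(image, roi))

# Hypothetical stand-ins for the two models, used only to exercise the pipeline.
def toy_segment(image):
    return [[1 if pixel > 0.5 else 0 for pixel in row] for row in image]

def toy_estimate_gaze(patch):
    return len(patch), len(patch[0])  # reports how many pixels stage two saw

image = [
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.9, 0.8, 0.0],
    [0.0, 0.7, 0.9, 0.0],
    [0.0, 0.0, 0.0, 0.0],
]
patch_shape = predict_then_focus(image, toy_segment, toy_estimate_gaze)
```

In this toy case the second stage processes a 2 x 2 patch instead of the full 4 x 4 frame, which is the source of the reduced computation and data movement that the abstract refers to.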
Full Text Available
InstantNet: Automated Generation and Deployment of Instantaneously Switchable-Precision Networks

Fu, Yonggan; Yu, Zhongzhi; Zhang, Yongan; Jiang, Yifan; Li, Chaojian; Liang, Yongyuan; Jiang, Mingchao; Wang, Zhangyang; Lin, Yingyan (January 2021, The Design Automation Conference)
Full Text Available
SACoD: Sensor Algorithm Co-Design Towards Efficient CNN-powered Intelligent PhlatCam

Fu, Yonggan; Zhang, Yang; Wang, Yue; Lu, Zhihan; Boominathan, Vivek; Veeraraghavan, Ashok; Lin, Yingyan (January 2021, IEEE/CVF International Conference on Computer Vision (ICCV 2021), 2021)
Full Text Available
2-in-1 Accelerator: Enabling Random Precision Switch for Winning Both Adversarial Robustness and Efficiency

Fu, Yonggan; Zhao, Yang; Yu, Qixuan; Li, Chaojian; Lin, Yingyan (January 2021, The 54th IEEE/ACM International Symposium on Microarchitecture (MICRO 2021), 2021)
Full Text Available
Dual Dynamic Inference: Enabling More Efficient, Adaptive and Controllable Deep Inference

https://doi.org/10.1109/JSTSP.2020.2979669

Wang, Yue; Shen, Jianghao; Hu, Ting-Kuei; Xu, Pengfei; Nguyen, Tan; Baraniuk, Richard G.; Wang, Zhangyang; Lin, Yingyan (March 2020, IEEE Journal of Selected Topics in Signal Processing)

Full Text Available
