NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Lessons and Insights from a Unifying Study of Parameter-Efficient Fine-Tuning (PEFT) in Visual Recognition

Mai, Zheda; Zhang, Ping; Tu, Cheng-Hao; Chen, Hong-You; Zhang, Li; Chao, Wei-Lun (June 2025, IEEE)

Free, publicly-accessible full text available June 15, 2026
Fine-Tuning is Fine, if Calibrated

Mai, Zheda; Chowdhury, Arpita; Zhang, Ping; Tu, Cheng-Hao; Chen, Hong-You; Pahuja, Vardaan; Berger-Wolf, Tanya; Gao, Song; Stewart, Charles; Su, Yu; et al (December 2024, NeurIPS)

Full Text Available
Reviving the Context: Camera Trap Species Classification as Link Prediction on Multimodal Knowledge Graphs

Pahuja, Vardaan; Luo, Weidi; Gu, Yu; Tu, Cheng-Hao; Chen, Hong-You; Berger-Wolf, Tanya; Stewart, Charles; Gao, Song; Chao, Wei-Lun; Su, Yu (October 2024, ACM International Conference on Information and Knowledge Management (CIKM))

Full Text Available
Reviving the Context: Camera Trap Species Classification as Link Prediction on Multimodal Knowledge Graphs

https://doi.org/10.1145/3627673.3679545

Pahuja, Vardaan; Luo, Weidi; Gu, Yu; Tu, Cheng-Hao; Chen, Hong-You; Berger-Wolf, Tanya; Stewart, Charles; Gao, Song; Chao, Wei-Lun; Su, Yu (October 2024, ACM)

Full Text Available
Visual Query Tuning: Towards Effective Usage of Intermediate Representations for Parameter and Memory Efficient Transfer Learning

Tu, Cheng Hao; Mai, Zheda; Chao, Wei-Lun (July 2023, IEEE/CVF Conference on Computer Vision and Pattern Recognition)

Intermediate features of a pre-trained model have been shown informative for making accurate predictions on downstream tasks, even if the model backbone is kept frozen. The key challenge is how to utilize these intermediate fea- tures given their gigantic amount. We propose visual query tuning (VQT), a simple yet effective approach to aggregate intermediate features of Vision Transformers. Through in- troducing a handful of learnable “query” tokens to each layer, VQT leverages the inner workings of Transformers to “summarize” rich intermediate features of each layer, which can then be used to train the prediction heads of downstream tasks. As VQT keeps the intermediate features intact and only learns to combine them, it enjoys memory efficiency in training, compared to many other parameter- efficient fine-tuning approaches that learn to adapt features and need back-propagation through the entire backbone. This also suggests the complementary role between VQT and those approaches in transfer learning. Empirically, VQT consistently surpasses the state-of-the-art approach that utilizes intermediate features for transfer learning and outperforms full fine-tuning in many cases. Compared to parameter-efficient approaches that adapt features, VQT achieves much higher accuracy under memory constraints. Most importantly, VQT is compatible with these approaches to attain even higher accuracy, making it a simple add- on to further boost transfer learning. Code is available at https://github.com/andytu28/VQT .
more » « less
Full Text Available
Holistic Transfer: Towards Non-Disruptive Fine-Tuning with Partial Target Data

Tu, Cheng-Hao; Chen, Hong-You; Mai, Zheda; Zhong, Jike; Pahuja, Vardaan; Berger-Wolf, Tanya; Gao, Song; Stewart, Charles; Su, Yu; Chao, Wei-Lun (February 2024, NeurIPS)

Full Text Available
Visual Query Tuning: Towards Effective Usage of Intermediate Representations for Parameter and Memory Efficient Transfer Learning

https://doi.org/10.1109/cvpr52729.2023.00746

Tu, Cheng-Hao; Mai, Zheda; Chao, Wei-Lun (June 2023, IEEE/CVF Conference on Computer Vision and Pattern Recognition)

Full Text Available
Learning Fractals by Gradient Descent

https://doi.org/10.1609/aaai.v37i2.25342

Tu, Cheng-Hao; Chen, Hong-You; Carlyn, David; Chao, Wei-Lun (June 2023, Proceedings of the AAAI Conference on Artificial Intelligence)

Fractals are geometric shapes that can display complex and self-similar patterns found in nature (e.g., clouds and plants). Recent works in visual recognition have leveraged this property to create random fractal images for model pre-training. In this paper, we study the inverse problem --- given a target image (not necessarily a fractal), we aim to generate a fractal image that looks like it. We propose a novel approach that learns the parameters underlying a fractal image via gradient descent. We show that our approach can find fractal parameters of high visual quality and be compatible with different loss functions, opening up several potentials, e.g., learning fractals for downstream tasks, scientific understanding, etc.
more » « less
Full Text Available
On the Importance and Applicability of Pre-Training for Federated Learning.

Chen, Hong-You; Tu, Cheng-Hao; Li, Ziwei; Shen, Han-Wei; Chao, Wei-Lun (February 2023, ICLR 2023)

Full Text Available

Search for: All records