NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Model merging with SVD to tie the Knots

Stoica, George; Ramesh, Pratik; Ecsedi, Boglarka; Choshen, Leshem; Hoffman, Judy (May 2025, International Conference in Learning Representations)

Free, publicly-accessible full text available May 24, 2026
NewModel Merging with SVD to tie the KnOTS

Stoica, George; Ramesh, Pratik; Ecsedi, Boglarka; Choshen, Leshem; Hoffman, Judy (April 2025, International Conference on Learning Representations (ICLR))

Free, publicly-accessible full text available April 24, 2026
SKYSCENES: A Synthetic Dataset for Aerial Scene Understanding

https://doi.org/10.1007/978-3-031-72986-7_2

Khose, Sahil; Pal, Anisha; Agarwal, Aayushi; Deepanshi; Hoffman, Judy; Chattopadhyay, Prithvijit (November 2024, Springer Nature Switzerland)

Full Text Available
We're Not Using Videos Effectively: An Updated Domain Adaptive Video Segmentation Baseline

Kareer, S; Vijaykumar, V; Maheshwari, H; Chattopadhyay, P; Hoffman, J; Prabhu, V (January 2024, Transactions on machine learning research)

Full Text Available
AUGCAL: Improving Sim2Real Adaptation by Uncertainty Calibration on Augmented Synthetic Images

Chattopadhyay, P; Goyal, B; Ecsedi, B; Prabhu, V; Hoffman, J (January 2024, International Conference on Learning Representations)

Synthetic data (SIM) drawn from simulators have emerged as a popular alternative for training models where acquiring annotated real-world images is difficult. However, transferring models trained on synthetic images to real-world applications can be challenging due to appearance disparities. A commonly employed solution to counter this SIM2REAL gap is unsupervised domain adaptation, where models are trained using labeled SIM data and unlabeled REAL data. Mispredictions made by such SIM2REAL adapted models are often associated with miscalibration - stemming from overconfident predictions on real data. In this paper, we introduce AUGCAL, a simple training-time patch for unsupervised adaptation that improves SIM2REAL adapted models by - (1) reducing overall miscalibration, (2) reducing overconfidence in incorrect predictions and (3) improving confidence score reliability by better guiding misclassification detection - all while retaining or improving SIM2REAL performance. Given a base SIM2REAL adaptation algorithm, at training time, AUGCAL involves replacing vanilla SIM images with strongly augmented views (AUG intervention) and additionally optimizing for a training time calibration loss on augmented SIM predictions (CAL intervention). We motivate AUGCAL using a brief analytical justification of how to reduce miscalibration on unlabeled REAL data. Through our experiments, we empirically show the efficacy of AUGCAL across multiple adaptation methods, backbones, tasks and shifts.
more » « less
ZipIt! Merging Models from Different Tasks without Training

Stoica, G; Bolya, D; Bjorner, J; Ramesh, P; Hearn, T; Hoffman, J (January 2024, International Conference on Learning Representations)

Typical deep visual recognition models are capable of performing the one task they were trained on. In this paper, we tackle the extremely difficult problem of combining distinct models with different initializations, each solving a separate task, into one multi-task model without any additional training. Prior work in model merging permutes one model to the space of the other then averages them together. While this works for models trained on the same task, we find that this fails to account for the differences in models trained on disjoint tasks. Thus, we introduce "ZipIt!", a general method for merging two arbitrary models of the same architecture that incorporates two simple strategies. First, in order to account for features that aren't shared between models, we expand the model merging problem to allow for merging features within each model by defining a general "zip" operation. Second, we add support for partially zipping the models up until a specified layer, naturally creating a multi-head model. We find that these two changes combined account for 20-60% improvement over prior work, making it more feasible to merge models trained on disjoint tasks without retraining.
more » « less
LANCE: stress-testing visual models by generating language-guided counterfactual images

Prabhu, V; Yenamandra, S; Chattopadhyay, P; Hoffman, J (December 2023, International Conference on Neural Information Processing Systems)

We propose an automated algorithm to stress-test a trained visual model by generating language-guided counterfactual test images (LANCE). Our method leverages recent progress in large language modeling and text-based image editing to augment an IID test set with a suite of diverse, realistic, and challenging test images without altering model weights. We benchmark the performance of a diverse set of pretrained models on our generated data and observe significant and consistent performance drops. We further analyze model sensitivity across different types of edits, and demonstrate its applicability at surfacing previously unknown class-level model biases in ImageNet.
more » « less
PASTA: Proportional Amplitude Spectrum Training Augmentation for Syn-to-Real Domain Generalization

https://doi.org/10.1109/ICCV51070.2023.01767

Chattopadhyay, P; Sarangmath, K; Vijaykumar, V; Hoffman, J (October 2023, International Conference on Computer Vision)

Search for: All records