NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Bridging Human and LLM Judgments: Understanding and Narrowing the Gap

Polo, Felipe Maia; Wang, Xinhe; Yurochkin, Mikhail; Xu, Gongjun; Banerjee, Moulinath; Sun, Yuekai (October 2025, NeurIPS 2025)

Free, publicly-accessible full text available October 31, 2026
SPRI: Aligning Large Language Models with Context-Situated Principles

Zhan, Hongli; Azmat, Muneeza; Horesh, Raya; Li, Junyi Jessy; Yurochkin, Mikhail (July 2025, Proceedings of the Forty-Second International Conference on Machine Learning (ICML 2025))

Aligning Large Language Models to integrate and reflect human values, especially for tasks that demand intricate human oversight, is arduous since it is resource-intensive and time-consuming to depend on human expertise for context-specific guidance. Prior work has utilized predefined sets of rules or principles to steer the behavior of models (Bai et al., 2022; Sun et al., 2023). However, these principles tend to be generic, making it challenging to adapt them to each individual input query or context. In this work, we present Situated-PRInciples (SPRI), a framework requiring minimal or no human effort that is designed to automatically generate guiding principles in real-time for each input query and utilize them to align each response. We evaluate SPRI on three tasks, and show that 1) SPRI can derive principles in a complex domain-specific task that leads to on-par performance as expert-crafted ones; 2) SPRI-generated principles lead to instance-specific rubrics that outperform prior LLM-as-a-judge frameworks; 3) using SPRI to generate synthetic SFT data leads to substantial improvement on truthfulness.
more » « less
Free, publicly-accessible full text available July 13, 2026
Aligners: Decoupling LLMs and Alignment

https://doi.org/10.18653/v1/2024.findings-emnlp.808

Ngweta, Lilian; Agarwal, Mayank; Maity, Subha; Gittens, Alex; Sun, Yuekai; Yurochkin, Mikhail (November 2024, Association for Computational Linguistics)

Full Text Available
tinyBenchmarks: evaluating LLMs with fewer examples

Maia_Polo, Felipe; Weber, Lucas; Choshen, Leshem; Sun, Yuekai; Xu, Gongjun; Yurochkin, Mikhail (July 2024, Proceedings of Machine Learning Research)

Full Text Available
tinyBenchmarks: evaluating LLMs with fewer examples

Maia_Polo, Felipe; Weber, Lucas; Choshen, Leshem; Sun, Yuekai; Xu, Gongjun; Yurochkin, Mikhail (July 2024, Proceedings of Machine Learning Research)

Full Text Available
An Investigation of Representation and Allocation Harms in Contrastive Learning

Maity, Subha; Agarwal, Mayank; Yurochkin, Mikhail; Sun, Yuekai (January 2024, ICLR)

Full Text Available
k-Mixup Regularization for Deep Learning via Optimal Transport

Greenewald, Kristjan; Gu, Anming; Yurochkin, Mikhail; Solomon, Justin; Chien, Edward (November 2023, Transactions on Machine Learning Research)

Mixup is a popular regularization technique for training deep neural networks that improves generalization and increases robustness to certain distribution shifts. It perturbs input training data in the direction of other randomly-chosen instances in the training set. To better leverage the structure of the data, we extend mixup in a simple, broadly applicable way to k-mixup, which perturbs k-batches of training points in the direction of other k-batches. The perturbation is done with displacement interpolation, i.e. interpolation under the Wasserstein metric. We demonstrate theoretically and in simulations that k-mixup preserves cluster and manifold structures, and we extend theory studying the efficacy of standard mixup to the k-mixup case. Our empirical results show that training with k-mixup further improves generalization and robustness across several network architectures and benchmark datasets of differing modalities. For the wide variety of real datasets considered, the performance gains of k-mixup over standard mixup are similar to or larger than the gains of mixup itself over standard ERM after hyperparameter optimization. In several instances, in fact, k-mixup achieves gains in settings where standard mixup has negligible to zero improvement over ERM.
more » « less
Full Text Available
Understanding new tasks through the lens of training data via exponential tilting

Maity, Subha; Yurochkin, Mikhail; Banerjee, Moulinath; Sun, Yuekai (February 2023, International Conference on Learning Representations)

Full Text Available
Learning Proximal Operators to Discover Multiple Optima

Li, Lingxiao; Aigerman, Noam; Kim, Vladimir; Li, Jiajin; Greenewald, Kristjan; Yurochkin, Mikhail; Solomon, Justin (May 2023, International Conference on Learning Representations)

Finding multiple solutions of non-convex optimization problems is a ubiquitous yet challenging task. Most past algorithms either apply single-solution optimization methods from multiple random initial guesses or search in the vicinity of found solutions using ad hoc heuristics. We present an end-to-end method to learn the proximal operator of a family of training problems so that multiple local minima can be quickly obtained from initial guesses by iterating the learned operator, emulating the proximal-point algorithm that has fast convergence. The learned proximal operator can be further generalized to recover multiple optima for unseen problems at test time, enabling applications such as object detection. The key ingredient in our formulation is a proximal regularization term, which elevates the convexity of our training loss: by applying recent theoretical results, we show that for weakly-convex objectives with Lipschitz gradients, training of the proximal operator converges globally with a practical degree of over-parameterization. We further present an exhaustive benchmark for multi-solution optimization to demonstrate the effectiveness of our method.
more » « less
Full Text Available
Sampling with Mollified Interaction Energy Descent

Li, Lingxiao; Liu, Qiang; Korba, Anna; Yurochkin, Mikhail; Solomon, Justin (February 2023, International Conference on Learning Representations)

Sampling from a target measure whose density is only known up to a normalization constant is a fundamental problem in computational statistics and machine learning. In this paper, we present a new optimization-based method for sampling called mollified interaction energy descent (MIED). MIED minimizes a new class of energies on probability measures called mollified interaction energies (MIEs). These energies rely on mollifier functions---smooth approximations of the Dirac delta originated from PDE theory. We show that as the mollifier approaches the Dirac delta, the MIE converges to the chi-square divergence with respect to the target measure and the gradient flow of the MIE agrees with that of the chi-square divergence. Optimizing this energy with proper discretization yields a practical first-order particle-based algorithm for sampling in both unconstrained and constrained domains. We show experimentally that for unconstrained sampling problems our algorithm performs on par with existing particle-based algorithms like SVGD, while for constrained sampling problems our method readily incorporates constrained optimization techniques to handle more flexible constraints with strong performance compared to alternatives.
more » « less
Full Text Available

« Prev Next »

Search for: All records