NSF PAR Search | NSF Public Access Repository

Investigating Vaccine Buyer’s Remorse

Stanley, Miles; KhudaBukhsh, Ashique (January 2026, AAAI 2026)

Free, publicly-accessible full text available January 20, 2027
Learning to Look: Cognitive Attention Alignment with Vision-Language Models

Yang, Ryan; Bhusal, Dipkamal; Rastogi, Nidhi (December 2025, 39th Conference on Neural Information Processing Systems (NeurIPS 2025) Workshop: First Workshop on CogInterp: Interpreting Cognition in Deep Learning Models.)

Convolutional Neural Networks (CNNs) frequently “cheat” by exploiting superficial correlations, raising concerns about whether they make predictions for the right reasons. Inspired by cognitive science, which highlights the role of attention in robust human perception, recent methods have sought to guide model attention using concept-based supervision and explanation regularization. However, these techniques depend on labor-intensive, expert-provided annotations, limiting their scalability. We propose a scalable framework that leverages vision-language models to automatically generate semantic attention maps using natural language prompts. By introducing an auxiliary loss that aligns CNN attention with these language-guided maps, our approach promotes more reliable and cognitively plausible decision-making without manual annotation. Experiments on challenging datasets, ColoredMNIST and DecoyMNIST, show that our method achieves state-of-the-art performance on ColorMNIST and remains competitive with annotation-heavy baselines on DecoyMNIST, demonstrating improved generalization, reduced shortcut reliance, and model attention that better reflects human intuition. Our code is available at https://github.com/ryanlyang/LearningToLook/.
Free, publicly-accessible full text available December 7, 2026
Towards Understanding Self-play for LLM Reasoning

Chae, Justin Yang; Alam, Md Tanvirul; Rastogi, Nidhi (December 2025, 39th Conference on Neural Information Processing Systems (NeurIPS 2025) Workshop: Math-AI.)

Recent advances in large language model (LLM) reasoning, led by reinforcement learning with verifiable rewards (RLVR), have inspired self-play post-training, where models improve by generating and solving their own problems. While self-play has shown strong in-domain and out-of-domain gains, the mechanisms behind these improvements remain poorly understood. In this work, we analyze the training dynamics of self-play through the lens of the Absolute Zero Reasoner, comparing it against RLVR and supervised fine-tuning (SFT). Our study examines parameter update sparsity, entropy dynamics of token distributions, and alternative proposer reward functions. We further connect these dynamics to reasoning performance using pass@k evaluations. Together, our findings clarify how self-play differs from other post-training strategies, highlight its inherent limitations, and point toward future directions for improving LLM math reasoning through self-play.
Free, publicly-accessible full text available December 6, 2026
