NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Advancing oil and gas emissions assessment through large language model data extraction

https://doi.org/10.1016/j.egyai.2025.100481

Chen, Zhenlin; Zhong, Roujia; Long, Wennan; Tang, Haoyu; Wang, Anjing; Liu, Zemin; Yang, Xuelin; Ren, Bo; Littlefield, James; Koyejo, Sanmi; et al (May 2025, Energy and AI)

Free, publicly-accessible full text available May 1, 2026
More than Marketing? On the Information Value of AI Benchmarks for Practitioners

https://doi.org/10.1145/3708359.3712152

Hardy, Amelia; Reuel, Anka; Jafari_Meimandi, Kiana; Soder, Lisa; Griffith, Allie; Asmar, Dylan M; Koyejo, Sanmi; Bernstein, Michael S; Kochenderfer, Mykel John (March 2025, International journal of studies on art and humanities)

Free, publicly-accessible full text available March 24, 2026
Trustworthy Machine Learning: From Data to Models

https://doi.org/10.1561/3300000043

Han, Bo; Yao, Jiangchao; Liu, Tongliang; Li, Bo; Koyejo, Sanmi; Liu, Feng (January 2025, Foundations and Trends® in Privacy and Security)

Full Text Available
Advancing science- and evidence-based AI policy

https://doi.org/10.1126/science.adu8449

Bommasani, Rishi; Arora, Sanjeev; Chayes, Jennifer; Choi, Yejin; Cuéllar, Mariano-Florentino; Fei-Fei, Li; Ho, Daniel E; Jurafsky, Dan; Koyejo, Sanmi; Lakkaraju, Hima; et al (July 2025, Science)

Policy must be informed by, but also facilitate the generation of, scientific evidence
more » « less
Free, publicly-accessible full text available July 31, 2026
Label Noise Robustness for Domain-Agnostic Fair Corrections via Nearest Neighbors Label Spreading

Stromberg, Nathan; Ayyagari, Rohan; Koyejo, Sanmi; Nock, Richard; Sankar, Lalitha (December 2024, Curran Associates, Inc. Advances in Neural Information Processing Systems)
Globerson, A; Mackey, L; Belgrave, D; Fan, A; Paquet, U; Tomczak, J; Zhang, C (Ed.)
Last-layer retraining methods have emerged as an efficient framework for correcting existing base models. Within this framework, several methods have been proposed to deal with correcting models for subgroup fairness with and without group membership information. Importantly, prior work has demonstrated that many methods are susceptible to noisy labels. To this end, we propose a drop-in correction for label noise in last-layer retraining, and demonstrate that it achieves state-ofthe-art worst-group accuracy for a broad range of symmetric label noise and across a wide variety of datasets exhibiting spurious correlations. Our proposed approach uses label spreading on a latent nearest neighbors graph and has minimal computational overhead compared to existing methods.
more » « less
Full Text Available
Robustness to Subpopulation Shift with Domain Label Noise via Regularized Annotation of Domains

Stromberg, Nathan; Ayyagari, Rohan; Welfert, Monica; Koyejo, Sanmi; Nock, Richard; Sankar, Lalitha (December 2024, Transactions on machine learning research)
NA (Ed.)
Existing methods for last layer retraining that aim to optimize worst-group accuracy (WGA) rely heavily on well-annotated groups in the training data. We show, both in theory and practice, that annotation-based data augmentations using either downsampling or upweighting for WGA are susceptible to domain annotation noise. The WGA gap is exacerbated in highnoise regimes for models trained with vanilla empirical risk minimization (ERM). To this end, we introduce Regularized Annotation of Domains (RAD) to train robust last layer classifiers without needing explicit domain annotations. Our results show that RAD is competitive with other recently proposed domain annotation-free techniques. Most importantly, RAD outperforms state-of-the-art annotation-reliant methods even with only 5% noise in the training data for several publicly available datasets.
more » « less
Full Text Available
Bridging gaps in automated acute myocardial infarction detection between high-income and low-income countries

https://doi.org/10.1371/journal.pgph.0003240

Chiou, Nicole; Koyejo, Sanmi; Ngaruiya, Christine (June 2024, PLOS Global Public Health)
Robinson, Julia (Ed.)
Full Text Available
Causally Inspired Regularization Enables Domain General Representations

Salaudeen, Olawale; Koyejo, Sanmi (May 2024, Proceedings of The 27th International Conference on Artificial Intelligence and Statistics)

Full Text Available
Rethinking machine unlearning for large language models

https://doi.org/10.1038/s42256-025-00985-0

Liu, Sijia; Yao, Yuanshun; Jia, Jinghan; Casper, Stephen; Baracaldo, Nathalie; Hase, Peter; Yao, Yuguang; Liu, Chris Yuhao; Xu, Xiaojun; Li, Hang; et al (February 2025, Nature Machine Intelligence)

Free, publicly-accessible full text available February 1, 2026
The Case for Globalizing Fairness: A Mixed Methods Study on Colonialism, AI, and Health in Africa

https://doi.org/10.1145/3689904.3694708

Asiedu, Mercy Nyamewaa; Dieng, Awa; Haykel, Iskandar; Rostamzadeh, Negar; Pfohl, Stephen; Nagpal, Chirag; Nagawa, Maria; Oppong, Abigail; Koyejo, Sanmi; Heller, Katherine (October 2024, ACM)

Full Text Available

« Prev Next »

Search for: All records