Search for: All records
Total Resources: 3
Author / Contributor:
- Horesh, Raya (3)
- Abreu_de_Paula, Rogério (1)
- Azmat, Muneeza (1)
- Calmon, Flavio P. (1)
- Glicksberg, Benjamin S. (1)
- Han, Barbara A. (1)
- Kumar, Abhishek (1)
- Li, Junyi Jessy (1)
- Li, Ryan (1)
- Majumdar, Subhabrata (1)
- Mojsilović, Aleksandra (1)
- Perer, Adam (1)
- Shi, Weiyan (1)
- Varshney, Kush R. (1)
- Wei, Dennis (1)
- Yang, Diyi (1)
- Yurochkin, Mikhail (1)
- Zhan, Hongli (1)
- Zhang, Yutong (1)
- Ziems, Caleb (1)
Aligning large language models to integrate and reflect human values, especially for tasks that demand intricate human oversight, is arduous because depending on human expertise for context-specific guidance is resource-intensive and time-consuming. Prior work has used predefined sets of rules or principles to steer model behavior (Bai et al., 2022; Sun et al., 2023). However, these principles tend to be generic, which makes it hard to adapt them to each individual input query or context. In this work, we present Situated-PRInciples (SPRI), a framework requiring minimal or no human effort that automatically generates guiding principles in real time for each input query and uses them to align each response. We evaluate SPRI on three tasks and show that 1) SPRI can derive principles in a complex domain-specific task that lead to performance on par with expert-crafted ones; 2) SPRI-generated principles lead to instance-specific rubrics that outperform prior LLM-as-a-judge frameworks; 3) using SPRI to generate synthetic SFT data leads to substantial improvement in truthfulness.
Free, publicly accessible full text available July 13, 2026
-
Shi, Weiyan; Li, Ryan; Zhang, Yutong; Ziems, Caleb; Horesh, Raya; Abreu_de_Paula, Rogério; Yang, Diyi (Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing)
-
Han, Barbara A.; Majumdar, Subhabrata; Calmon, Flavio P.; Glicksberg, Benjamin S.; Horesh, Raya; Kumar, Abhishek; Perer, Adam; von Marschall, Elisa B.; Wei, Dennis; Mojsilović, Aleksandra; et al. (Epidemics)