NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

IHEval: Evaluating Language Models on Following the Instruction Hierarchy

Zhang, Zhihan; Li, Shiyang; Zhang, Zixuan; Liu, Xin; Jiang, Haoming; Tang, Xianfeng; Gao, Yifan; Li, Zheng; Wang, Haodong; Tan, Zhaoxuan; et al (April 2025, Association for Computational Linguistics)
Chiruzzo, Luis; Ritter, Alan; Wang, Lu (Ed.)
The instruction hierarchy, which establishes a priority order from system messages to user messages, conversation history, and tool outputs, is essential for ensuring consistent and safe behavior in language models (LMs). Despite its importance, this topic receives limited attention, and there is a lack of comprehensive benchmarks for evaluating models’ ability to follow the instruction hierarchy. We bridge this gap by introducing IHEval, a novel benchmark comprising 3,538 examples across nine tasks, covering cases where instructions in different priorities either align or conflict. Our evaluation of popular LMs highlights their struggle to recognize instruction priorities. All evaluated models experience a sharp performance decline when facing conflicting instructions, compared to their original instruction-following performance. Moreover, the most competitive open-source model only achieves 48% accuracy in resolving such conflicts. Our results underscore the need for targeted optimization in the future development of LMs.
more » « less
Free, publicly-accessible full text available April 27, 2026
Adaptive Preference Scaling for Reinforcement Learning with Human Feedback

Hong, Ilgee; Li, Zichong; Bukharin, Alexander; Li, Yixiao; Jiang, Haoming; Yang, Tianbao; Zhao, Tuo (December 2024, Conference on Neural Information Processing Systems)

Full Text Available
Robust Reinforcement Learning from Corrupted Human Feedback

Bukharin, Alexander; Hong, Ilgee; Jiang, Haoming; Li, Zichong; Zhang, Qingru; Zhang, Zixuan; Zhao, Tuo (December 2024, Conference on Neural Information Processing Systems)

Full Text Available
BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering

Wang, Haoyu; Li, Ruirui; Jiang, Haoming; Tian, Jinjin; Wang, Zhengyang; Luo, Chen; Tang, Xianfeng; Cheng, Monica Xiao; Zhao, Tuo; Gao, Jing (November 2024, Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing)

Full Text Available
BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering

https://doi.org/10.18653/v1/2024.emnlp-main.58

Wang, Haoyu; Li, Ruirui; Jiang, Haoming; Tian, Jinjin; Wang, Zhengyang; Luo, Chen; Tang, Xianfeng; Cheng, Monica Xiao; Zhao, Tuo; Gao, Jing (November 2024, Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing)

Full Text Available
LightToken: a Task and Model-agnostic Lightweight Token Embedding Framework for Pre-trained Language Models

Wang, Haoyu; Li, Ruirui; Jiang, Haoming; Wang, Zhengyang; Tang, Xianfeng; Bi, Bin; Cheng, Monica; Yin, Bing; Wang, Yaqing; Zhao, Tuo; et al (August 2024, KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining)

Full Text Available
Data Diversity Matters for Robust Instruction Tuning

https://doi.org/10.18653/v1/2024.findings-emnlp.195

Bukharin, Alexander; Li, Shiyang; Wang, Zhengyang; Yang, Jingfeng; Yin, Bing; Li, Xian; Zhang, Chao; Zhao, Tuo; Jiang, Haoming (January 2024, Association for Computational Linguistics)

Full Text Available
SMURF-THP: score matching-based uncertainty quantification for transformer Hawkes process

Li, Zichong; Xu, Yanbo; Zuo, Simiao; Jiang, Haoming; Zhang, Chao; Zhao, Tuo; Zha, Hongyuan (September 2023, International Conference on Machine Learning)

Full Text Available
LightToken: A Task and Model-agnostic Lightweight Token Embedding Framework for Pre-trained Language Models

Wang, Haoyu; Li, Ruirui; Jiang, Haoming; Wang, Zhengyang; Tang, Xianfeng; Bi, Bin; Cheng, Monica; Yin, Bing; Wang, Yaqing; Zhao, Tuo; et al (August 2023, KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining)

Full Text Available
Context-Aware Query Rewriting for Improving Users’ Search Experience on E-commerce Websites

https://doi.org/10.18653/v1/2023.acl-industry.59

Zuo, Simiao; Yin, Qingyu; Jiang, Haoming; Xi, Shaohui; Yin, Bing; Zhang, Chao; Zhao, Tuo (January 2023, Association for Computational Linguistics)

Full Text Available

« Prev Next »

Search for: All records