Search for: All records

Creators/Authors contains: "Molybog, Igor"


  1. Aligning large language models (LLMs) to human preferences is a crucial step in building helpful and safe AI tools, and it usually involves training on supervised datasets. Popular algorithms such as Direct Preference Optimization (DPO) rely on pairs of AI-generated responses ranked according to human annotation. This pair-annotation process can introduce human bias, and building a correct preference dataset is the costly part of the alignment pipeline. To improve annotation efficiency and quality in LLM alignment, we propose REAL: Response Embedding-based Alignment for LLMs, a strategy for constructing a high-quality training dataset that focuses on acquiring the less ambiguous preference pairs for labeling out of a set of response candidates. Our selection process is based on the similarity of response embeddings, computed independently of the prompts, which keeps the selection in an off-policy setting and avoids adaptively measuring similarity during training. Experimental results on the real-world SHP2 dataset and the synthetic HH-RLHF benchmark indicate that choosing dissimilar response pairs enhances the direct alignment of LLMs while reducing inherited labeling errors. The model aligned with dissimilar response pairs obtained a better margin and win rate on the dialogue task. Our findings suggest that focusing on distinct pairs can reduce label error and improve LLM alignment efficiency, saving up to 65% of annotators' work. The code for this work can be found at https://github.com/honggen-zhang/REAL-Alignment. (A minimal sketch of the pair-selection idea appears below.)
    Free, publicly-accessible full text available July 16, 2026
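To illustrate the selection step the abstract describes, here is a minimal sketch in Python. It assumes the candidate responses have already been embedded (with some off-the-shelf text encoder, fixed before training) into a NumPy array; the function name and the "pick the least similar pair" criterion are illustrative assumptions, not the paper's exact REAL procedure.

import numpy as np

def select_dissimilar_pair(embeddings: np.ndarray) -> tuple[int, int]:
    # embeddings: (n_candidates, dim) array of response embeddings,
    # computed independently of the prompt (off-policy: fixed before training).
    # Normalize rows so the dot product equals cosine similarity.
    norms = np.linalg.norm(embeddings, axis=1, keepdims=True)
    unit = embeddings / np.clip(norms, 1e-12, None)
    sim = unit @ unit.T
    # Mask self-similarity, then take the globally least similar pair;
    # per the abstract, dissimilar pairs are less ambiguous to annotate.
    np.fill_diagonal(sim, np.inf)
    i, j = np.unravel_index(np.argmin(sim), sim.shape)
    return int(i), int(j)

# Example: among 8 candidate responses, choose the pair to send to annotators.
rng = np.random.default_rng(0)
candidate_embeddings = rng.normal(size=(8, 384))  # 384-dim is an assumption
pair = select_dissimilar_pair(candidate_embeddings)

Because the embeddings are computed once before training, the selected pairs never depend on the model being aligned, which is what the abstract means by keeping the selection off-policy.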