NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Scalable and Robust Tensor Ring Decomposition for Large-Scale Data With Missing Data and Outliers

https://doi.org/10.1109/TCSVT.2024.3514614

He, Yicong; Atia, George K (May 2025, IEEE Transactions on Circuits and Systems for Video Technology)

Free, publicly-accessible full text available May 1, 2026
Model-Free Offline Reinforcement Learning with Enhanced Robustness

Zhang, Chi; Zain, Farhat U; Atia, George; Wang, Yue (April 2025, The Thirteenth International Conference on Learning Representations)

Free, publicly-accessible full text available April 24, 2026
Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment

https://doi.org/10.1609/aaai.v39i26.34979

Trivedi, Prashant; Chakraborty, Souradip; Reddy, Avinash; Aggarwal, Vaneet; Bedi, Amrit Singh; Atia, George K (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

The alignment of large language models (LLMs) with human values is critical as these models become increasingly integrated into various societal and decision-making processes. Traditional methods, such as reinforcement learning from human feedback (RLHF), achieve alignment by fine-tuning model parameters, but these approaches are often computationally expensive and impractical when models are frozen or inaccessible for parameter modification. In contrast, prompt optimization is a viable alternative to RLHF for LLM alignment. While the existing literature has shown empirical promise of prompt optimization, its theoretical underpinning remains under-explored. We address this gap by formulating prompt optimization as an optimization problem and try to provide theoretical insights into the optimality of such a framework. To analyze the performance of the prompt optimization, we study theoretical suboptimality bounds and provide insights in terms of how prompt optimization depends upon the given prompter and target model. We also provide empirical validation through experiments on various datasets, demonstrating that prompt optimization can effectively align LLMs, even when parameter fine-tuning is not feasible.
more » « less
Free, publicly-accessible full text available April 11, 2026
Explainable Adversarial Attacks on Coarse-to-Fine Classifiers

Heidarizadeh, Akram; Hatfield, Connor; Lazzarotto, Lorenzo; HanQin, Cai; Atia, George K (April 2025, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP))

Free, publicly-accessible full text available April 6, 2026
Hybrid Offline Passive Grammatical Inference and Online Planning for Non-Markovian Tasks

Alinejad, Mahyar; Velasquez, Alvaro; Wang, Yue; Atia, George (April 2025, ICASSP)

Free, publicly-accessible full text available April 6, 2026
SADA: Unsupervised Domain Adaptation for Reliable Scene Awareness

Maghsoumi, Hossein; Fallah, Yaser; Atia, George (March 2025, 59th Annual Conference on Information Science and Systems (CISS))

Free, publicly-accessible full text available March 19, 2026
Robust Low-Tubal-Rank Tensor Completion Based on Tensor Factorization and Maximum Correntopy Criterion

https://doi.org/10.1109/TNNLS.2023.3280086

He, Yicong; Atia, George K. (October 2024, IEEE Transactions on Neural Networks and Learning Systems)

Full Text Available
Distributionally Robust Domain Adaptation via Optimal Transport

https://doi.org/10.1109/MLSP58920.2024.10734802

Awad, Akram S; Aeron, Shuchin; Atia, George K (September 2024, IEEE)

Full Text Available
Heterogeneous Tensor Domain Adaptation

https://doi.org/10.1109/MLSP58920.2024.10734808

He, Yicong; Atia, George K (September 2024, IEEE)

Full Text Available
Robust Average-Reward Reinforcement Learning

https://doi.org/10.1613/jair.1.15451

Wang, Yue; Velasquez, Alvaro; Atia, George; Prater-Bennette, Ashley; Zou, Shaofeng (May 2024, Journal of Artificial Intelligence Research)

Robust Markov decision processes (MDPs) aim to find a policy that optimizes the worst-case performance over an uncertainty set of MDPs. Existing studies mostly have focused on the robust MDPs under the discounted reward criterion, leaving the ones under the average-reward criterion largely unexplored. In this paper, we develop the first comprehensive and systematic study of robust average-reward MDPs, where the goal is to optimize the long-term average performance under the worst case. Our contributions are four-folds: (1) we prove the uniform convergence of the robust discounted value function to the robust average-reward function as the discount factor γ goes to 1; (2) we derive the robust average-reward Bellman equation, characterize the structure of its solution set, and prove the equivalence between solving the robust Bellman equation and finding the optimal robust policy; (3) we design robust dynamic programming algorithms, and theoretically characterize their convergence to the optimal policy; and (4) we design two model-free algorithms unitizing the multi-level Monte-Carlo approach, and prove their asymptotic convergence
more » « less
Full Text Available

« Prev Next »

Search for: All records