This content will become publicly available on June 4, 2026

Title: Efficient Duple Perturbation Robustness in Low-rank MDPs
The pursuit of robustness has recently been a popular topic in reinforcement learning (RL) research, yet existing methods generally suffer from computational issues that obstruct their real-world implementation. In this paper, we consider MDPs with low-rank structure, where the transition kernel can be written as a linear product of a feature map and factors. We introduce *duple perturbation* robustness, i.e., perturbation on both the feature map and the factors, via a novel characterization of (ξ, η)-ambiguity sets featuring computational efficiency. Our novel low-rank robust MDP formulation is compatible with the low-rank function-representation view, and therefore is naturally applicable to practical RL problems with large or even continuous state-action spaces. It also gives rise to a provably efficient and practical algorithm with a theoretical convergence-rate guarantee. Lastly, the robustness of the proposed approach is justified by numerical experiments, including classical control tasks with continuous state-action spaces.
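The factorized kernel described in the abstract can be illustrated with a small sketch. Everything below — dimensions, the sampling of the factors, and the norm-capped perturbation rule — is an assumption for illustration, not the paper's exact (ξ, η)-ambiguity-set definition.

```python
import numpy as np

# Toy low-rank transition kernel P(s' | s, a) = <phi(s, a), mu(s')>, with a
# "duple" perturbation applied to BOTH the feature map and the factors.
rng = np.random.default_rng(0)
d, n_sa, n_s = 3, 4, 5
phi = rng.dirichlet(np.ones(d), size=n_sa)   # feature rows on the simplex
mu = rng.dirichlet(np.ones(n_s), size=d)     # each factor row is a distribution
P = phi @ mu                                  # every row of P sums to 1

# Perturb the feature map within radius xi and the factors within radius eta
# (norm-capped Gaussian directions; a stand-in for the ambiguity set).
xi, eta = 0.05, 0.05
d_phi = rng.standard_normal(phi.shape)
d_mu = rng.standard_normal(mu.shape)
phi_pert = phi + xi * d_phi / np.linalg.norm(d_phi)
mu_pert = mu + eta * d_mu / np.linalg.norm(d_mu)
P_pert = phi_pert @ mu_pert                   # kernel under the perturbed pair
```

A robust planner would then evaluate a policy against the worst-case kernel over all such perturbed pairs rather than against the nominal `P` alone.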
Award ID(s):
2401391 2403240
PAR ID:
10600595
Author(s) / Creator(s):
; ; ;
Editor(s):
Ozay, Necmiye; Balzano, Laura; Panagou, Dimitra; Abate, Alessandro
Publisher / Repository:
PMLR
Date Published:
Volume:
283
Page Range / eLocation ID:
723-737
Subject(s) / Keyword(s):
Spectral Representation reinforcement learning duple perturbation robustness low-rank MDPs
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Tamim Asfour (Ed.)
    A reinforcement learning (RL) control policy can fail in a new or perturbed environment that differs from the training environment, due to the presence of dynamic variations. For controlling systems with continuous state and action spaces, we propose an add-on approach to robustifying a pre-trained RL policy by augmenting it with an L1 adaptive controller (L1AC). Leveraging the capability of an L1AC for fast estimation and active compensation of dynamic variations, the proposed approach can improve the robustness of an RL policy that is trained, either in a simulator or in the real world, without consideration of a broad class of dynamic variations. Numerical and real-world experiments empirically demonstrate the efficacy of the proposed approach in robustifying RL policies trained using both model-free and model-based methods.
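The add-on structure above — baseline policy plus an adaptive term that cancels an estimated disturbance — can be sketched on a scalar linear system. The dynamics, gains, and disturbance here are invented for illustration; a real L1 adaptive controller also includes a low-pass filter and careful gain design.

```python
# A fixed "RL" policy augmented with an adaptive disturbance-cancelling term.
a, b, dt = 0.9, 1.0, 0.1            # known linear dynamics x' = a*x + b*u

def rl_policy(x):                   # stand-in for a pre-trained RL policy
    return -0.5 * x

x, x_hat, sigma_hat = 1.0, 1.0, 0.0
for _ in range(200):
    u = rl_policy(x) - sigma_hat    # add-on: cancel the estimated disturbance
    d = 0.3                         # unknown constant input disturbance
    x_next = a * x + b * (u + d)
    x_hat_next = a * x_hat + b * (u + sigma_hat)   # state predictor
    sigma_hat += 5.0 * (x_next - x_hat_next) * dt  # adapt on prediction error
    x, x_hat = x_next, x_hat_next
```

The estimate `sigma_hat` converges toward the true disturbance, so the baseline policy sees approximately the nominal dynamics it was trained on.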
  2. Many reinforcement learning (RL) applications have combinatorial action spaces, where each action is a composition of sub-actions. A standard RL approach ignores this inherent factorization structure, potentially failing to make meaningful inferences about rarely observed sub-action combinations; this is particularly problematic in offline settings, where data may be limited. In this work, we propose a form of linear Q-function decomposition induced by factored action spaces. We study the theoretical properties of our approach, identifying scenarios where it is guaranteed to lead to zero bias when used to approximate the Q-function. Outside the regimes with theoretical guarantees, we show that our approach can still be useful because it leads to better sample efficiency without necessarily sacrificing policy optimality, allowing us to achieve a better bias-variance trade-off. Across several offline RL problems using simulators and real-world datasets motivated by healthcare, we demonstrate that incorporating factored action spaces into value-based RL can result in better-performing policies. Our approach can help an agent make more accurate inferences within underexplored regions of the state-action space when applying RL to observational datasets.
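The decomposition idea above can be sketched in a few lines: approximate Q(s, (a1, a2)) as Q1(s, a1) + Q2(s, a2), so a combination never observed jointly still inherits an estimate from its sub-actions. The update rule, targets, and table sizes below are illustrative, not the paper's.

```python
import numpy as np

# Per-sub-action Q tables for a composite action a = (a1, a2).
n_s, n_a1, n_a2 = 1, 2, 2
q1 = np.zeros((n_s, n_a1))
q2 = np.zeros((n_s, n_a2))
alpha = 0.5

def update(s, a1, a2, target):
    # split the TD-style error equally between the two sub-action tables
    err = target - (q1[s, a1] + q2[s, a2])
    q1[s, a1] += alpha * err / 2
    q2[s, a2] += alpha * err / 2

# Only the combinations (0, 0) and (1, 1) are ever observed.
for _ in range(100):
    update(0, 0, 0, target=1.0)
    update(0, 1, 1, target=3.0)

q_unseen = q1[0, 0] + q2[0, 1]   # estimate for the never-observed combo (0, 1)
```

Under this toy linear structure the unseen combination gets a finite, data-driven estimate (here 2.0), whereas a flat Q table over joint actions would have no information about it at all.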
  3. Analog circuit optimization and design present a unique set of challenges in the IC design process. Many applications require the designer to optimize for multiple competing objectives, which poses a crucial challenge. Motivated by these practical aspects, we propose a novel method to tackle multi-objective optimization for analog circuit design in continuous action spaces. In particular, we propose to: (i) extrapolate current techniques in multi-objective reinforcement learning (MORL) to continuous state and action spaces; and (ii) provide a dynamically tunable trained model that queries user-defined preferences for multi-objective optimization in the analog circuit design context.
  4. Recent advances in on-policy reinforcement learning (RL) methods have enabled learning agents in virtual environments to master complex tasks with high-dimensional and continuous observation and action spaces. However, leveraging this family of algorithms in multi-fingered robotic grasping remains a challenge due to large sim-to-real fidelity gaps and the high sample complexity of on-policy RL algorithms. This work aims to bridge these gaps by first using RL to learn, in simulation, a multi-fingered robotic grasping policy that operates in the pixel space of the input: a single depth image. Using a mapping from pixel space to Cartesian space according to the depth map, this method transfers to the real world with high fidelity and introduces a novel attention mechanism that substantially improves grasp success rate in cluttered environments. Finally, the direct-generative nature of this method allows learning of multi-fingered grasps that have flexible end-effector positions, orientations and rotations, as well as all degrees of freedom of the hand.
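The pixel-to-Cartesian mapping mentioned above can be sketched with the standard pinhole camera model: a depth pixel (u, v) with depth z deprojects to a 3-D point in the camera frame. The intrinsics below are made-up example values, not parameters from the paper.

```python
import numpy as np

# Pinhole intrinsics: focal lengths (fx, fy) and principal point (cx, cy).
fx, fy, cx, cy = 600.0, 600.0, 320.0, 240.0

def deproject(u, v, z):
    # back-project a depth pixel to a 3-D camera-frame point
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    return np.array([x, y, z])

p = deproject(320.0, 240.0, 0.5)   # the principal point lies on the optical axis
```

Applying this per pixel to a depth image yields the point cloud over which a grasp pose in pixel space can be converted to a Cartesian target.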
  5.
    We present an efficient algorithm for model-free episodic reinforcement learning on large (potentially continuous) state-action spaces. Our algorithm is based on a novel Q-learning policy with adaptive data-driven discretization. The central idea is to maintain a finer partition of the state-action space in regions that are frequently visited in historical trajectories and have higher payoff estimates. We demonstrate how our adaptive partitions take advantage of the shape of the optimal Q-function and the joint space, without sacrificing worst-case performance. In particular, we recover the regret guarantees of prior algorithms for continuous state-action spaces, which additionally require either an optimal discretization as input or access to a simulation oracle. Moreover, experiments demonstrate how our algorithm automatically adapts to the underlying structure of the problem, resulting in much better performance compared both to heuristics and to Q-learning with uniform discretization.