Image registration is an essential task in medical image analysis. We propose two novel unsupervised diffeomorphic image registration networks that use deep Residual Networks (ResNets) as numerical approximations of the underlying continuous diffeomorphic setting governed by ordinary differential equations (ODEs), viewed as an Eulerian discretization scheme. Within the ODE-based parameterization of diffeomorphisms, we consider both stationary and non-stationary (time-varying) velocity fields as the driving velocities, which gives rise to our two proposed architectures for diffeomorphic registration. We also enforce Lipschitz continuity on the Residual Networks in both architectures to define the admissible Hilbert space of velocity fields as a Reproducing Kernel Hilbert Space (RKHS) and to regularize the smoothness of the velocity fields. We apply both registration networks to align and segment the OASIS brain MRI dataset. Experimental results demonstrate that our models are computationally efficient and achieve comparable registration accuracy with a smoother deformation field.
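The ResNet-as-discretization view above can be made concrete: composing residual blocks of the form x ← x + h·v(x) is exactly explicit Euler integration of the ODE dx/dt = v(x). A minimal sketch of that correspondence for a stationary velocity field (the `euler_flow` helper and the example field are illustrative assumptions, not the paper's actual registration network):

```python
import numpy as np

def euler_flow(v, x0, num_steps=8, T=1.0):
    """Integrate dx/dt = v(x) from time 0 to T with explicit Euler steps.

    Each step x <- x + (T / num_steps) * v(x) has the same form as a
    ResNet residual block, which is the correspondence the abstract uses:
    a stack of residual blocks is a time discretization of the flow.
    """
    h = T / num_steps
    x = np.asarray(x0, dtype=float)
    for _ in range(num_steps):
        x = x + h * v(x)  # one "residual block" = one Euler step
    return x

# Stationary velocity field v(x) = -x flows points toward the origin;
# the exact solution at time T is x0 * exp(-T), which the composed
# residual blocks approximate as the number of steps grows.
```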
Coarse-Grained Smoothness for Reinforcement Learning in Metric Spaces
Principled decision-making in continuous state-action spaces is impossible without some assumptions. A common approach is to assume Lipschitz continuity of the Q-function. We show that, unfortunately, this property fails to hold in many typical domains. We propose a new coarse-grained smoothness definition that generalizes the notion of Lipschitz continuity, is more widely applicable, and allows us to compute significantly tighter bounds on Q-functions, leading to improved learning. We provide a theoretical analysis of our new smoothness definition, and discuss its implications and impact on control and exploration in continuous domains.
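For context, the classical Lipschitz assumption that this paper generalizes yields a simple, computable bound: every observed sample caps the Q-value at any query point by Q_i + L·d(query, sample_i). A hypothetical sketch of that bound (the helper name and toy samples are illustrative only, not the paper's coarse-grained definition):

```python
import math

def lipschitz_q_bound(samples, query, L):
    """Tightest upper bound on Q(query) implied by L-Lipschitz continuity
    of the Q-function: the minimum over samples of Q_i + L * d(query, point_i).
    `samples` is a list of ((state-action tuple), Q-value) pairs."""
    return min(q + L * math.dist(query, pt) for pt, q in samples)

# Two observed state-action values; the Lipschitz assumption bounds Q
# at any unvisited point between them.
samples = [((0.0, 0.0), 1.0), ((1.0, 0.0), 0.5)]
```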
- PAR ID: 10404720
- Date Published:
- Journal Name: Proceedings of the 26th International Conference on Artificial Intelligence and Statistics
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
- A continuity structure of correlations among arms in a multi-armed bandit can significantly accelerate exploration and reduce regret, particularly when there are many arms. However, this structure is often latent in practice. To cope with the latent continuity, we consider a transfer learning setting in which an agent learns the structural information, parameterized by a Lipschitz constant and an embedding of the arms, from a sequence of past tasks and transfers it to a new one. We propose a simple but provably efficient algorithm that accurately estimates and fully exploits the Lipschitz continuity, with sample complexity in the previous tasks matching the lower bound up to asymptotic order. The proposed algorithm can estimate not only a latent Lipschitz constant given an embedding, but also a latent embedding, although the latter requires slightly more samples. Specifically, we analyze the efficiency of the proposed framework in two respects: (i) our regret bound on the new task is close to that of an oracle algorithm with full knowledge of the Lipschitz continuity, under mild assumptions; and (ii) the sample complexity of our estimator matches the information-theoretic fundamental limit. Our analysis reveals a set of useful insights into transfer learning for latent Lipschitz continuity. In a numerical evaluation on a real-world dataset of rate adaptation in a time-varying wireless channel, we corroborate the theoretical findings and show the superiority of the proposed framework over baselines.
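The core idea of estimating a latent Lipschitz constant from past tasks can be sketched as a max-empirical-slope plug-in estimator (a simplification that ignores reward noise; the function name is hypothetical and this is not the paper's provably efficient algorithm):

```python
import math

def estimate_lipschitz(embeddings, mean_rewards):
    """Plug-in estimate of a latent Lipschitz constant given arm
    embeddings: the largest empirical slope |mu_i - mu_j| / ||z_i - z_j||
    over all pairs of arms observed in past tasks."""
    L = 0.0
    for i in range(len(embeddings)):
        for j in range(i + 1, len(embeddings)):
            gap = math.dist(embeddings[i], embeddings[j])
            if gap > 0:  # skip duplicate embeddings
                L = max(L, abs(mean_rewards[i] - mean_rewards[j]) / gap)
    return L
```

A larger estimate means the mean-reward function can change faster across the embedding space, so less information transfers between nearby arms.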
- Building on recent work by Geshkovski et al. (2023), which provides an interacting-particle-system interpretation of Transformers with a continuous-time evolution, we study the controllability properties of the corresponding continuity equation over curves in probability space. In particular, we treat the parameters of the Transformer's continuous-time evolution as control inputs. We prove that, given an absolutely continuous probability measure and a non-local Lipschitz velocity field satisfying a continuity equation, there exist control inputs such that the measure and the non-local velocity field of the Transformer's continuous-time evolution approximate them in the p-Wasserstein and Lp senses, respectively, where 1 ≤ p < ∞.
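A stripped-down instance of the interacting-particle dynamics in question, Euler-discretized, looks like self-attention applied repeatedly (this minimal form, with no query, key, or value matrices and a hypothetical step size, is an illustration rather than the construction analyzed in the paper):

```python
import numpy as np

def attention_step(X, h=0.05, beta=1.0):
    """One explicit Euler step of the interacting-particle ODE
    dx_i/dt = sum_j softmax_j(beta * <x_i, x_j>) x_j,
    where the rows of X are the particles (tokens)."""
    logits = beta * (X @ X.T)                     # pairwise similarities
    logits -= logits.max(axis=1, keepdims=True)   # softmax stabilization
    W = np.exp(logits)
    W /= W.sum(axis=1, keepdims=True)             # row-stochastic attention
    return X + h * (W @ X)                        # Euler update of each particle
```

If all particles coincide, each attention row is uniform and the common point simply scales by (1 + h) per step, a degenerate fixed direction of the dynamics.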
- In the Hidden-Parameter MDP (HiP-MDP) framework, a family of reinforcement learning tasks is generated by varying hidden parameters that specify the dynamics and reward function of each individual task. The HiP-MDP is a natural model for families of tasks in which meta- and lifelong-reinforcement-learning approaches can succeed. Given a learned context encoder that infers the hidden parameters from previous experience, most existing algorithms fall into two categories, model transfer and policy transfer, depending on which function the hidden parameters are used to parameterize. We characterize the robustness of model- and policy-transfer algorithms with respect to hidden-parameter estimation error. We first show that the value function of a HiP-MDP is Lipschitz continuous under certain conditions. We then derive regret bounds for both settings through the lens of Lipschitz continuity. Finally, we empirically corroborate our theoretical analysis by varying the hyperparameters governing the Lipschitz constants of two continuous control problems; the resulting performance is consistent with our theoretical results.
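The role Lipschitz continuity plays here can be seen in a toy one-state HiP-MDP whose per-step reward equals the hidden parameter (a hypothetical illustration of why estimation error translates into bounded value error, not the paper's setting or conditions):

```python
def hip_mdp_value(theta, gamma=0.9):
    """Discounted value of a one-state MDP whose per-step reward is the
    hidden parameter theta: V(theta) = theta / (1 - gamma). V is therefore
    Lipschitz in theta with constant 1 / (1 - gamma)."""
    return theta / (1.0 - gamma)

# A hidden-parameter estimation error eps perturbs the value by at most
# eps / (1 - gamma), so regret from a misestimated context is controlled
# by the Lipschitz constant of the value function.
```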
- Approximate message passing (AMP) is a scalable, iterative approach to signal recovery. For structured random measurement ensembles, including independent and identically distributed (i.i.d.) Gaussian and rotationally invariant matrices, the performance of AMP can be characterized by a scalar recursion called state evolution (SE). Pseudo-Lipschitz (polynomial) smoothness is conventionally assumed. In this work, we extend the SE for AMP to a new class of measurement matrices with independent (but not necessarily identically distributed) entries. We also extend it to a general class of functions, called controlled functions, which are not constrained by polynomial smoothness: unlike pseudo-Lipschitz functions, controlled functions may grow exponentially. The lack of structure in the assumed measurement ensembles is addressed by a Lindeberg–Feller-type argument, and the lack of smoothness of the controlled functions is addressed by a proposed conditioning technique that leverages the empirical statistics of the AMP iterates. Together, these results extend the applicability of SE to a broader class of measurement ensembles and a new class of functions.
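For reference, a minimal AMP iteration with the conventional soft-threshold denoiser, including the Onsager correction term that makes state evolution apply (a textbook-style sketch with a fixed threshold, not the paper's extended setting; the soft threshold is pseudo-Lipschitz, i.e. within the smoothness class the abstract goes beyond):

```python
import numpy as np

def amp(A, y, num_iters=60, theta=0.1):
    """Basic AMP for y = A x with a soft-threshold denoiser.

    Iterates: denoise the pseudo-data x + A^T z, then form the residual
    with the Onsager correction b * z, where b is the empirical average
    derivative of the denoiser (fraction of surviving entries over m).
    """
    m, n = A.shape
    soft = lambda u, t: np.sign(u) * np.maximum(np.abs(u) - t, 0.0)
    x, z = np.zeros(n), y.astype(float).copy()
    for _ in range(num_iters):
        x_new = soft(x + A.T @ z, theta)     # denoise pseudo-data
        b = np.count_nonzero(x_new) / m      # Onsager coefficient
        z = y - A @ x_new + b * z            # corrected residual
        x = x_new
    return x
```

With an identity measurement matrix the iteration converges to the sparse signal up to the usual small soft-threshold bias, which makes the fixed point easy to check by hand.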