Integrity constraints such as functional dependencies (FD) and multivalued dependencies (MVD)
are fundamental in database schema design. Likewise, probabilistic conditional independences (CI)
are crucial for reasoning about multivariate probability distributions. The implication problem
studies whether a set of constraints (antecedents) implies another constraint (consequent), and
has been investigated in both the database and the AI literature, under the assumption that all
constraints hold exactly. However, many applications today consider constraints that hold only
approximately. In this paper we define an approximate implication as a linear inequality between
the degree of satisfaction of the antecedents and consequent, and we study the relaxation problem:
when does an exact implication relax to an approximate implication? We use information theory
to define the degree of satisfaction, and prove several results. First, we show that any implication
from a set of data dependencies (MVDs+FDs) can be relaxed to a simple linear inequality with
a factor at most quadratic in the number of variables; when the consequent is an FD, the factor
can be reduced to 1. Second, we prove that there exists an implication between CIs that does not
admit any relaxation; however, we prove that every implication between CIs relaxes “in the limit”.
Finally, we show that the implication problem for differential constraints in market basket analysis
also admits a relaxation with a factor equal to 1. Our results recover, and sometimes extend, several
previously known results about the implication problem: implication of MVDs can be checked by
considering only 2-tuple relations, and the implication of differential constraints for frequent item
sets can be checked by considering only databases containing a single transaction.
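The relaxation framework can be made concrete in a short derivation (a sketch; the exact normalization and the quadratic bound are as stated in the abstract, while the symbols $h$, $\sigma_i$, $\tau$, $\lambda$ are notation chosen here for illustration):

```latex
% Degree of satisfaction via information measures (sketch):
% an FD X -> Y holds exactly iff H(Y | X) = 0, and an MVD / CI
% (Y independent of Z given X) holds exactly iff I(Y; Z | X) = 0.
%
% Exact implication: (sigma_1 = 0) and ... and (sigma_k = 0) implies tau = 0.
% Its approximate relaxation with factor lambda is the linear inequality
\[
  h(\tau) \;\le\; \lambda \sum_{i=1}^{k} h(\sigma_i),
\]
% where h(sigma) denotes H(Y | X) for an FD and I(Y; Z | X) for an MVD/CI,
% and the results above give lambda at most quadratic in the number of
% variables for MVDs+FDs, improving to lambda = 1 when tau is an FD.
```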
Model-Free Reinforcement Learning for Lexicographic Omega-Regular Objectives
We study the problem of finding optimal strategies in Markov decision processes with lexicographic ω-regular objectives, which are ordered collections of ordinary ω-regular objectives. The goal is to compute strategies that maximise the probability of satisfaction of the first ω-regular objective; subject to that, the strategy should also maximise the probability of satisfaction of the second ω-regular objective; then the third, and so forth. For instance, one may want to guarantee critical requirements first, functional ones second, and only then focus on the non-functional ones. We show how to harness classic off-the-shelf model-free reinforcement learning techniques to solve this problem and evaluate their performance on four case studies.
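The lexicographic ordering on satisfaction probabilities can be sketched as follows (the probability values, function name, and tolerance parameter are illustrative assumptions; the paper's contribution is computing such strategies model-free for ω-regular objectives, not this comparison itself):

```python
# Sketch: compare two strategies by their vectors of satisfaction
# probabilities, one entry per omega-regular objective, in priority order.
# The tolerance `eps` is an illustrative choice, not from the paper.

def lex_better(p, q, eps=1e-9):
    """Return True if probability vector p is lexicographically better
    than q: strictly higher probability on some objective, with
    (near-)equal probabilities on every higher-priority objective."""
    for a, b in zip(p, q):
        if a > b + eps:
            return True
        if b > a + eps:
            return False
    return False  # equal on every objective

# Example: strategy A sacrifices lower-priority objectives to win the
# first (critical) one, so it is lexicographically preferred.
a = [0.99, 0.70, 0.10]
b = [0.95, 0.90, 0.90]
print(lex_better(a, b))  # prints True: the critical objective dominates
```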
 Award ID(s):
 2009022
 NSF-PAR ID:
 10329426
 Editor(s):
 Huisman, M.; Păsăreanu, C.; Zhan, N.
 Date Published:
 Journal Name:
 Formal Methods (FM 2021)
 Page Range / eLocation ID:
 142-159
 Format(s):
 Medium: X
 Sponsoring Org:
 National Science Foundation
More Like this


For a graph G on n vertices, naively sampling the position of a random walk at time t requires work Ω(t). We desire local access algorithms supporting position_G(t) queries, which return the position of a random walk from some fixed start vertex s at time t, where the joint distribution of returned positions is 1/poly(n)-close to that of a uniformly random walk in ℓ1 distance. We first give an algorithm for local access to random walks on a given undirected d-regular graph with Õ(√n/(1−λ)) runtime per query, where λ is the second-largest eigenvalue of the random walk matrix of the graph in absolute value. Since random d-regular graphs G(n, d) are expanders with high probability, this gives an Õ(√n) algorithm for a graph drawn from G(n, d) whp, which improves on the naive method for small numbers of queries. We then prove that no algorithm with sub-constant error given probe access to an input d-regular graph can have runtime better than Ω(√n/log n) per query in expectation when the input graph is drawn from G(n, d), obtaining a nearly matching lower bound. We further show an Ω(n^{1/4}) runtime-per-query lower bound even with an oblivious adversary (i.e., when the query sequence is fixed in advance). We then show that for families of graphs with additional group-theoretic structure, dramatically better results can be achieved. We give local access to walks on small-degree abelian Cayley graphs, including cycles and hypercubes, with runtime polylog(n) per query. This also allows for efficient local access to walks on polylog-degree expanders. We show that our techniques apply to graphs with high degree by extending our results to graphs constructed using the tensor product (giving fast local access to walks on degree-n^ε graphs for any ε ∈ (0, 1]) and the Cartesian product.
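On a cycle the group structure makes the direct-sampling idea concrete: the position after t steps depends only on the number of clockwise steps, which is Binomial(t, 1/2), so position_G(t) can be drawn in one shot instead of simulating all t steps. The sketch below illustrates this on the n-cycle with function names of my own; the binomial draw is written as t coin flips for simplicity, whereas a true polylog(n)-time sampler would be needed for the paper's bound.

```python
import random

def naive_position(n, s, t, rng):
    """Simulate every step of a simple walk on the n-cycle: Omega(t) work."""
    pos = s
    for _ in range(t):
        pos = (pos + rng.choice((-1, 1))) % n
    return pos

def direct_position(n, s, t, rng):
    """Sample the same distribution via one binomial draw: with k
    clockwise steps out of t, the displacement is k - (t - k) = 2k - t.
    (The sum below is a stand-in for a fast Binomial(t, 1/2) sampler.)"""
    k = sum(rng.random() < 0.5 for _ in range(t))
    return (s + 2 * k - t) % n

rng = random.Random(0)
# Sanity check: both samplers respect the parity constraint of a
# non-lazy walk: after t steps the position differs from s by t mod 2.
for t in (1, 2, 7, 10):
    assert (naive_position(8, 0, t, rng) - t) % 2 == 0
    assert (direct_position(8, 0, t, rng) - t) % 2 == 0
```

The general abelian Cayley case in the paper is more involved (several generators, and accuracy guarantees in ℓ1 distance), but the collapse of a long step sequence into a small number of counts is the same phenomenon.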

Integrity constraints such as functional dependencies (FD) and multivalued dependencies (MVD) are fundamental in database schema design. Likewise, probabilistic conditional independences (CI) are crucial for reasoning about multivariate probability distributions. The implication problem studies whether a set of constraints (antecedents) implies another constraint (consequent), and has been investigated in both the database and the AI literature, under the assumption that all constraints hold exactly. However, many applications today consider constraints that hold only approximately. In this paper we define an approximate implication as a linear inequality between the degree of satisfaction of the antecedents and consequent, and we study the relaxation problem: when does an exact implication relax to an approximate implication? We use information theory to define the degree of satisfaction, and prove several results. First, we show that any implication from a set of data dependencies (MVDs+FDs) can be relaxed to a simple linear inequality with a factor at most quadratic in the number of variables; when the consequent is an FD, the factor can be reduced to 1. Second, we prove that there exists an implication between CIs that does not admit any relaxation; however, we prove that every implication between CIs relaxes "in the limit". Then, we show that the implication problem for differential constraints in market basket analysis also admits a relaxation with a factor equal to 1. Finally, we show how some of the results in the paper can be derived using the I-measure theory, which relates information-theoretic measures and set theory. Our results recover, and sometimes extend, previously known results about the implication problem: the implication of MVDs and FDs can be checked by considering only 2-tuple relations.

With the increasing adoption of smart home devices, users rely on device automation to control their homes. This automation commonly comes in the form of smart home routines, an abstraction available via major vendors. Yet, questions remain about how a system should best handle conflicts in which different routines access the same devices simultaneously. In particular, among the myriad ways a smart home system could handle conflicts, which of them are currently utilized by existing systems, and which ones result in the highest user satisfaction? We investigate the first question via a survey of existing literature and find a set of conditions, modifications, and system strategies related to handling conflicts. We answer the second question via a scenario-based Mechanical Turk survey of users interested in owning smart home devices and current smart home device owners (N=197). We find that: (i) there is no context-agnostic strategy that always results in high user satisfaction, and (ii) users' personal values frequently form the basis for shaping their expectations of how routines should execute.
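One concrete form of the conflicts studied, two routines commanding the same device to different states in overlapping time windows, can be sketched as follows (the `Routine` data model and all device names are my invention for illustration, not an API from the paper or any vendor):

```python
# Sketch: detect routines that command the same device, to different
# states, while their execution windows overlap.  The data shapes here
# are illustrative assumptions, not from the study.
from dataclasses import dataclass

@dataclass
class Routine:
    name: str
    actions: dict   # device name -> commanded state
    window: tuple   # (start, end) in minutes since midnight

def conflicts(r1, r2):
    """Return the devices both routines command to different states
    during overlapping execution windows."""
    overlap = r1.window[0] < r2.window[1] and r2.window[0] < r1.window[1]
    if not overlap:
        return []
    return [d for d in r1.actions
            if d in r2.actions and r1.actions[d] != r2.actions[d]]

night = Routine("night", {"thermostat": 65, "porch_light": "on"}, (1320, 1380))
movie = Routine("movie", {"thermostat": 72, "tv": "on"}, (1350, 1470))
print(conflicts(night, movie))  # prints ['thermostat']
```

Detection like this is only the starting point; the study's findings concern which resolution strategy (e.g. first-come-first-served, priority, user prompt) users actually find satisfactory once such a conflict is found.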

Election control considers the problem of an adversary who attempts to tamper with a voting process, in order to either ensure that their favored candidate wins (constructive control) or another candidate loses (destructive control). As online social networks have become significant sources of information for potential voters, a new tool in an attacker's arsenal is to effect control by harnessing social influence, for example, by spreading fake news and other forms of misinformation through online social media. We consider the computational problem of election control via social influence, studying the conditions under which finding good adversarial strategies is computationally feasible. We consider two objectives for the adversary in both the constructive and destructive control settings: probability and margin of victory (POV and MOV, respectively). We present several strong negative results, showing, for example, that the problem of maximizing POV is inapproximable for any constant factor. On the other hand, we present approximation algorithms which provide somewhat weaker approximation guarantees, such as bicriteria approximations for the POV objective and constant-factor approximations for MOV. Finally, we present mixed integer programming formulations for these problems. Experimental results show that our approximation algorithms often find near-optimal control strategies, indicating that election control through social influence is a salient threat to election integrity.
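The margin-of-victory objective can be illustrated on toy vote totals (in the paper the adversary optimizes this quantity, or the probability of victory, in expectation over the randomness of the influence cascade; the deterministic computation below and all names in it are illustrative only):

```python
# Sketch: margin of victory (MOV) for a favored candidate on toy vote
# totals.  A constructive adversary wants this maximized for their
# candidate; a destructive adversary wants it driven negative for a rival.

def margin_of_victory(votes, favored):
    """Favored candidate's votes minus the best rival's votes.
    Positive => favored wins by that margin; negative => trails by it."""
    best_rival = max(v for c, v in votes.items() if c != favored)
    return votes[favored] - best_rival

votes = {"A": 480, "B": 455, "C": 65}
print(margin_of_victory(votes, "A"))  # prints 25: A beats B by 25 votes
print(margin_of_victory(votes, "C"))  # prints -415: C trails the leader
```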