Title: State-Based Recurrent SPMNs for Decision-Theoretic Planning under Partial Observability
The sum-product network (SPN) has been extended to model sequence data with the recurrent SPN (RSPN), and to decision-making problems with the sum-product-max network (SPMN). In this paper, we build on the concepts introduced by these extensions and present state-based recurrent SPMNs (S-RSPMNs) as a generalization of SPMNs to sequential decision-making problems where the state may not be perfectly observed. As with recurrent SPNs, S-RSPMNs utilize a repeatable template network to model sequences of arbitrary length. We present an algorithm for learning compact template structures by identifying unique belief states and the transitions between them through a state-matching process that utilizes augmented data. To our knowledge, this is the first data-driven approach that learns efficiently solvable graphical models for planning under partial observability. S-RSPMNs retain the linear solution complexity of SPMNs, and we demonstrate significant improvements in compactness of representation and in the run time of structure learning and inference in sequential domains.
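Because S-RSPMNs retain the linear solution complexity of SPMNs, computing the maximum expected utility (MEU) amounts to a single bottom-up pass over the (unrolled) network. The following is a minimal illustrative sketch in Python; the `Node` encoding and the `meu` routine are assumptions for exposition, not the authors' implementation.

```python
# Minimal sketch of bottom-up MEU evaluation over an SPMN-style network
# (illustrative, not the paper's code). Sum nodes marginalize chance
# variables, product nodes combine independent scopes, and max nodes
# select the best decision. One pass is linear in the number of edges.
from dataclasses import dataclass, field
from typing import List

@dataclass
class Node:
    kind: str                         # 'sum' | 'prod' | 'max' | 'leaf'
    children: List["Node"] = field(default_factory=list)
    weights: List[float] = field(default_factory=list)  # sum nodes only
    value: float = 0.0                # leaves: likelihood or utility

def meu(node: Node) -> float:
    if node.kind == "leaf":
        return node.value
    vals = [meu(c) for c in node.children]
    if node.kind == "sum":            # expectation over chance outcomes
        return sum(w * v for w, v in zip(node.weights, vals))
    if node.kind == "prod":           # independent sub-scopes multiply
        out = 1.0
        for v in vals:
            out *= v
        return out
    return max(vals)                  # 'max': optimal decision choice

# Toy network: choose the action with the higher expected utility.
net = Node("max", children=[
    Node("sum", weights=[0.7, 0.3],
         children=[Node("leaf", value=10.0), Node("leaf", value=-2.0)]),
    Node("sum", weights=[0.5, 0.5],
         children=[Node("leaf", value=4.0), Node("leaf", value=4.0)]),
])
print(meu(net))   # 6.4: the first action is optimal (0.7*10 + 0.3*-2)
```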
Award ID(s):
1815598
PAR ID:
10339357
Journal Name:
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence. Main Track.
Page Range / eLocation ID:
2526 to 2533
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1.
    Sum-product networks (SPNs) are knowledge compilation models related to other graphical models for efficient probabilistic inference, such as arithmetic circuits and AND/OR graphs. Recent investigations into generalizing SPNs have yielded sum-product-max networks (SPMNs), which offer a data-driven alternative for decision making, a task that has predominantly relied on handcrafted models. However, SPMNs are not suited for decision-theoretic planning, which involves sequential decision making over multiple time steps. In this paper, we present recurrent SPMNs (RSPMNs) that learn from and model decision-making data over time. RSPMNs utilize a template network that is unfolded as needed, depending on the length of the data sequence. This is significant: RSPMNs not only inherit the benefits of SPNs in being data-driven and mostly tractable, they are also well suited for planning problems. We establish soundness conditions on the template network that guarantee the resulting SPMN is valid, and we present a structure learning algorithm to learn a sound template. RSPMNs learned on a testbed of data sets, some generated using RDDLSim, yield MEUs and policies that are close to optimal on perfectly observed domains and easily improve on a recent batch-constrained RL method, which is important because RSPMNs offer a new model-based approach to offline RL.
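    The template mechanism above can be pictured as applying one parameterized step per time slice, threading interface values from step t-1 into step t. The sketch below captures only that control flow; the `template_step` interface is an assumed placeholder, not the paper's API.

```python
# Schematic unrolling of an RSPMN-style template (illustrative only).
# The same template parameters are reused at every time step, so the
# model handles sequences of any length and evaluation cost grows
# linearly with sequence length.
from typing import Callable, Sequence, Tuple

Interface = Tuple[float, ...]   # values passed between adjacent steps
Step = Callable[[Interface, object], Tuple[Interface, float]]

def unroll_and_evaluate(template_step: Step,
                        initial_interface: Interface,
                        sequence: Sequence[object]) -> float:
    """Apply the template once per sequence element, threading the
    interface values through time; return the final step's value."""
    interface, value = initial_interface, 0.0
    for item in sequence:
        interface, value = template_step(interface, item)
    return value
```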
  2. Probabilistic Sentential Decision Diagrams (PSDDs) provide efficient methods for modeling and reasoning with probability distributions in the presence of massive logical constraints. PSDDs can also be synthesized from graphical models such as Bayesian networks (BNs), thereby offering a new set of tools for performing inference on these models (in time linear in the PSDD size). Despite these favorable characteristics, we found multiple challenges in accelerating PSDDs on FPGAs, including limited parallelism, data dependencies, and small pipeline iterations. In this article, we propose several optimization techniques to address these issues with novel pipeline scheduling and parallelization schemes. We designed the PSDD kernel with a high-level synthesis (HLS) tool for ease of implementation and verified it on the Xilinx Alveo U250 board. Experimental results show that our methods improve on the baseline FPGA HLS implementation by 2,200X and on the multicore CPU implementation by 20X. The proposed design also outperforms state-of-the-art BN and sum-product network (SPN) accelerators that store the graph information in memory.
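    The data-dependency issue mentioned above is visible in the basic inference pattern itself: each circuit node reads values computed for earlier nodes, and these read-after-write chains serialize a naive pipeline. Below is a minimal Python sketch of bottom-up evaluation under an assumed node encoding (not the article's kernel).

```python
# Bottom-up evaluation of a PSDD/arithmetic-circuit-style structure in
# topological order (illustrative node encoding). Each node consumes
# results of earlier nodes; these dependencies are what limit naive
# FPGA pipelining.

def evaluate(nodes):
    """nodes: topologically sorted list of
         ('leaf', value)
         ('mul', i, j)            -- product of results i and j
         ('add', [(w, i), ...])   -- weighted sum of earlier results
    Returns the value of the last (root) node; time is linear in size."""
    vals = []
    for node in nodes:
        if node[0] == "leaf":
            vals.append(node[1])
        elif node[0] == "mul":
            vals.append(vals[node[1]] * vals[node[2]])
        else:
            vals.append(sum(w * vals[i] for w, i in node[1]))
    return vals[-1]

# Toy circuit: 0.6 * (0.9 * 0.5) + 0.4 * (0.2 * 0.5)
circuit = [("leaf", 0.9), ("leaf", 0.5), ("leaf", 0.2),
           ("mul", 0, 1), ("mul", 2, 1),
           ("add", [(0.6, 3), (0.4, 4)])]
print(evaluate(circuit))  # 0.31
```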
  3. This paper introduces a simple, efficient learning algorithm for general sequential decision making. The algorithm combines Optimism for exploration with Maximum Likelihood Estimation for model estimation, and is thus named OMLE. We prove that OMLE learns near-optimal policies for an enormously rich class of sequential decision-making problems in a polynomial number of samples. This class includes not only a majority of known tractable model-based reinforcement learning (RL) problems (such as tabular MDPs, factored MDPs, low-witness-rank problems, tabular weakly-revealing/observable POMDPs, and multi-step decodable POMDPs), but also many new, challenging RL problems, especially in the partially observable setting, that were not previously known to be tractable. Notably, the new problems addressed by this paper include (1) observable POMDPs with continuous observations and function approximation, where we achieve the first sample complexity that is completely independent of the size of the observation space; (2) well-conditioned low-rank sequential decision-making problems (also known as Predictive State Representations (PSRs)), which include and generalize all known tractable POMDP examples under a more intrinsic representation; and (3) general sequential decision-making problems under the SAIL condition, which unifies our existing understanding of model-based RL in both the fully observable and partially observable settings. The SAIL condition, identified in this paper, can be viewed as a natural generalization of Bellman/witness rank that addresses partial observability. This paper also presents a reward-free variant of the OMLE algorithm, which learns approximate dynamic models that enable the computation of near-optimal policies for all reward functions simultaneously.
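    The Optimism + MLE recipe is easy to state concretely in a toy setting. The sketch below instantiates it for Bernoulli bandits, a drastic simplification of the general sequential problems the paper covers; the grid-based confidence set and all names are illustrative assumptions, not the paper's algorithm verbatim.

```python
import math, random

def omle_bernoulli(arms_true, rounds=200, beta=2.0):
    """Toy OMLE on Bernoulli bandits. The 'model class' is a grid of
    candidate means; each round we form an MLE confidence set and act
    optimistically within it."""
    grid = [i / 10 for i in range(11)]            # candidate means
    counts = [[0, 0] for _ in arms_true]          # [failures, successes]
    eps = 1e-9
    for _ in range(rounds):
        def loglik(arm, p):
            f, s = counts[arm]
            return s * math.log(p + eps) + f * math.log(1 - p + eps)
        # MLE confidence set: means whose log-likelihood is within beta
        # of the best-fitting mean for that arm.
        plausible = [[p for p in grid
                      if loglik(a, p) >= max(loglik(a, q) for q in grid) - beta]
                     for a in range(len(arms_true))]
        # Optimism: credit each arm with its best plausible mean.
        arm = max(range(len(arms_true)), key=lambda a: max(plausible[a]))
        counts[arm][random.random() < arms_true[arm]] += 1
    return counts

print(omle_bernoulli([0.3, 0.7]))  # most pulls should go to arm 1
```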
  4.
    We study the problem of learning sequential decision-making policies in settings with multiple state-action representations. Such settings naturally arise in many domains, such as planning (e.g., multiple integer programming formulations) and various combinatorial optimization problems (e.g., those with both integer programming and graph-based formulations). Inspired by the classical co-training framework for classification, we study the problem of co-training for policy learning. We present sufficient conditions under which learning from two views can improve upon learning from a single view alone. Motivated by these theoretical insights, we present a meta-algorithm for co-training for sequential decision making. Our framework is compatible with both reinforcement learning and imitation learning. We validate the effectiveness of our approach across a wide range of tasks, including discrete/continuous control and combinatorial optimization. 
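    The meta-algorithm's core loop can be sketched generically: each round, the policy that currently scores better on a validation signal labels the shared states through its own view, and the weaker policy retrains on the matching representations in its view. The interface below is an assumption for illustration; the paper's meta-algorithm and its sufficient conditions are more involved.

```python
# Generic co-training loop for two policies over two state-action views
# (illustrative interface, not the paper's meta-algorithm verbatim).
from typing import Callable, List, Tuple

Policy = object
State = object
Action = object

def co_train_policies(
    train: Callable[[Policy, List[Tuple[State, Action]]], Policy],
    act: Callable[[Policy, State], Action],
    score: Callable[[Policy], float],           # validation signal
    paired_states: List[Tuple[State, State]],   # (view-A, view-B) pairs
    policies: Tuple[Policy, Policy],
    rounds: int = 10,
) -> Tuple[Policy, Policy]:
    """Each round, the higher-scoring policy labels the shared states
    via its own view; the other policy retrains on those states'
    representations in its view (imitation-style pseudo-labels)."""
    pa, pb = policies
    for _ in range(rounds):
        if score(pa) >= score(pb):
            pb = train(pb, [(sb, act(pa, sa)) for sa, sb in paired_states])
        else:
            pa = train(pa, [(sa, act(pb, sb)) for sa, sb in paired_states])
    return pa, pb
```

    Whether `train` performs reinforcement learning updates or supervised imitation on the pseudo-labeled pairs is a caller choice, which mirrors the framework's compatibility with both settings.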