This paper investigates online stochastic planning for problems with large factored state and action spaces. One promising approach in recent work estimates the quality of applicable actions in the current state through aggregate simulation from the states they reach. This leads to significant speedup compared to search over concrete states and actions, and suffices to guide decision making in cases where the performance of a random policy is informative of the quality of a state. The paper makes two significant improvements to this approach. The first, taking inspiration from lifted belief propagation, exploits the structure of the problem to derive a more compact computation graph for aggregate simulation. The second replaces the random policy embedded in the computation graph with symbolic variables that are optimized simultaneously with the search for high-quality actions. This expands the scope of the approach to problems that require deep search and where information is lost quickly with random steps. An empirical evaluation shows that these ideas significantly improve performance, yielding state-of-the-art results on hard planning problems.
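To illustrate the flavor of aggregate simulation with a symbolic policy, here is a rough sketch under assumptions of my own (a single binary state variable, made-up transition probabilities, and a numerical rather than symbolic gradient): state marginals are propagated under an independence approximation, and the per-step action marginals are the decision variables optimized by gradient ascent.

```python
# Illustrative sketch only: made-up dynamics, not the paper's algorithm.
import numpy as np

HORIZON = 5

def rollout_value(q):
    """Expected cumulative reward under the independence approximation.
    p is the marginal probability that the single state bit is true;
    q[t] is the marginal probability of taking the action at step t."""
    p, value = 0.0, 0.0
    for t in range(HORIZON):
        # taking the action turns the bit on w.p. 0.9; otherwise it
        # persists w.p. 0.8 (made-up numbers)
        p = q[t] * 0.9 + (1.0 - q[t]) * p * 0.8
        value += p                     # reward 1 whenever the bit is true
    return value

def numeric_grad(q, eps=1e-5):
    # central finite differences stand in for the symbolic gradient
    g = np.zeros_like(q)
    for i in range(len(q)):
        d = np.zeros_like(q)
        d[i] = eps
        g[i] = (rollout_value(q + d) - rollout_value(q - d)) / (2 * eps)
    return g

q = np.full(HORIZON, 0.5)              # start from a uniform random policy
for _ in range(100):                   # projected gradient ascent on [0, 1]
    q = np.clip(q + 0.1 * numeric_grad(q), 0.0, 1.0)

print("action marginals:", np.round(q, 2), "value:", round(rollout_value(q), 3))
```

In the actual approach the rollout is built as a symbolic computation graph, so the gradient with respect to the action variables is obtained from the graph rather than by finite differences.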
DiSProD: Differentiable Symbolic Propagation of Distributions for Planning
The paper introduces DiSProD, an online planner developed for environments with probabilistic transitions in continuous state and action spaces. DiSProD builds a symbolic graph that captures the distribution of future trajectories, conditioned on a given policy, using independence assumptions and approximate propagation of distributions. The symbolic graph provides a differentiable representation of the policy's value, enabling efficient gradient-based optimization for long-horizon search. The propagation of approximate distributions can be seen as an aggregation of many trajectories, making it well-suited for dealing with sparse rewards and stochastic environments. An extensive experimental evaluation compares DiSProD to state-of-the-art planners in discrete-time planning and real-time control of robotic systems. The proposed method improves over existing planners in handling stochastic environments, sensitivity to search depth, sparsity of rewards, and large action spaces. Additional real-world experiments demonstrate that DiSProD can control ground vehicles and surface vessels to successfully navigate around obstacles.
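As a concrete, much-simplified picture of what DiSProD-style propagation looks like (assumptions mine: a one-dimensional linear-Gaussian system with made-up constants, and hand-derived gradients instead of automatic differentiation), the sketch below pushes the mean and variance of the state forward through the horizon and runs gradient ascent on the action sequence against the resulting differentiable objective.

```python
# Minimal sketch, not the DiSProD implementation; constants are illustrative.
import numpy as np

goal, noise_var, horizon = 5.0, 0.1, 10    # made-up problem constants

def objective(actions, mu0=0.0, var0=0.0):
    """Approximate value -E[(x_T - goal)^2] for x_{t+1} = x_t + a_t + noise,
    obtained by propagating the mean and variance of the state distribution."""
    mu, var = mu0, var0
    for a in actions:
        mu, var = mu + a, var + noise_var  # moment propagation
    return -((mu - goal) ** 2 + var)

def grad(actions, mu0=0.0):
    # analytic gradient: d/da_i of -(mu_T - goal)^2 is -2 * (mu_T - goal)
    mu_T = mu0 + actions.sum()
    return np.full_like(actions, -2.0 * (mu_T - goal))

actions = np.zeros(horizon)
for _ in range(200):                       # plain gradient ascent over actions
    actions += 0.05 * grad(actions)

print("optimized actions:", np.round(actions, 3))
print("approximate value:", objective(actions))
```

The point of the symbolic graph in the paper is that this kind of objective, and its gradient, remains available for long horizons and richer dynamics without sampling individual trajectories.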
- Award ID(s):
- 2246261
- PAR ID:
- 10499803
- Publisher / Repository:
- International Joint Conferences on Artificial Intelligence
- Date Published:
- ISBN:
- 978-1-956792-03-4
- Page Range / eLocation ID:
- 5324 to 5332
- Format(s):
- Medium: X
- Location:
- Macau, SAR China
- Sponsoring Org:
- National Science Foundation
More Like this
-
The paper introduces a new algorithm for planning in partially observable Markov decision processes (POMDP) based on the idea of aggregate simulation. The algorithm uses product distributions to approximate the belief state and shows how to build a representation graph of an approximate action-value function over belief space. The graph captures the result of simulating the model in aggregate under independence assumptions, giving a symbolic representation of the value function. The algorithm supports large observation spaces using sampling networks, a representation of the process of sampling values of observations, which is integrated into the graph representation. Following previous work in MDPs, this approach enables action selection in POMDPs through gradient optimization over the graph representation. This approach complements recent algorithms for POMDPs which are based on particle representations of belief states and an explicit search for action selection. Our approach enables scaling to large factored action spaces in addition to large state spaces and observation spaces. An experimental evaluation demonstrates that the algorithm provides excellent performance relative to the state of the art in large POMDP problems.
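A toy sketch of the product-distribution idea (my own simplification, not the paper's algorithm or its sampling networks): the belief over two binary state bits is stored as independent marginals, and after an observation the exact posterior is projected back to product form by keeping only its marginals.

```python
# Illustrative only: two binary state bits and a made-up sensor model.
import itertools

belief = {"x1": 0.7, "x2": 0.4}            # P(bit is true), assumed independent

def obs_likelihood(obs, x1, x2):
    # made-up sensor: obs is a noisy reading of (x1 AND x2)
    return 0.9 if obs == (x1 and x2) else 0.1

def update(belief, obs):
    post = {"x1": 0.0, "x2": 0.0}
    norm = 0.0
    for x1, x2 in itertools.product([0, 1], repeat=2):
        prior = (belief["x1"] if x1 else 1 - belief["x1"]) * \
                (belief["x2"] if x2 else 1 - belief["x2"])
        w = prior * obs_likelihood(obs, x1, x2)
        norm += w
        post["x1"] += w * x1               # accumulate posterior marginals
        post["x2"] += w * x2
    return {k: v / norm for k, v in post.items()}

print(update(belief, obs=1))               # product-form belief after obs = 1
```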
-
It is well known that the problems of stochastic planning and probabilistic inference are closely related. This paper makes two contributions in this context. The first is to provide an analysis of the recently developed SOGBOFA heuristic planning algorithm that was shown to be effective for problems with large factored state and action spaces. It is shown that SOGBOFA can be seen as a specialized inference algorithm that computes its solutions through a combination of a symbolic variant of belief propagation and gradient ascent. The second contribution is a new solver for Marginal MAP (MMAP) inference. We introduce a new reduction from MMAP to maximum expected utility problems, which are suitable for the symbolic computation in SOGBOFA. This yields a novel algebraic gradient-based solver (AGS) for MMAP. An experimental evaluation illustrates the potential of AGS in solving difficult MMAP problems.
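To convey the gist of a gradient-based MMAP solver (a toy of my own construction, not the AGS algorithm itself), the snippet below relaxes a binary MAP variable to a probability q, writes the marginal over the remaining variable as a differentiable expression in q, and follows the gradient until q reaches a vertex of [0, 1].

```python
# Toy example only; the joint distribution is made up.
import numpy as np

# joint P(m, x) over two binary variables (rows index m, columns index x)
P = np.array([[0.10, 0.25],
              [0.40, 0.25]])

def objective(q):
    # E_{m ~ Bernoulli(q)} [ sum_x P(m, x) ], linear in the relaxed variable q
    return (1 - q) * P[0].sum() + q * P[1].sum()

q, lr = 0.5, 0.5
for _ in range(50):
    grad = P[1].sum() - P[0].sum()         # d objective / d q
    q = float(np.clip(q + lr * grad, 0.0, 1.0))

m = int(round(q))
print("relaxed q:", q, "-> MAP assignment m =", m, "with marginal", objective(m))
```

Because the relaxed objective is multilinear in the MAP variables, its maximum is attained at a vertex, which is what makes rounding the converged q back to a discrete assignment sensible.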
-
The metric dimension of a graph is the smallest number of nodes required to identify all other nodes uniquely based on shortest path distances. Applications of metric dimension include discovering the source of a spread in a network, canonically labeling graphs, and embedding symbolic data in low-dimensional Euclidean spaces. This survey gives a self-contained introduction to metric dimension and an overview of the quintessential results and applications. We discuss methods for approximating the metric dimension of general graphs, and specific bounds and asymptotic behavior for deterministic and random families of graphs. We conclude with related concepts and directions for future work.
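A tiny brute-force illustration (not from the survey) of the definition: enumerate landmark sets in increasing size and keep the first whose shortest-path distance vectors distinguish every node. The example graph is a 4-cycle, whose metric dimension is 2.

```python
# Brute-force metric dimension of a small example graph (a 4-cycle).
from itertools import combinations
from collections import deque

graph = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [2, 0]}

def bfs_dist(src):
    """Shortest-path distances from src to every node."""
    dist, q = {src: 0}, deque([src])
    while q:
        u = q.popleft()
        for v in graph[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                q.append(v)
    return dist

dists = {v: bfs_dist(v) for v in graph}

def resolves(landmarks):
    # every node must get a distinct vector of distances to the landmarks
    signatures = {tuple(dists[l][v] for l in landmarks) for v in graph}
    return len(signatures) == len(graph)

for k in range(1, len(graph) + 1):
    found = [s for s in combinations(graph, k) if resolves(s)]
    if found:
        print("metric dimension:", k, "resolving set:", found[0])
        break
```

Brute force is exponential in the number of nodes; the survey's interest is precisely in approximation methods and in bounds for structured and random graph families.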
-
Learning optimal policies in real-world domains with delayed rewards is a major challenge in Reinforcement Learning. We address the credit assignment problem by proposing a Gaussian Process (GP)-based immediate reward approximation algorithm and evaluate its effectiveness in 4 contexts where rewards can be delayed for long trajectories. In one GridWorld game and 8 Atari games, where immediate rewards are available, our results showed that on 7 out of 9 games, the proposed GP-inferred reward policy performed at least as well as the immediate reward policy and significantly outperformed the corresponding delayed reward policy. In e-learning and healthcare applications, we combined GP-inferred immediate rewards with offline Deep Q-Network (DQN) policy induction and showed that the GP-inferred reward policies outperformed the policies induced using delayed rewards in both real-world contexts.
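The sketch below is a much-simplified stand-in for the idea (assumptions mine: synthetic one-dimensional states, a trajectory's return split evenly across its steps as the regression target, and scikit-learn's GaussianProcessRegressor rather than the authors' GP model). It only shows the mechanic of fitting a GP that can then be queried as a surrogate immediate-reward function.

```python
# Illustrative sketch only; data, features, and targets are synthetic.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor

rng = np.random.default_rng(0)
X, y = [], []
for _ in range(20):                               # 20 synthetic trajectories
    center = rng.uniform(0.1, 0.9)
    states = center + rng.normal(0.0, 0.05, size=5)
    delayed_return = states.sum()                 # hidden truth: r(s) = s
    for s in states:
        X.append([s])                             # each visited state is paired
        y.append(delayed_return / len(states))    # with its per-step return share

gp = GaussianProcessRegressor(alpha=1e-2, normalize_y=True)
gp.fit(np.array(X), np.array(y))

# query the fitted GP as a surrogate immediate-reward function
print("inferred rewards:", gp.predict(np.array([[0.2], [0.5], [0.8]])).round(2))
```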