NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

POMDP inference and robust solution via deep reinforcement learning: an application to railway optimal maintenance

https://doi.org/10.1007/s10994-024-06559-2

Arcieri, Giacomo; Hoelzl, Cyprien; Schwery, Oliver; Straub, Daniel; Papakonstantinou, Konstantinos G; Chatzi, Eleni (October 2024, Machine Learning)

Abstract Partially Observable Markov Decision Processes (POMDPs) can model complex sequential decision-making problems under stochastic and uncertain environments. A main reason hindering their broad adoption in real-world applications is the unavailability of a suitable POMDP model or a simulator thereof. Available solution algorithms, such as Reinforcement Learning (RL), typically benefit from the knowledge of the transition dynamics and the observation generating process, which are often unknown and non-trivial to infer. In this work, we propose a combined framework for inference and robust solution of POMDPs via deep RL. First, all transition and observation model parameters are jointly inferred via Markov Chain Monte Carlo sampling of a hidden Markov model, which is conditioned on actions, in order to recover full posterior distributions from the available data. The POMDP with uncertain parameters is then solved via deep RL techniques with the parameter distributions incorporated into the solution via domain randomization, in order to develop solutions that are robust to model uncertainty. As a further contribution, we compare the use of Transformers and long short-term memory networks, which constitute model-free RL solutions and work directly on the observation space, with an approach termed the belief-input method, which works on the belief space by exploiting the learned POMDP model for belief inference. We apply these methods to the real-world problem of optimal maintenance planning for railway assets and compare the results with the current real-life policy. We show that the RL policy learned by the belief-input method is able to outperform the real-life policy by yielding significantly reduced life-cycle costs.
more » « less
Full Text Available
Deep reinforcement learning-driven life-cycle management of bridge and pavement systems

Bhattacharya, A; Saifullah, M; Papakonstantinou, KG (June 2024, Taylor Francis)

Full Text Available
Bridging POMDPs and Bayesian decision making for robust maintenance planning under model uncertainty: An application to railway systems

https://doi.org/10.1016/j.ress.2023.109496

Arcieri, Giacomo; Hoelzl, Cyprien; Schwery, Oliver; Straub, Daniel; Papakonstantinou, Konstantinos G; Chatzi, Eleni (November 2023, Reliability Engineering & System Safety)

Full Text Available
Hamiltonian MCMC methods for estimating rare events probabilities in high-dimensional problems

https://doi.org/10.1016/j.probengmech.2023.103485

Papakonstantinou, Konstantinos G; Nikbakht, Hamed; Eshra, Elsayed (October 2023, Probabilistic Engineering Mechanics)

Full Text Available
The role of value of information in multi-agent deep reinforcement learning for optimal decision-making under uncertainty

Saifullah, M; Andriotis, CP; Papakonstantinou, KG (July 2023, 14th International Conference on Applications of Statistics and Probability in Civil Engineering (ICASP14))

Full Text Available
Inference and dynamic decision-making for deteriorating systems with probabilistic dependencies through Bayesian networks and deep reinforcement learning

https://doi.org/10.1016/j.ress.2023.109144

Morato, P.G.; Andriotis, C.P.; Papakonstantinou, K.G.; Rigo, P. (July 2023, Reliability Engineering & System Safety)

Full Text Available
Scaled Spherical Simplex Filter and State-Space Damage-Plasticity Finite-Element Model for Computationally Efficient System Identification

https://doi.org/10.1061/(ASCE)EM.1943-7889.0001945

Amir, M.; Papakonstantinou, K. G.; Warn, G. P. (February 2022, Journal of Engineering Mechanics)

Full Text Available
Optimal inspection and maintenance planning for deteriorating structural components through dynamic Bayesian networks and Markov decision processes

https://doi.org/10.1016/j.strusafe.2021.102140

Morato, P.G.; Papakonstantinou, K.G.; Andriotis, C.P.; Nielsen, J.S.; Rigo, P. (January 2022, Structural Safety)

Full Text Available
Critical appraisal and mathematical properties of fragility analysis methods

Yi, S-r.; Papakonstantinou, K.G.; Andriotis, C.P.; Song, J. (January 2022, 13th International Conference on Structural Safety & Reliability (ICOSSAR))

Full Text Available
Deep reinforcement learning-based life-cycle management of deteriorating transportation systems

https://doi.org/10.1201/9781003322641-32

Saifullah, M.; Andriotis, C.P.; Papakonstantinou, K.G.; Stoffels, S.M. (January 2022, 11th International Conference on Bridge Maintenance, Safety and Management (IABMAS))

Full Text Available

« Prev Next »

Search for: All records