NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Statistical Reachability Analysis of Stochastic Cyber-Physical Systems Under Distribution Shift

https://doi.org/10.1109/TCAD.2024.3438072

Hashemi, Navid; Lindemann, Lars; Deshmukh, Jyotirmoy V (November 2024, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems)

Full Text Available
Scaling Learning-based Policy Optimization for Temporal Logic Tasks by Controller Network Dropout

https://doi.org/10.1145/3696112

Hashemi, Navid; Hoxha, Bardh; Prokhorov, Danil; Fainekos, Georgios; Deshmukh, Jyotirmoy V (October 2024, ACM Transactions on Cyber-Physical Systems)

This article introduces a model-based approach for training feedback controllers for an autonomous agent operating in a highly non-linear (albeit deterministic) environment. We desire the trained policy to ensure that the agent satisfies specific task objectives and safety constraints, both expressed in Discrete-Time Signal Temporal Logic (DT-STL). One advantage for reformulation of a task via formal frameworks, like DT-STL, is that it permits quantitative satisfaction semantics. In other words, given a trajectory and a DT-STL formula, we can compute therobustness, which can be interpreted as an approximate signed distance between the trajectory and the set of trajectories satisfying the formula. We utilize feedback control, and we assume a feed forward neural network for learning the feedback controller. We show how this learning problem is similar to training recurrent neural networks (RNNs), where the number of recurrent units is proportional to the temporal horizon of the agent’s task objectives. This poses a challenge: RNNs are susceptible to vanishing and exploding gradients, and naïve gradient descent-based strategies to solve long-horizon task objectives thus suffer from the same problems. To address this challenge, we introduce a novel gradient approximation algorithm based on the idea of dropout or gradient sampling. One of the main contributions is the notion ofcontroller network dropout, where we approximate the NN controller in several timesteps in the task horizon by the control input obtained using the controller in a previous training step. We show that our control synthesis methodology can be quite helpful for stochastic gradient descent to converge with less numerical issues, enabling scalable back-propagation over longer time horizons and trajectories over higher-dimensional state spaces. We demonstrate the efficacy of our approach on various motion planning applications requiring complex spatio-temporal and sequential tasks ranging over thousands of timesteps.
more » « less
Full Text Available
Survival of the Fittest: Evolutionary Adaptation of Policies for Environmental Shifts

Paul, S; Deshmukh, J (October 2024, IOS Press)

Full Text Available
Motion Planning for Automata-based Objectives using Efficient Gradient-based Methods

https://doi.org/10.1109/IROS58592.2024.10802177

Balakrishnan, Anand; Atasever, Merve; Deshmukh, Jyotirmoy V (October 2024, IEEE)

Full Text Available
Multi-agent Path Finding for Timed Tasks Using Evolutionary Games

Paul, S; Balakrishnan, A; Qin, X; Deshmukh, J (August 2024, International Conference on Quantitative Evaluation of Systems and Formal Modeling and Analysis of Timed Systems)

Full Text Available
Statistical Verification using Surrogate Models and Conformal Inference and a Comparison with Risk-Aware Verification

https://doi.org/10.1145/3635160

Qin, Xin; Xia, Yuan; Zutshi, Aditya; Fan, Chuchu; Deshmukh, Jyotirmoy V (April 2024, ACM Transactions on Cyber-Physical Systems)

Uncertainty in safety-critical cyber-physical systems can be modeled using a finite number of parameters or parameterized input signals. Given a system specification in Signal Temporal Logic (STL), we would like to verify that for all (infinite) values of the model parameters/input signals, the system satisfies its specification. Unfortunately, this problem is undecidable in general.Statistical model checking(SMC) offers a solution by providing guarantees on the correctness of CPS models by statistically reasoning on model simulations. We propose a new approach for statistical verification of CPS models for user-provided distribution on the model parameters. Our technique uses model simulations to learnsurrogate models, and usesconformal inferenceto provide probabilistic guarantees on the satisfaction of a given STL property. Additionally, we can provide prediction intervals containing the quantitative satisfaction values of the given STL property for any user-specified confidence level. We compare this prediction interval with the interval we get using risk estimation procedures. We also propose a refinement procedure based on Gaussian Process (GP)-based surrogate models for obtaining fine-grained probabilistic guarantees over sub-regions in the parameter space. This in turn enables the CPS designer to choose assured validity domains in the parameter space for safety-critical applications. Finally, we demonstrate the efficacy of our technique on several CPS models.
more » « less
Full Text Available
LB4TL: A Smooth Semantics for Temporal Logic to Train Neural Feedback Controllers

https://doi.org/10.1016/j.ifacol.2024.07.445

Hashemi, Navid; Williams, Samuel; Hoxha, Bardh; Prokhorov, Danil; Fainekos, Georgios; Deshmukh, Jyotirmoy (January 2024, IFAC-PapersOnLine)

Full Text Available
SOL: Sampling-based Optimal Linear bounding of arbitrary scalar functions

Yuriy Biktairov; Jyotirmoy Deshmukh (December 2023, Advances in Neural Information Processing Systems 36)

Full Text Available
Data-Driven Reachability Analysis of Stochastic Dynamical Systems with Conformal Inference

https://doi.org/10.1109/CDC49753.2023.10384213

Hashemi, Navid; Qin, Xin; Lindemann, Lars; Deshmukh, Jyotirmoy V. (December 2023, Proceedings of the IEEE Conference on Decision Control)

Full Text Available
Model-Free Reinforcement Learning for Spatiotemporal Tasks Using Symbolic Automata

https://doi.org/10.1109/CDC49753.2023.10383559

Balakrishnan, Anand; Jakšić, Stefan; Aguilar, Edgar A.; Ničković, Dejan; Deshmukh, Jyotirmoy V. (December 2023, Proceedings of the IEEE Conference on Decision Control)

Full Text Available

« Prev Next »

Search for: All records