NSF PAR Search | NSF Public Access Repository

Reachability analysis of recurrent neural networks

https://doi.org/10.1016/j.nahs.2025.101581

Choi, Sung Woo; Li, Yuntao; Yang, Xiaodong; Yamaguchi, Tomoya; Hoxha, Bardh; Fainekos, Georgios; Prokhorov, Danil; Tran, Hoang-Dung (May 2025, Nonlinear Analysis: Hybrid Systems)

Free, publicly-accessible full text available May 1, 2026

Scaling Learning-based Policy Optimization for Temporal Logic Tasks by Controller Network Dropout

https://doi.org/10.1145/3696112

Hashemi, Navid; Hoxha, Bardh; Prokhorov, Danil; Fainekos, Georgios; Deshmukh, Jyotirmoy V (October 2024, ACM Transactions on Cyber-Physical Systems)

This article introduces a model-based approach for training feedback controllers for an autonomous agent operating in a highly nonlinear (albeit deterministic) environment. We desire the trained policy to ensure that the agent satisfies specific task objectives and safety constraints, both expressed in Discrete-Time Signal Temporal Logic (DT-STL). One advantage of reformulating a task in a formal framework such as DT-STL is that it permits quantitative satisfaction semantics. In other words, given a trajectory and a DT-STL formula, we can compute the robustness, which can be interpreted as an approximate signed distance between the trajectory and the set of trajectories satisfying the formula. We utilize feedback control, and we assume a feedforward neural network for learning the feedback controller. We show how this learning problem is similar to training recurrent neural networks (RNNs), where the number of recurrent units is proportional to the temporal horizon of the agent’s task objectives. This poses a challenge: RNNs are susceptible to vanishing and exploding gradients, and naïve gradient descent-based strategies to solve long-horizon task objectives thus suffer from the same problems. To address this challenge, we introduce a novel gradient approximation algorithm based on the idea of dropout or gradient sampling. One of the main contributions is the notion of controller network dropout, where we approximate the NN controller in several timesteps in the task horizon by the control input obtained using the controller in a previous training step. We show that our control synthesis methodology helps stochastic gradient descent converge with fewer numerical issues, enabling scalable back-propagation over longer time horizons and trajectories over higher-dimensional state spaces. We demonstrate the efficacy of our approach on various motion planning applications requiring complex spatio-temporal and sequential tasks ranging over thousands of timesteps.
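To make the idea of robustness concrete, here is a minimal sketch of the standard quantitative semantics for two temporal operators over a finite, scalar, discrete-time trajectory. It is an illustration only, not the article's implementation: the function names, the example formula G(x < 5) ∧ F(x > 3), and the trajectory values are all assumptions chosen for this example.

```python
from typing import Callable, Sequence

# A predicate margin maps a state to a signed value:
# positive when the predicate holds, negative when it is violated.
Margin = Callable[[float], float]


def always(margin: Margin, traj: Sequence[float]) -> float:
    """Robustness of G(p): the worst-case margin along the trajectory."""
    return min(margin(x) for x in traj)


def eventually(margin: Margin, traj: Sequence[float]) -> float:
    """Robustness of F(p): the best-case margin along the trajectory."""
    return max(margin(x) for x in traj)


def conjunction(*values: float) -> float:
    """Robustness of a conjunction: the smallest robustness of its parts."""
    return min(values)


def example_robustness(traj: Sequence[float]) -> float:
    """Robustness of the hypothetical formula G(x < 5) AND F(x > 3)."""
    safe = always(lambda x: 5.0 - x, traj)      # stay below 5 at every step
    goal = eventually(lambda x: x - 3.0, traj)  # exceed 3 at some step
    return conjunction(safe, goal)


satisfying = [0.0, 1.0, 2.5, 4.5]
violating = [0.0, 1.0, 2.0]
print(example_robustness(satisfying))  # positive: formula satisfied
print(example_robustness(violating))   # negative: goal never reached
```

A positive value means the trajectory satisfies the formula with that much margin, and a negative value means it violates the formula. Because `min` and `max` are only piecewise differentiable, gradient-based training of a controller against such an objective typically relies on smooth approximations of these operators.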
Full Text Available

LB4TL: A Smooth Semantics for Temporal Logic to Train Neural Feedback Controllers

https://doi.org/10.1016/j.ifacol.2024.07.445

Hashemi, Navid; Williams, Samuel; Hoxha, Bardh; Prokhorov, Danil; Fainekos, Georgios; Deshmukh, Jyotirmoy (January 2024, IFAC-PapersOnLine)

Full Text Available

Verification of Recurrent Neural Networks with Star Reachability

https://doi.org/10.1145/3575870.3587128

Tran, Hoang Dung; Choi, Sung Woo; Yang, Xiaodong; Yamaguchi, Tomoya; Hoxha, Bardh; Prokhorov, Danil (May 2023, The 26th ACM International Conference on Hybrid Systems: Computation and Control)

The paper extends the recent star reachability method to verify the robustness of recurrent neural networks (RNNs) for use in safety-critical applications. RNNs are a popular machine learning method for various applications, but they are vulnerable to adversarial attacks, where slightly perturbing the input sequence can lead to an unexpected result. Recent notable techniques for verifying RNNs include unrolling and invariant inference approaches. The first method has scaling issues since unrolling an RNN creates a large feedforward neural network. The second method, using invariant sets, scales better but can produce unknown results due to the accumulation of overapproximation errors over time. This paper introduces a complementary verification method for RNNs that is both sound and complete. A relaxation parameter can be used to convert the method into a fast overapproximation method that still provides soundness guarantees. The method is designed to be used with NNV, a tool for verifying deep neural networks and learning-enabled cyber-physical systems. Compared to state-of-the-art methods, the extended exact reachability method is 10× faster, and the overapproximation method is 100× to 5000× faster.
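As a rough illustration of what a sound overapproximation of an unrolled RNN looks like, the sketch below propagates axis-aligned interval bounds through a small ReLU recurrence h_{t+1} = ReLU(W h_t + U x_t + b). This is plain interval arithmetic, a much coarser technique than the star sets used in the paper, and it does not use the NNV API; the function names, the weights, and the input bounds are all assumptions made for the example.

```python
from typing import List, Tuple

Vector = List[float]
Matrix = List[List[float]]


def affine_bounds(M: Matrix, lo: Vector, hi: Vector) -> Tuple[Vector, Vector]:
    """Bounds of M @ v when each v[j] lies in [lo[j], hi[j]]."""
    out_lo, out_hi = [], []
    for row in M:
        # A positive weight takes its minimum at lo[j], a negative one at hi[j].
        out_lo.append(sum(w * (lo[j] if w >= 0 else hi[j]) for j, w in enumerate(row)))
        out_hi.append(sum(w * (hi[j] if w >= 0 else lo[j]) for j, w in enumerate(row)))
    return out_lo, out_hi


def rnn_step_bounds(W: Matrix, U: Matrix, b: Vector,
                    h_lo: Vector, h_hi: Vector,
                    x_lo: Vector, x_hi: Vector) -> Tuple[Vector, Vector]:
    """Interval bounds on ReLU(W h + U x + b) for one recurrent step."""
    wh_lo, wh_hi = affine_bounds(W, h_lo, h_hi)
    ux_lo, ux_hi = affine_bounds(U, x_lo, x_hi)
    # ReLU is monotone, so applying it to each bound stays sound.
    new_lo = [max(0.0, a + c + d) for a, c, d in zip(wh_lo, ux_lo, b)]
    new_hi = [max(0.0, a + c + d) for a, c, d in zip(wh_hi, ux_hi, b)]
    return new_lo, new_hi


def unrolled_bounds(W: Matrix, U: Matrix, b: Vector, h0: Vector,
                    x_lo: Vector, x_hi: Vector, steps: int) -> Tuple[Vector, Vector]:
    """Unroll the recurrence, reusing the same input box at every step."""
    h_lo, h_hi = list(h0), list(h0)
    for _ in range(steps):
        h_lo, h_hi = rnn_step_bounds(W, U, b, h_lo, h_hi, x_lo, x_hi)
    return h_lo, h_hi


# Hypothetical one-unit RNN with each input perturbed within [-0.1, 0.1].
lo, hi = unrolled_bounds(W=[[0.5]], U=[[1.0]], b=[0.0], h0=[0.0],
                         x_lo=[-0.1], x_hi=[0.1], steps=3)
print(lo, hi)
```

The returned box always contains every hidden state the network can actually reach, which is what soundness means here. In networks with more than one unit, the box typically grows looser with each step because interval arithmetic discards correlations between units, which is the kind of accumulated overapproximation error the abstract mentions.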
Full Text Available

Risk-Bounded Control Using Stochastic Barrier Functions

https://doi.org/10.1109/LCSYS.2020.3043287

Yaghoubi, Shakiba; Majd, Keyvan; Fainekos, Georgios; Yamaguchi, Tomoya; Prokhorov, Danil; Hoxha, Bardh (November 2021, IEEE Control Systems Letters)
Full Text Available
