Verification of Recurrent Neural Networks with Star Reachability

Tran, Hoang Dung; Choi, Sung Woo; Yang, Xiaodong; Yamaguchi, Tomoya; Hoxha, Bardh; Prokhorov, Danil

doi:10.1145/3575870.3587128

Citation Details

Verification of Recurrent Neural Networks with Star Reachability

The paper extends the recent star reachability method to verify the robustness of recurrent neural networks (RNNs) for use in safety-critical applications. RNNs are a popular machine learning method for various applications, but they are vulnerable to adversarial attacks, where slightly perturbing the input sequence can lead to an unexpected result. Recent notable techniques for verifying RNNs include unrolling, and invariant inference approaches. The first method has scaling issues since unrolling an RNN creates a large feedforward neural network. The second method, using invariant sets, has better scalability but can produce unknown results due to the accumulation of overapproximation errors over time. This paper introduces a complementary verification method for RNNs that is both sound and complete. A relaxation parameter can be used to convert the method into a fast overapproximation method that still provides soundness guarantees. The method is designed to be used with NNV, a tool for verifying deep neural networks and learning-enabled cyber-physical systems. Compared to state-of-the-art methods, the extended exact reachability method is 10 × faster, and the overapproximation method is 100 × to 5000 × faster. more »

Award ID(s):: 2220418

PAR ID:: 10451179

Author(s) / Creator(s):: Tran, Hoang Dung; Choi, Sung Woo; Yang, Xiaodong; Yamaguchi, Tomoya; Hoxha, Bardh; Prokhorov, Danil

Date Published:: 2023-05-09

Journal Name:: The 26th ACM International Conference on Hybrid Systems: Computation and Control

Page Range / eLocation ID:: 1 to 13

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3575870.3587128

More Like this