Most cybersecurity research focuses on either presenting a specific vulnerability or hacking technique, or proposing a specific defense algorithm against a well-defined attack scheme. Although such research is important, few studies have paid attention to the dynamic interactions between attackers and defenders, where both sides are intelligent and dynamically change their attack or defense strategies to gain the upper hand over their opponents. This 'cyberwar' phenomenon underlies most real-world cybersecurity incidents and warrants dedicated research and analysis. In this paper, we propose a dynamic game-theoretic framework (i.e., hyper defense) to analyze the interactions between the attacker and the defender as a non-cooperative security game. The key idea is to model attackers and defenders as having multiple levels of attack and defense strategies that differ in effectiveness, strategy cost, and attack gain or damage. Each player adjusts his strategy based on the strategy's cost, potential attack gain or damage, and effectiveness in anticipation of the opponent's strategy. We study the achievable Nash equilibrium for the attacker-defender security game, in which the players employ an efficient strategy according to the obtained equilibrium. Furthermore, we present case studies of three different types of network attacks and show how our hyper defense system can successfully model them. Simulation results show that the proposed game-theoretic system outperforms two other fixed-strategy defense systems.
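As an illustration of the kind of attacker-defender game such a framework reasons about (not the paper's hyper defense implementation; the strategy levels and payoff numbers below are hypothetical, with each strategy's cost already netted into its payoff), the following sketch enumerates the pure-strategy Nash equilibria of a small matrix game:

```python
# Hypothetical pure-strategy Nash equilibrium search for a small
# attacker-defender game; payoff values are illustrative only.
import numpy as np

# Rows: defender strategy levels (light/medium/heavy defense).
# Columns: attacker strategy levels (low/medium/high effort).
defender_payoff = np.array([[-4.0, -7.0, -9.0],
                            [-2.0, -3.0, -6.0],
                            [-3.0, -2.0, -2.5]])
attacker_payoff = np.array([[3.0, 6.0, 7.0],
                            [1.0, 2.0, 4.0],
                            [0.5, 1.0, 1.5]])

def pure_nash(def_pay, att_pay):
    """Return all (defender, attacker) strategy pairs that are mutual best responses."""
    equilibria = []
    for d in range(def_pay.shape[0]):
        for a in range(att_pay.shape[1]):
            defender_cannot_improve = def_pay[d, a] >= def_pay[:, a].max()
            attacker_cannot_improve = att_pay[d, a] >= att_pay[d, :].max()
            if defender_cannot_improve and attacker_cannot_improve:
                equilibria.append((d, a))
    return equilibria

print(pure_nash(defender_payoff, attacker_payoff))   # -> [(2, 2)] for these numbers
```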
QFlip: An Adaptive Reinforcement Learning Strategy for the FlipIt Security Game
A rise in Advanced Persistent Threats (APTs) has introduced a need for robustness against long-running, stealthy attacks that circumvent existing cryptographic security guarantees. FlipIt is a security game that models attacker-defender interactions in advanced scenarios such as APTs. Previous work extensively analyzed non-adaptive strategies in FlipIt, but adaptive strategies arise naturally in practical interactions as players receive feedback during the game. We model the FlipIt game as a Markov Decision Process and introduce QFlip, an adaptive strategy for FlipIt based on temporal difference reinforcement learning. We prove theoretical results on the convergence of our new strategy against an opponent playing with a Periodic strategy. We confirm our analysis experimentally by extensive evaluation of QFlip against specific opponents. QFlip converges to the optimal adaptive strategy for Periodic and Exponential opponents using associated state spaces. Finally, we introduce a generalized QFlip strategy with a composite state space that outperforms a Greedy strategy for several distributions, including Periodic and Uniform, without prior knowledge of the opponent's strategy. We also release an OpenAI Gym environment for FlipIt to facilitate future research.
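As a rough, hedged illustration of temporal-difference learning in a FlipIt-like setting (a simplified discrete-time stand-in, not the authors' QFlip code or their released Gym environment; all constants are made up), the sketch below runs tabular Q-learning against a stealthy Periodic opponent, with the state being the time since the opponent's last move as revealed whenever the agent flips:

```python
# Simplified discrete-time FlipIt-style loop with tabular Q-learning against a
# stealthy Periodic opponent. Not the authors' QFlip implementation or their
# released Gym environment; all constants are illustrative.
import random

TICKS, PERIOD, MOVE_COST = 200_000, 10, 4.0
ALPHA, GAMMA, EPSILON = 0.1, 0.95, 0.1
MAX_STATE = 3 * PERIOD        # ticks since the opponent's last move known to the agent

Q = [[0.0, 0.0] for _ in range(MAX_STATE + 1)]   # actions: 0 = wait, 1 = flip

owner, last_opp_flip, state = "opponent", 0, 0
for t in range(1, TICKS):
    if t % PERIOD == 0:                      # opponent flips, unseen by the agent
        owner, last_opp_flip = "opponent", t
    greedy = max((0, 1), key=lambda a: Q[state][a])
    action = random.randrange(2) if random.random() < EPSILON else greedy
    reward = 0.0
    if action == 1:                          # flipping costs MOVE_COST, grabs control,
        owner = "agent"                      # and reveals the opponent's last move time
        reward -= MOVE_COST
        next_state = min(t - last_opp_flip, MAX_STATE)
    else:
        next_state = min(state + 1, MAX_STATE)
    reward += 1.0 if owner == "agent" else 0.0   # benefit accrues while in control
    Q[state][action] += ALPHA * (reward + GAMMA * max(Q[next_state]) - Q[state][action])
    state = next_state

# The greedy policy learned from Q tends toward "flip shortly after the
# opponent's expected move," the intuitive best response to a Periodic player.
```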
- Award ID(s): 1717634
- PAR ID: 10176641
- Date Published:
- Journal Name: Lecture Notes in Computer Science
- Volume: 11836
- ISSN: 0302-9743
- Page Range / eLocation ID: 364-384
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
-
Guruswami, Venkatesan (Ed.)
Gameplay under various forms of uncertainty has been widely studied. Feldman et al. [Michal Feldman et al., 2010] studied a particularly low-information setting in which one observes the opponent's actions but no payoffs, not even one's own, and introduced an algorithm that guarantees one's payoff nonetheless approaches the minimax optimal value (i.e., zero) in a symmetric zero-sum game. Against an opponent playing a minimax-optimal strategy, approaching the value of the game is the best one can hope to guarantee. However, a wealth of research in behavioral economics shows that people often do not make perfectly rational, optimal decisions. Here we consider whether it is possible to actually win in this setting if the opponent is behaviorally biased. We model several deterministic, biased opponents and show that even without knowing the game matrix in advance or observing any payoffs, it is possible to take advantage of each bias in order to win nearly every round (so long as the game has the property that each action beats and is beaten by at least one other action). We also provide a partial characterization of the kinds of biased strategies that can be exploited to win nearly every round, and provide algorithms for beating some kinds of biased strategies even when we don't know which strategy the opponent uses.
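A toy sketch of the underlying idea follows, with one important simplification: the "beats" relation is assumed known here, whereas the cited work shows how to win even without the game matrix or any payoff feedback. The opponent's copy-our-last-move bias is hypothetical.

```python
# Toy rock-paper-scissors exploit of a deterministic, biased opponent using only
# observed actions (no payoff feedback). Unlike the paper's setting, the "beats"
# relation is assumed known; the copy-our-last-move bias is hypothetical.
from collections import Counter, defaultdict
import random

ACTIONS = ["rock", "paper", "scissors"]
BEATEN_BY = {"rock": "paper", "paper": "scissors", "scissors": "rock"}  # value beats key

def opponent(our_last):
    """Deterministic bias: copy whatever we played in the previous round."""
    return our_last if our_last is not None else "rock"

response_to = defaultdict(Counter)    # our last move -> what the opponent played next
our_last, wins = None, 0
for _ in range(1000):
    seen = response_to[our_last]
    if seen:                          # predict the opponent's next move from history
        predicted = seen.most_common(1)[0][0]
        our_move = BEATEN_BY[predicted]
    else:
        our_move = random.choice(ACTIONS)
    opp_move = opponent(our_last)
    response_to[our_last][opp_move] += 1
    if BEATEN_BY[opp_move] == our_move:
        wins += 1
    our_last = our_move

print(f"won {wins} of 1000 rounds")   # nearly every round after a brief learning phase
```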
-
Although security games have attracted intensive research attention over the past years, few existing works consider how information from local communities would affect the game. In this paper, we introduce a new player -- a strategic informant, who can observe and report upcoming attacks -- to the defender-attacker security game setting. Characterized by a private type, the informant has his own utility structure, which leads to his strategic behavior. We model the game as a 3-player extensive-form game and propose a novel solution concept of Strong Stackelberg-perfect Bayesian equilibrium. To compute the optimal defender strategy, we first show that although the informant can have infinitely many types in general, the optimal defense plan can only include a finite (exponential) number of different patrol strategies. We then prove that there exists a defense plan with only a linear number of patrol strategies that achieves the optimal defender utility, which significantly reduces the computational burden and allows us to solve the game in polynomial time using linear programming. Finally, we conduct extensive experiments to show the effect of the strategic informant and demonstrate the effectiveness of our algorithm.
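For context on how linear programs enter such computations, here is a hedged sketch of the classic multiple-LPs approach to a basic two-player Stackelberg security game without an informant; the per-target utilities and single defensive resource are illustrative, and this is not the paper's 3-player formulation.

```python
# Hedged sketch: "multiple LPs" computation of a Strong Stackelberg equilibrium
# for a basic two-player security game (no informant). Utilities are illustrative.
import numpy as np
from scipy.optimize import linprog

# Columns: [target covered, target uncovered] utilities.
Ud = np.array([[0.0, -5.0], [0.0, -8.0], [0.0, -3.0]])   # defender
Ua = np.array([[-2.0, 5.0], [-1.0, 8.0], [-3.0, 3.0]])   # attacker
n_targets, resources = len(Ud), 1

best_value, best_coverage = -np.inf, None
for t in range(n_targets):            # assume the attacker best-responds with target t
    # Defender utility at t is (Ud[t,0] - Ud[t,1]) * c_t + Ud[t,1]; linprog minimizes.
    objective = np.zeros(n_targets)
    objective[t] = -(Ud[t, 0] - Ud[t, 1])
    A_ub, b_ub = [], []
    for s in range(n_targets):        # attacker must weakly prefer t over every s
        if s == t:
            continue
        row = np.zeros(n_targets)
        row[t] = -(Ua[t, 0] - Ua[t, 1])
        row[s] = Ua[s, 0] - Ua[s, 1]
        A_ub.append(row)
        b_ub.append(Ua[t, 1] - Ua[s, 1])
    A_ub.append(np.ones(n_targets))   # limited defensive resources
    b_ub.append(resources)
    res = linprog(objective, A_ub=np.array(A_ub), b_ub=np.array(b_ub),
                  bounds=[(0.0, 1.0)] * n_targets)
    if res.success and Ud[t, 1] - res.fun > best_value:
        best_value, best_coverage = Ud[t, 1] - res.fun, res.x

print("defender value %.2f with coverage %s" % (best_value, np.round(best_coverage, 2)))
```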
-
Deep brain stimulation (DBS) is a commonly used treatment for medication-resistant Parkinson's disease and is an emerging treatment for other neurological disorders. More recently, phase-specific adaptive DBS (aDBS), whereby the application of stimulation is locked to a particular phase of tremor, has been proposed as a strategy to improve therapeutic efficacy and decrease side effects. In this work, in the context of these phase-specific aDBS strategies, we investigate the dynamical behavior of large populations of coupled neurons in response to near-periodic stimulation, namely, stimulation that is periodic except for a slowly changing amplitude and phase offset that can be used to coordinate the timing of applied input with a specified phase of model oscillations. Using an adaptive phase-amplitude reduction strategy, we illustrate that for a large population of oscillatory neurons, the temporal evolution of the associated phase distribution in response to near-periodic forcing can be captured using a reduced-order model with four state variables. Subsequently, we devise and validate a closed-loop control strategy to disrupt synchronization caused by coupling. Additionally, we identify strategies for implementing the proposed control strategy in situations where underlying model equations are unavailable by estimating the necessary terms of the reduced-order equations in real time from observables.
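As a loose illustration of the kind of population-level simulation involved (not the authors' adaptive phase-amplitude reduction or their four-state reduced-order model; all parameters are invented), the sketch below integrates a forced Kuramoto-style population under near-periodic stimulation and tracks the order parameter that quantifies synchrony:

```python
# Toy forced Kuramoto-style population under near-periodic stimulation (slowly
# varying amplitude and phase offset). Not the authors' reduced-order model or
# control strategy; all parameters are invented.
import numpy as np

rng = np.random.default_rng(0)
N, K, dt, steps = 500, 0.8, 1e-3, 50_000
omega = rng.normal(2 * np.pi * 6.0, 0.5, N)      # tremor-band natural frequencies (rad/s)
theta = rng.uniform(0, 2 * np.pi, N)
stim_freq = 2 * np.pi * 6.0

synchrony = []
for k in range(steps):
    t = k * dt
    amp = 1.0 + 0.3 * np.sin(2 * np.pi * 0.05 * t)   # slowly varying amplitude
    offset = 0.5 * np.sin(2 * np.pi * 0.02 * t)      # slowly drifting phase offset
    z = np.exp(1j * theta).mean()                    # Kuramoto order parameter
    coupling = K * np.abs(z) * np.sin(np.angle(z) - theta)
    stimulation = amp * np.sin(stim_freq * t + offset - theta)
    theta += dt * (omega + coupling + stimulation)
    if k % 500 == 0:
        synchrony.append(np.abs(z))                  # |z| near 1 means fully synchronized

print("order parameter: start %.2f, end %.2f" % (synchrony[0], synchrony[-1]))
```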
-
The increasing penetration of cyber systems into smart grids has made these grids more vulnerable to cyber-physical attacks. The central challenge of higher-order cyber-physical contingency analysis is the exponential blow-up of the attack surface due to the large number of attack vectors, which makes devising efficient attack mitigation strategies computationally challenging. However, a system operator can leverage private information about the underlying network to maintain a strategic advantage over an adversary equipped with superior computational capability and situational awareness. In this work, we examine the following scenario: a malicious entity intrudes into the cyber layer of a power network and trips transmission lines. The objective of the system operator is to deploy security measures in the cyber layer to minimize the impact of such attacks. Due to budget constraints, the attacker and the system operator have limits on the maximum number of transmission lines they can attack or defend. We model this adversarial interaction as a resource-constrained attacker-defender game. The computational intractability of solving large security games is well known. However, we exploit the approximately modular behavior of an impact metric known as the disturbance value to arrive at a linear-time algorithm for computing an optimal defense strategy. We validate the efficacy of the proposed strategy against attackers of various capabilities and provide an algorithm for a real-time implementation.
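A hedged sketch of the intuition, assuming the disturbance value is exactly additive across lines (the paper only assumes approximate modularity; line names and values are hypothetical): under modularity, the defender's best use of its budget is to protect the highest-value lines, and the attacker best-responds on the unprotected ones.

```python
# Greedy defense under an (assumed) additive per-line disturbance value.
# Line names and values are hypothetical; budgets are illustrative.
import heapq

disturbance = {
    "line_1": 9.2, "line_2": 7.5, "line_3": 6.1,
    "line_4": 3.4, "line_5": 2.8, "line_6": 1.1,
}
k_defend, k_attack = 2, 2

# Defender secures the k_defend lines with the largest disturbance values
# (heapq.nlargest avoids a full sort; a selection algorithm gives linear time).
defended = set(heapq.nlargest(k_defend, disturbance, key=disturbance.get))

# Attacker best-responds by tripping the most damaging unprotected lines.
remaining = {line: v for line, v in disturbance.items() if line not in defended}
attacked = heapq.nlargest(k_attack, remaining, key=remaining.get)

impact = sum(disturbance[line] for line in attacked)
print("defended:", sorted(defended), "attacked:", attacked, "impact: %.1f" % impact)
```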