TrojDRL: Trojan Attacks on Deep Reinforcement Learning Agents. In Proc. 57th ACM/IEEE Design Automation Conference (DAC), 2020, March 2020

Kiourti, Panagiota; Wardega, Kacper; Jha, Susmit; Li, Wenchao.

Citation Details

We present TrojDRL, a tool for exploring and evaluating backdoor attacks on deep reinforcement learning agents. TrojDRL exploits the sequential nature of deep reinforcement learning (DRL) and considers different gradations of threat models. We show that untargeted attacks on state-of-the-art actor-critic algorithms can circumvent existing defenses built on the assumption that backdoors are targeted. We evaluated TrojDRL on a broad set of DRL benchmarks and showed that the attacks require poisoning as little as 0.025% of the training data. Compared with existing work on backdoor attacks on classification models, TrojDRL provides a first step towards understanding the vulnerability of DRL agents.
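To give a sense of scale for the 0.025% figure, the following is a minimal sketch of stamping a trigger onto that fraction of a batch of observations. It is not TrojDRL's implementation: the function name `poison_states`, the 2x2 corner patch used as a trigger, and the array layout are all assumptions made for illustration.

```python
import numpy as np


def poison_states(states, poison_rate=0.00025, rng=None):
    """Stamp a hypothetical trigger onto a random fraction of states.

    states: array of shape (n, height, width), e.g. grayscale frames.
    poison_rate: fraction of states to poison (0.00025 == 0.025%).
    Returns a poisoned copy and the sorted indices that were modified.
    """
    rng = np.random.default_rng() if rng is None else rng
    n = len(states)
    # Number of poisoned samples, with at least one so the sketch is visible.
    k = max(1, round(n * poison_rate))
    idx = np.sort(rng.choice(n, size=k, replace=False))
    poisoned = states.copy()
    # Assumed trigger: a bright 2x2 patch in the top-left corner.
    poisoned[idx, :2, :2] = 255
    return poisoned, idx
```

With one million training states, this rate corresponds to 250 poisoned samples; the original array is left unmodified so clean and poisoned data can be compared.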

Award ID(s): 1740079 1750009

PAR ID: 10181034

Author(s) / Creator(s): Kiourti, Panagiota; Wardega, Kacper; Jha, Susmit; Li, Wenchao.

Date Published: 2020-06-01

Journal Name: Proc. 57th ACM/IEEE Design Automation Conference (DAC), 2020

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Conference Paper:
The DOI is not currently available.
