Topological Detection of Trojaned Neural Networks

Zheng Songzhu; Zhang Yikai; Wagner Hubert; Goswami Mayank; Chen Chao

Citation Details

Deep neural networks are known to have security issues. One particular threat is the Trojan attack. It occurs when the attackers stealthily manipulate the model's behavior through Trojaned training samples, which can later be exploited. Guided by basic neuroscientific principles we discover subtle -- yet critical -- structural deviation characterizing Trojaned models. In our analysis we use topological tools. They allow us to model high-order dependencies in the networks, robustly compare different networks, and localize structural abnormalities. One interesting observation is that Trojaned models develop short-cuts from input to output layers. Inspired by these observations, we devise a strategy for robust detection of Trojaned models. Compared to standard baselines it displays better performance on multiple benchmarks. more »

Award ID(s):: 1910873

PAR ID:: 10366262

Author(s) / Creator(s):: Zheng Songzhu; Zhang Yikai; Wagner Hubert; Goswami Mayank; Chen Chao

Date Published:: 2021-12-06

Journal Name:: Advances in neural information processing systems

ISSN:: 1049-5258

Page Range / eLocation ID:: 17258--17272

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this