Designing Trojan Detectors in Neural Networks Using Interactive Simulations.

Peter Bajcsy,Michael Majurski,Nicholas J Schaub

doi:10.3390/app11041865

Abstract

This paper addresses the problem of designing trojan detectors in neural networks (NNs) using interactive simulations. Trojans in NNs are defined as triggers in inputs that cause misclassification of such inputs into a class (or classes) unintended by the design of a NN-based model. The goal of our work is to understand encodings of a variety of trojan types in fully connected layers of neural networks. Our approach is (1) to simulate nine types of trojan embeddings into dot patterns, (2) to devise measurements of NN states, and (3) to design trojan detectors in NN-based classification models. The interactive simulations are built on top of TensorFlow Playground with in-memory storage of data and NN coefficients. The simulations provide analytical, visualization, and output operations performed on training datasets and NN architectures. The measurements of a NN include (a) model inefficiency using modified Kullback-Liebler (KL) divergence from uniformly distributed states and (b) model sensitivity to variables related to data and NNs. Using the KL divergence measurements at each NN layer and per each predicted class label, a trojan detector is devised to discriminate NN models with or without trojans. To document robustness of such a trojan detector with respect to NN architectures, dataset perturbations, and trojan types, several properties of the KL divergence measurement are presented. For the general use, the web-based simulations is deployed via GitHub pages at https://github.com/usnistgov/nn-calculator.

Highlights

The problem of detecting trojans in neural networks (NNs) models has been posed in the Trojan in Artificial Intelligence (TrojAI) challenge [1] by the Intelligence AdvancedResearch Projects Agency (IARPA)
We present a web-based trojan simulator with measurements and visualization of NN
The KL divergence has been thoroughly investigated for the purpose of detecting trojans embedded in NN models

Summary

Introduction

The problem of detecting trojans in neural networks (NNs) models has been posed in the Trojan in Artificial Intelligence (TrojAI) challenge [1] by the Intelligence AdvancedResearch Projects Agency (IARPA). The problem of detecting trojans in neural networks (NNs) models has been posed in the Trojan in Artificial Intelligence (TrojAI) challenge [1] by the Intelligence Advanced. For Rounds 1–4 of the TrojAI challenge, trojans in NNs are defined as triggers (local polygons or global filters) in input traffic sign images that cause misclassification of the input traffic sign class into another traffic sign class (or classes). A yellow region added to the stop sign in Figure 1 will change the classification outcome of the stop sign into a speed limit sign. The yellow region is considered as a trojan (or trigger) embedded in a stop sign region which will re-assign the images with trojan from Class A (stop sign) to Class B (speed limit 65). Additional information about simulating trojans and injecting trojans into images in TrojAI challenge datasets can be found in Appendix A

Objectives

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Jan 1, 2021
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Designing Trojan Detectors in Neural Networks Using Interactive Simulations.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Evaluation of Stability and Similarity of Latent Dirichlet Allocation
Jun Tang ... Jiali Yao
-
Jun Tang, et. al.Jun Tang ... Jiali Yao
01 Dec 2013
01 Dec 2013

A Vibration Signal Filtering Method Based on KL Divergence Genetic Algorithm – with Application to Low Speed Bearing Fault Diagnosis
Zhiqiang Liao ... Peng Chen
-
Zhiqiang Liao, et. al.Zhiqiang Liao ... Peng Chen
01 Nov 2018
01 Nov 2018

Information flow for security in control systems
Sean Weerakkody ... Bruno Sinopoli
-
Sean Weerakkody, et. al.Sean Weerakkody ... Bruno Sinopoli
01 Dec 2016
01 Dec 2016

<title>Video and image clustering using relative entropy</title>
Giridharan Iyengar ... Andrew B Lippman
-
Giridharan Iyengar, et. al.Giridharan Iyengar ... Andrew B Lippman
17 Dec 1998
17 Dec 1998

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Designing Trojan Detectors in Neural Networks Using Interactive Simulations.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences