Evaluating GAN-Based Image Augmentation for Threat Detection in Large-Scale Xray Security Images

Joanna Kazzandra Dumagpi,Yong-Jin Jeong

doi:10.3390/app11010036

Joanna Kazzandra Dumagpi, Yong-Jin Jeong

Open Access

https://doi.org/10.3390/app11010036

Copy DOI

Journal: Applied sciences	Publication Date: Dec 23, 2020
Citations: 19	License type: CC BY 4.0

Affiliation: Kwangwoon University

Abstract

The inherent imbalance in the data distribution of X-ray security images is one of the most challenging aspects of computer vision algorithms applied in this domain. Most of the prior studies in this field have ignored this aspect, limiting their application in the practical setting. This paper investigates the effect of employing Generative Adversarial Networks (GAN)-based image augmentation, or image synthesis, in improving the performance of computer vision algorithms on an imbalanced X-ray dataset. We used Deep Convolutional GAN (DCGAN) to generate new X-ray images of threat objects and Cycle-GAN to translate camera images of threat objects to X-ray images. We synthesized new X-ray security images by combining threat objects with background X-ray images, which are used to augment the dataset. Then, we trained various Faster (Region Based Convolutional Neural Network) R-CNN models using different augmentation approaches and evaluated their performance on a large-scale practical X-ray image dataset. Experiment results show that image synthesis is an effective approach to combating the imbalance problem by significantly reducing the false-positive rate (FPR) by up to 15.3%. The FPR is further improved by up to 19.9% by combining image synthesis and conventional image augmentation. Meanwhile, a relatively high true positive rate (TPR) of about 94% was maintained regardless of the augmentation method used.

Highlights

X-ray imaging has been an established technology used in various security systems deployed in ports, borders, and certain establishments [1]
The combined image synthesis and image transformation approaches balances a high true positive rate (TPR) and low false-positive rate (FPR) across all subsets, which has been supported by the results we obtained from the previous metrics
We evaluated the impact of Generative Adversarial Networks (GAN)-based image augmentation, called image synthesis, in the detection performance on a practical X-ray security image dataset

Summary

Introduction

X-ray imaging has been an established technology used in various security systems deployed in ports, borders, and certain establishments [1]. These datasets failed to capture the imbalanced nature of the distribution between positive samples, images that contain at least one threat, and negative samples, images that do not contain any threat, among other characteristics To address this issue, the first large-scale Xray security image database was made publicly available and it was shown that CNNbased detection model fails to achieve state-of-the-art performance in the presence of the imbalance problem [17]. We investigate the effects of using GAN-based image augmentation approaches to improve the performance of a threat detection model based on FasterRCNN [24] on a large-scale and imbalanced X-ray security dataset, referred to as a practical X-ray dataset.

Related Works

The Imbalanced Dataset

Image Synthesis

Experiment Setup

Experiment Results

Conclusions