Abstract

The inherent imbalance in the data distribution of X-ray security images is one of the most challenging aspects of computer vision algorithms applied in this domain. Most of the prior studies in this field have ignored this aspect, limiting their application in the practical setting. This paper investigates the effect of employing Generative Adversarial Networks (GAN)-based image augmentation, or image synthesis, in improving the performance of computer vision algorithms on an imbalanced X-ray dataset. We used Deep Convolutional GAN (DCGAN) to generate new X-ray images of threat objects and Cycle-GAN to translate camera images of threat objects to X-ray images. We synthesized new X-ray security images by combining threat objects with background X-ray images, which are used to augment the dataset. Then, we trained various Faster (Region Based Convolutional Neural Network) R-CNN models using different augmentation approaches and evaluated their performance on a large-scale practical X-ray image dataset. Experiment results show that image synthesis is an effective approach to combating the imbalance problem by significantly reducing the false-positive rate (FPR) by up to 15.3%. The FPR is further improved by up to 19.9% by combining image synthesis and conventional image augmentation. Meanwhile, a relatively high true positive rate (TPR) of about 94% was maintained regardless of the augmentation method used.

Highlights

  • X-ray imaging has been an established technology used in various security systems deployed in ports, borders, and certain establishments [1]

  • The combined image synthesis and image transformation approaches balances a high true positive rate (TPR) and low false-positive rate (FPR) across all subsets, which has been supported by the results we obtained from the previous metrics

  • We evaluated the impact of Generative Adversarial Networks (GAN)-based image augmentation, called image synthesis, in the detection performance on a practical X-ray security image dataset

Read more

Summary

Introduction

X-ray imaging has been an established technology used in various security systems deployed in ports, borders, and certain establishments [1]. These datasets failed to capture the imbalanced nature of the distribution between positive samples, images that contain at least one threat, and negative samples, images that do not contain any threat, among other characteristics To address this issue, the first large-scale Xray security image database was made publicly available and it was shown that CNNbased detection model fails to achieve state-of-the-art performance in the presence of the imbalance problem [17]. We investigate the effects of using GAN-based image augmentation approaches to improve the performance of a threat detection model based on FasterRCNN [24] on a large-scale and imbalanced X-ray security dataset, referred to as a practical X-ray dataset.

Related Works
The Imbalanced Dataset
Image Synthesis
Experiment Setup
Experiment Results
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call