Abstract

In recent years, machine learning algorithms have been applied widely in fields such as healthcare, transportation, and autonomous driving. With the rapid development of deep learning techniques, security must be taken into account when these algorithms are deployed; yet it is often ignored, even though these systems operate in the real world, where security is vital. In this paper, we propose a mitigation method against adversarial attacks on machine learning models based on an autoencoder, a generative model. The main idea behind an adversarial attack is to manipulate a trained model's inputs so that it produces erroneous results. We also evaluate the protection the autoencoder provides across a range of attack methods and model families, from traditional algorithms to deep neural networks: non-targeted and targeted attacks on multi-class logistic regression, and fast gradient sign method (FGSM), targeted fast gradient sign method (T-FGSM), and basic iterative method (BIM) attacks on neural networks, all on the MNIST dataset.

Highlights

  • With the help of artificial intelligence technology, machine learning has been widely used in classification, decision making, voice and face recognition, games, financial assessment, and other fields [9, 12, 44, 45, 48]

  • We examine the robustness of the autoencoder against adversarial machine learning across different machine learning algorithms and models, to show that autoencoding can serve as a generalized, easy-to-use defense mechanism against most adversarial attacks

  • Fast gradient sign method (FGSM), targeted fast gradient sign method (T-FGSM), and basic iterative method (BIM) attacks have been used against the neural network machine learning model (a sketch of all three attacks appears after this list)
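
As a rough sketch of the three attacks named above (the paper does not include code; PyTorch, the [0, 1] MNIST pixel range, and the function names here are our assumptions, not the authors' implementation):

```python
import torch
import torch.nn.functional as F

def fgsm(model, x, y, eps):
    """Non-targeted FGSM: one step *up* the loss gradient w.r.t. the input x
    (a batch of images), using the true labels y."""
    x = x.clone().detach().requires_grad_(True)
    F.cross_entropy(model(x), y).backward()
    return (x + eps * x.grad.sign()).clamp(0, 1).detach()

def t_fgsm(model, x, y_target, eps):
    """Targeted FGSM: one step *down* the loss toward a chosen target label."""
    x = x.clone().detach().requires_grad_(True)
    F.cross_entropy(model(x), y_target).backward()
    return (x - eps * x.grad.sign()).clamp(0, 1).detach()

def bim(model, x, y, eps, alpha=0.01, steps=10):
    """Basic iterative method: repeated small FGSM steps, projected back
    into the eps-ball around the original image after every step."""
    x_orig = x.clone().detach()
    x_adv = x_orig.clone()
    for _ in range(steps):
        x_adv = fgsm(model, x_adv, y, alpha)
        # Keep the total perturbation within eps of the original input.
        x_adv = torch.max(torch.min(x_adv, x_orig + eps), x_orig - eps)
    return x_adv
```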


Introduction

With the help of artificial intelligence technology, machine learning has been widely used in classification, decision making, voice and face recognition, games, financial assessment, and other fields [9, 12, 44, 45, 48]. Recent works in this area have demonstrated that existing defenses are not very robust to attacks [10, 11]; such methods have shown success against a specific set of attack methods but have generally failed to provide complete and generic protection [43]. We test our defense against adversarial attacks on both linear machine learning models and neural network models: to demonstrate its effectiveness, we select a linear model and a neural network model and observe their robustness under different attack methods.
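
The defense follows the same pattern for both model families: train an autoencoder on clean data and pass every input through it before classification, so that adversarial perturbations are projected back toward the manifold of clean digits. A minimal sketch, assuming PyTorch and a classifier that accepts flattened 28x28 inputs; the layer sizes and names below are illustrative, not the paper's exact architecture:

```python
import torch
import torch.nn as nn

class DenoisingAutoencoder(nn.Module):
    """Small fully connected autoencoder for flattened 28x28 MNIST digits."""
    def __init__(self, hidden=128, code=32):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Linear(784, hidden), nn.ReLU(),
            nn.Linear(hidden, code), nn.ReLU(),
        )
        self.decoder = nn.Sequential(
            nn.Linear(code, hidden), nn.ReLU(),
            nn.Linear(hidden, 784), nn.Sigmoid(),  # back to the [0, 1] range
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))

def defended_predict(classifier, autoencoder, x):
    # Reconstruct the (possibly adversarial) input first, then classify:
    # the reconstruction discards off-manifold perturbations.
    with torch.no_grad():
        x_flat = x.view(x.size(0), -1)
        return classifier(autoencoder(x_flat))
```

In this setup the autoencoder would be trained separately from the classifier, with a reconstruction loss (e.g., MSE) on clean MNIST images.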
