An analysis of rotation matrix and colour constancy data augmentation in classifying images of animals

Emmanuel Okafor,Lambert Schomaker,Marco A Wiering

doi:10.1080/24751839.2018.1479932

Emmanuel Okafor, Lambert Schomaker + Show 1 more

Open Access

https://doi.org/10.1080/24751839.2018.1479932

Copy DOI

Abstract

ABSTRACTIn this paper, we examine a novel data augmentation (DA) method that transforms an image into a new image containing multiple rotated copies of the original image. The DA method creates a grid of cells, in which each cell contains a different randomly rotated image and introduces a natural background in the newly created image. We investigate the use of deep learning to assess the classification performance on the rotation matrix or original dataset with colour constancy versions of the datasets. For the colour constancy methods, we use two well-known retinex techniques: the multi-scale retinex and the multi-scale retinex with colour restoration for enhancing both original (ORIG) and rotation matrix (ROT) images. We perform experiments on three datasets containing images of animals, from which the first dataset is collected by us and contains aerial images of cows or non-cow backgrounds. To classify the Aerial UAV images, we use a convolutional neural network (CNN) architecture and compare two loss functions (hinge loss and cross-entropy loss). Additionally, we compare the CNN to classical feature-based techniques combined with a k-nearest neighbour classifier or a support vector machine. The best approach is then used to examine the colour constancy DA variants, ORIG and ROT-DA alone for three datasets (Aerial UAV, Bird-600 and Croatia fish). The results show that the rotation matrix data augmentation is very helpful for the Aerial UAV dataset. Furthermore, the colour constancy data augmentation is helpful for the Bird-600 dataset. Finally, the results show that the fine-tuned CNNs significantly outperform the CNNs trained from scratch on the Croatia fish and the Bird-600 datasets, and obtain very high accuracies on the Aerial UAV and Bird-600 datasets.

Highlights

Data augmentation (DA) has often been used in deep learning to increase the number of training images to obtain high classification accuracies
The study considers two broad forms of data augmentation based on their increase or no increase (ROT-DA alone) in the amount of training images
Proposed colour constancy data augmentation: This study examines the possibility of using the ORIG or rotation matrix (ROT) images that are fed as input to the multi-scale retinex (MSR) or multi-scale retinex for colour restoration (MSRCR) algorithm

Summary

Introduction

Data augmentation (DA) has often been used in deep learning to increase the number of training images to obtain high classification accuracies. A recent research by Pawara, Okafor, Schomaker, and Wiering (2017) examined the classification performances of two convolutional neural network (CNN) methods (AlexNet and GoogleNet) with several DA techniques for different plant datasets. This research investigates the rotation matrix and colour constancy algorithms as methods for data augmentation with the objective to use one or more machine learning algorithms to classify images within three animal datasets. A recent study investigated the relevance of the radial transform (Salehinejad, Valaee, Dowdell, & Barfett, 2018) as a method of data augmentation on character and medical multi-modal images. The research by Sladojevic, Arsenovic, Anderla, Culibrk, and Stefanovic (2016) attempts to develop a plant disease recognition CNN model with three image transformation techniques: affine, perspective and rotation

Objectives

Methods

Results

Conclusion