Abstract

Transfer learning helps the performance of a learning algorithm significantly when training deep learning models on challenging datasets. However, the pre-trained networks have certain constraints in terms of their architecture. For example, due to the wide availability of color images, state-of-the-art pre-trained networks expect an input image with three color channels. Grayscale images have small sizes as compared to color images and thus can enable real time computer vision applications in scenarios where there are constraints on device memory and bandwidth. Therefore, in this work we propose an approach to run pre-trained models on grayscale images for image classification tasks. We have used the VGG16 pre-trained model to classify Kaggle Dogs vs. Cats dataset. We have compared our results with VGG16 applied on color images. Our results have shown that when the weights for the first hidden layer are initialized as the mean of the pre-trained network weights then the classification accuracy with only 0.04% error can be achieved. Our analysis has shown that comparable benefits can be reaped when using grayscale images for deep learning based classification tasks with only one-third of the bandwidth and storage requirements.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call