Abstract

Designing a convolutional neural network from scratch is one of the biggest challenges in creating reproducible models. Even when the model is fed an adequate amount of labeled data to mitigate fluctuations during training, it still suffers from high variance in final accuracy and loss across identical training runs. The main causes are randomness in data shuffling and augmentation, the behavior of the gradient descent optimizer, and non-determinism in convnet layers and in the GPU's floating-point computation. Our method for addressing these issues, specifically in the case of negative transfer learning, consists of three steps: first, designing an efficient lightweight convnet architecture with respect to the available resources; second, mitigating oscillations during training; and third, after fixing the random seed across training runs, selecting an appropriate weight initialization. Our extensive experiments on the use case of binary slum localization and detection show that our method improves the reproducibility of a model trained from scratch, achieving an accuracy of 98.88 ± 1.15% and a loss of 0.03 ± 0.05 at a 99.73% confidence level. These results make this model a strong competitor to pre-trained models that use transfer learning.
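The abstract does not name a framework, so the following is only a minimal sketch, in PyTorch (an assumption), of the kind of seed fixing and determinism settings it alludes to: pinning down the Python, NumPy, and framework RNGs that drive shuffling, augmentation, and weight initialization, and disabling cuDNN's non-deterministic convolution kernels. The helper names `set_global_seed` and `seed_worker` are hypothetical, not from the paper.

```python
import os
import random

import numpy as np
import torch

def set_global_seed(seed: int = 42) -> None:
    """Hypothetical helper: fix the sources of randomness the abstract
    lists (Python, NumPy, and PyTorch RNGs) and force deterministic
    GPU convolution kernels. The seed value is arbitrary."""
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)
    # Disable cuDNN autotuning and non-deterministic conv algorithms.
    torch.backends.cudnn.deterministic = True
    torch.backends.cudnn.benchmark = False
    # Required by some CUDA ops when determinism is enforced.
    os.environ["CUBLAS_WORKSPACE_CONFIG"] = ":4096:8"
    torch.use_deterministic_algorithms(True)

def seed_worker(worker_id: int) -> None:
    """Hypothetical helper: derive a reproducible per-worker seed so
    DataLoader shuffling and augmentation repeat across runs."""
    worker_seed = torch.initial_seed() % 2**32
    np.random.seed(worker_seed)
    random.seed(worker_seed)
```

Under these assumptions, passing `worker_init_fn=seed_worker` and a seeded `torch.Generator()` to the `DataLoader`, and applying a chosen initializer (e.g. `torch.nn.init.kaiming_normal_`) after seeding, would make data order, augmentation, and starting weights identical across runs.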
