Abstract

Abstract. Nowadays methods based on deep neural networks show the best performance among image recognition and object detection algorithms. Nevertheless, such methods require to have large databases of multispectral images of various objects to achieve state-of-the-art results. Therefore the dataset generation is one of the major challenges for the successful training of a deep neural network. However, infrared image datasets that are large enough for successful training of a deep neural network are not available in the public domain. Generation of synthetic datasets using 3D models of various scenes is a time-consuming method that requires long computation time and is not very realistic. This paper is focused on the development of the method for thermal image synthesis using a GAN (generative adversarial network). The aim of the presented work is to expand and complement the existing datasets of real thermal images. Today, deep convolutional networks are increasingly used for the goal of synthesizing various images. Recently a new generation of such algorithms commonly called GAN has become a promising tool for synthesizing images of various spectral ranges. These networks show effective results for image-to-image translations. While it is possible to generate a thermal texture for a single object, generation of environment textures is extremely difficult due to the presence of a large number of objects with different emission sources. The proposed method is based on a joint approach that uses 3D modeling and deep learning. Synthesis of background textures and objects textures is performed using a generative-adversarial neural network and semantic and geometric information about objects generated using 3D modeling. The developed approach significantly improves the realism of the synthetic images, especially in terms of the quality of background textures.

Highlights

  • In modern computer vision systems (enhanced vision (Vygolov et al, 2017) (Kniaz, 2014) system, autonomous driving (Kniaz, 2015)) the ability is most demanded to detect and recognize various objects with high probability in degraded visual conditions, such as fog, rain, night

  • Development of a new type of neural networks known as generative adversarial networks, made it possible to take a significant step forward in the field of synthesizing various images (Goodfellow et al, 2014)

  • GAN consists of two deep convolutional neural networks: a Generator network tries to synthesize an image that visually indistinguishable from a given sample of images in the target domain

Read more

Summary

INTRODUCTION

In modern computer vision systems (enhanced vision (Vygolov et al, 2017) (Kniaz, 2014) system, autonomous driving (Kniaz, 2015)) the ability is most demanded to detect and recognize various objects with high probability in degraded visual conditions, such as fog, rain, night. A robust algorithm is required for detecting and recognizing objects in multispectral images. Deep convolutional neural networks have proven to be a reliable algorithm for detecting and recognizing objects in images of the visible range. The most important factor in the success of DCNN (deep convolutional neural network) learning is large multispectral datasets, which are very difficult to obtain using experiments. A new generation of such algorithms commonly called generative adversarial network has become a promising tool for synthesizing images of various spectral ranges. These networks show effective results for image-to-image translations. The pipeline of the GIS method thermal image synthesis using a GAN and 3D modelling. The aim of the presented work is to expand and complement the existing datasets of real thermal images

RELATED WORK
APPROACH
Method
CNNs architecture
Framework
EXPERIMENTS
Training CNN
CNN evaluation
Findings
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call