Application of Deep Neural Networks in the Problem of Obtaining Depth Maps from Two-Dimensional Images

D I Mihalchenko,A G Ivin,E A Aksamentov,O Yu Sivchenko

doi:10.21869/2223-1560-2019-23-3-113-134

D I Mihalchenko, A G Ivin + Show 2 more

Open Access

PDF Available

https://doi.org/10.21869/2223-1560-2019-23-3-113-134

Copy DOI

Export

Save

Cite

Journal: Proceedings of the Southwest State University	Publication Date: Sep 7, 2019
Citations: 2	License type: cc-by

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

Purpose of research is to study approaches to the depth map generation for deep neural networks testing and learning. The problem of obtaining information about the distance from the camera to the scenery object using a 2D image by means of deep neural networks without applying a stereocamera is considered. Methods. Generation of 3D scenery for training and assessment of the neural network was carried out using the 3D-computer graphics application Blender. The standard deviation (RMS) was used to estimate the accuracy of learning. Machine learning was implemented using the Keras library and optimization was implemented using the AdaGrad approach. Results . The architecture of a deep neural network which receives three sequences of 2D images from the 3D scenery video stream in the input and outputs the predicted depth map for the considered 3D scenery, is provided. The method for creating training data sets containing information about the depth of the map using Blender software is described. The problem of overtraining involving the fact that the created models work using specially generated data sets but still can not predict the correct depth map for random images is studied. The results of the testing actual methods for depth maps creation using deep neural networks are presented. Conclusion . The main problem of the proposed method is overtraining which can be expressed in predicting a certain average value for different images or predicting the same output for different inputs. To solve this problem, we can use already trained networks or training and variation samples containing 2D images of different sceneries.

Highlights

Generation of 3D scenery for training and assessment of the neural network was carried out using the 3Dcomputer graphics application Blender
The main problem of the proposed method is overtraining which can be expressed in predicting a certain average value for different images or predicting the same output for different inputs
Для создания валидной базы данных карты глубины требуются дорогие и не всегда доступные датчики, например лазерный дальномер

Summary

Материалы и методы решения задачи

Существует несколько работ, описывающих оценку карты глубины по метрии (VGG) [29], причем, эта сеть первоначально была обучена классифицировать объекты. В работе [30] авторы уделяют особое внимание неустранимым неоднозначностям, возникающим при восстановлении 3D-изображений из 2D-изображения. Авторы [15] также делают акцент на использовании сверточных нейронных сетей и предлагают самостоятельно разработанную систему критериев оценки признаков, разработанную на материале множества данных RGB-D. В [31] представлен метод оценки глубины одного изображения. Авторы используют многочастичный алгоритм распространения доверия для вывода графической модели. В более позднем исследовании достигнутое значение СКО этого метода составило 0,824 [17]. Для создания карты глубины они используют стереоизображение. В статье показан хороший пример использования ГНС для создания алгоритма быстрой оценки глубины. В более старых работах, например в [35], авторы используют Марковскую сеть и обучение с учителем для создания карт глубины по одному изображению. В результате авторы создают каразного масштаба, имеющих форму бучественно правильные 3D-модели для 64,9% из 588 изображений

Результаты и их обсуждение

Список литературы