Single Image Depth Estimation Research Articles

Abstract. Depth is an essential component for various scene understanding tasks and for reconstructing the 3D geometry of the scene. Estimating depth from stereo images requires multiple views of the same scene to be captured which is often not possible when exploring new environments with a UAV. To overcome this monocular depth estimation has been a topic of interest with the recent advancements in computer vision and deep learning techniques. This research has been widely focused on indoor scenes or outdoor scenes captured at ground level. Single image depth estimation from aerial images has been limited due to additional complexities arising from increased camera distance, wider area coverage with lots of occlusions. A new aerial image dataset is prepared specifically for this purpose combining Unmanned Aerial Vehicles (UAV) images covering different regions, features and point of views. The single image depth estimation is based on image reconstruction techniques which uses stereo images for learning to estimate depth from single images. Among the various available models for ground-level single image depth estimation, two models, 1) a Convolutional Neural Network (CNN) and 2) a Generative Adversarial model (GAN) are used to learn depth from aerial images from UAVs. These models generate pixel-wise disparity images which could be converted into depth information. The generated disparity maps from these models are evaluated for its internal quality using various error metrics. The results show higher disparity ranges with smoother images generated by CNN model and sharper images with lesser disparity range generated by GAN model. The produced disparity images are converted to depth information and compared with point clouds obtained using Pix4D. It is found that the CNN model performs better than GAN and produces depth similar to that of Pix4D. This comparison helps in streamlining the efforts to produce depth from a single aerial image.

Read full abstract

Depth prediction from single image is a challenging task due to the intra scale ambiguity and unavailability of prior information. The prediction of an unambiguous depth from single RGB image is very important aspect for computer vision applications. In this paper, an end-to-end sparse-to-dense network (S2DNet) is proposed for single image depth estimation (SIDE). The proposed network processes single image along with the additional sparse depth samples for depth estimation. The additional sparse depth sample are acquired either with a low-resolution depth sensor or calculated by visual simultaneous localization and mapping (SLAM) algorithms. In first stage, the proposed S2DNet estimates coarse-level depth map using sparse-to-dense coarse network (S2DCNet). In second stage, the estimated coarse-level depth map is concatenated with the input image and used as an input to the sparse-to-dense fine network (S2DFNet) for fine-level depth map estimation. The proposed S2DFNet comprises of attention map architecture which helps to estimate the prominent depth information. The quantitative and qualitative performance evaluation of the proposed network has been carried out using the error metrics. We perform complete evaluation of S2DNet on four publically available benchmark data sets i.e. NYU Depth-V2 indoor dataset [1] , KITTI odometry outdoor dataset [2] , KITTI depth completion test database [3] and SUN-RGB database [4] . Further, we have extended the proposed S2DNet for image de-hazing. The experimental analysis shows that the proposed S2DNet outperforms the existing state-of-the-art methods for both single image depth estimation and image de-hazing.

Read full abstract

Single Image Depth Estimation Research Articles

Articles published on Single Image Depth Estimation

Consistent video depth estimation

DEEP LEARNING FOR MONOCULAR DEPTH ESTIMATION FROM UAV IMAGES

A self-supervised method of single-image depth estimation by feeding forward information using max-pooling layers

S2DNet: Depth Estimation From Single Image and Sparse Samples

Comparison of monocular depth estimation methods using geometrically relevant metrics on the IBims-1 dataset

MONOCULAR-DEPTH ASSISTED SEMI-GLOBAL MATCHING

Aggregation of Rich Depth-Aware Features in a Modified Stacked Generalization Model for Single Image Depth Estimation

Depth‐based end‐to‐end deep network for human action recognition

Deep Monocular Depth Estimation via Integration of Global and Local Predictions.

Single image depth estimation based on convolutional neural network and sparse connected conditional random field

GPU-Accelerated Single Image Depth Estimation with Color-Filtered Aperture

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Single Image Depth Estimation Research Articles

Articles published on Single Image Depth Estimation

Consistent video depth estimation

DEEP LEARNING FOR MONOCULAR DEPTH ESTIMATION FROM UAV IMAGES

A self-supervised method of single-image depth estimation by feeding forward information using max-pooling layers

S2DNet: Depth Estimation From Single Image and Sparse Samples

Comparison of monocular depth estimation methods using geometrically relevant metrics on the IBims-1 dataset

MONOCULAR-DEPTH ASSISTED SEMI-GLOBAL MATCHING

Aggregation of Rich Depth-Aware Features in a Modified Stacked Generalization Model for Single Image Depth Estimation

Depth‐based end‐to‐end deep network for human action recognition

Deep Monocular Depth Estimation via Integration of Global and Local Predictions.

Single image depth estimation based on convolutional neural network and sparse connected conditional random field

GPU-Accelerated Single Image Depth Estimation with Color-Filtered Aperture