Analysis of Depth and Semantic Mask for Perceiving a Physical Environment Using Virtual Samples Generated by a GAN

Javier Maldonado-Romo,Mario Aldape-Perez,Alejandro Rodriguez-Molina

doi:10.1109/access.2021.3137797

Javier Maldonado-Romo, Mario Aldape-Perez + Show 1 more

Open Access

https://doi.org/10.1109/access.2021.3137797

Copy DOI

Abstract

Micro aerial vehicles (MAVs) can make explorations in 3D environments using technologies capable of perceiving the environment to map and estimate the location of objects that could cause collisions, such as Simultaneous Localization and Mapping (SLAM). Nevertheless, the agent needs to move during the environment mapping, reducing the flying time to employ additional activities. It has to be noted that adding more devices (sensors) to MAVs implies more power consumption. Since more energy to perform tasks is required, growing the dimensions of MAVs limits the flying time. Contrarily, Generative Adversarial Networks (GAN) have demonstrated the usefulness of creating images from one domain to another, but the GAN domain changes require a large number of samples. Therefore, an interoperability coefficient is employed to determine a minimum number of samples to connect the different domains. In order to prove the coefficient, the performance to estimate the depth and semantic mask between authentic and virtual samples with the number limited of samples is analyzed. Consequently, an RGB-D sensor can be replaced by a few samples of a real scenario based on GANs. Although GAN allows creating images with depth and semantic mask information, there is an additional problem to be tackled: the presence of intrinsic noise, where a simple GAN architecture is not enough. In this proposal, the performance of this solution against a physical RGB-D sensor (Microsoft Kinect V1) and other state-of-the-art approaches is compared. Experimental results allow us to affirm that this proposal is a viable option to replace a physical RGB-D sensor with limited information.

Highlights

Robotics is a research area whose fundamental challenges have been obstacle detection and collision avoidance
We propose a double-Generative Adversarial Networks (GAN)-based architecture with noise reduction to estimate authentic images with depth and semantic mask using virtual samples
Double-GAN with noise reduction results is labeled as Double-GAN-Noise Reduction (NR)-2 and Double-GAN-NR-5 for both distances

Summary

INTRODUCTION

Robotics is a research area whose fundamental challenges have been obstacle detection and collision avoidance. The importance of optimizing the resources available to the MAV Most of these vehicles already have a built-in camera, so this resource can be taken advantage of and used as a perception system to estimate authentic images’ depth and semantic mask without adding additional devices. This paper proposes a double-GAN-based architecture with noise reduction to estimate authentic images’ depth and semantic mask using information generated by a virtual environment representation dataset with limited samples. This approach can effectively represent an RGB-D sensor using few samples of a real scenario based on a double-GAN approach.

RELATED WORKS

PROPOSED WORK

SIMILARITY BETWEEN IMAGES

INTEROPERABILITY COEFFICIENT FOR CONNECT VIRTUAL AND REAL ENVIRONMENTS

ARCHITECTURE

EXPERIMENTAL PHASE

METRICS

RESULTS

CONCLUSIONS

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE access : practical innovations, open solutions	Publication Date: Jan 1, 2022
Citations: 3	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Analysis of Depth and Semantic Mask for Perceiving a Physical Environment Using Virtual Samples Generated by a GAN

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE access : practical innovations, open solutions

Lead the way for us

Similar Papers

Towards Aerial Interaction of MAVs in GPS-Denied Environments
Aaron Lopez Luna ... Israel Cruz Vega
-
Aaron Lopez Luna, et. al.Aaron Lopez Luna ... Israel Cruz Vega
01 Nov 2019
01 Nov 2019

Robust Visual Simultaneous Localization and Mapping for MAV Using Smooth Variable Structure Filter
Abdelkrim Nemra ... Alejandro Gómez
-
Abdelkrim Nemra, et. al.Abdelkrim Nemra ... Alejandro Gómez
02 Dec 2015
02 Dec 2015

Indoor SLAM for Micro Aerial Vehicles Using Visual and Laser Sensor Fusion
Elena López ... Abdelkrim Nemra
-
Elena López, et. al.Elena López ... Abdelkrim Nemra
02 Dec 2015
02 Dec 2015

A Hybrid SLAM Method for Indoor Micro Aerial Vehicles
Yiwei Zheng ... Yang Xu
-
Yiwei Zheng, et. al.Yiwei Zheng ... Yang Xu
01 Jul 2019
01 Jul 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Analysis of Depth and Semantic Mask for Perceiving a Physical Environment Using Virtual Samples Generated by a GAN

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE access : practical innovations, open solutions