Abstract

To provide a realistic environment for remote sensing applications, point clouds are used to realize a three-dimensional (3D) digital world for the user. Motion recognition of objects, e.g., humans, is required to provide realistic experiences in this 3D digital world. To recognize a user’s motions, 3D landmarks are obtained by analyzing a 3D point cloud collected with a light detection and ranging (LiDAR) system or a red green blue (RGB) image collected with a camera. However, manual supervision is required to extract 3D landmarks, regardless of whether they originate from the RGB image or the 3D point cloud. Thus, a method for extracting 3D landmarks without manual supervision is needed. Herein, an RGB image and a 3D point cloud are used together to extract 3D landmarks. The 3D point cloud provides the relative distance between the LiDAR and the user. Because the point cloud cannot capture the user’s entire body due to disparities, it cannot by itself produce a dense depth image that delineates the boundary of the user’s body. Therefore, up-sampling is performed to increase the density of the depth image generated from the 3D point cloud; this density depends on the 3D point cloud. This paper proposes a system for extracting 3D landmarks from 3D point clouds and RGB images without manual supervision. A depth image that provides the boundary of the user’s motion is generated from the 3D point cloud and the RGB image collected by a LiDAR and an RGB camera, respectively. To extract 3D landmarks automatically, an encoder–decoder model is trained with the generated depth images and the RGB images, and 3D landmarks are extracted from these images with the trained encoder model. The method of extracting 3D landmarks from RGB depth (RGBD) images was verified experimentally, and 3D landmarks were extracted to evaluate the user’s motions with RGBD images. In this manner, landmarks could be extracted according to the user’s motions, rather than from the RGB images alone. The depth images generated by the proposed method were 1.832 times denser than the up-sampled depth images generated with bilateral filtering.
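As a rough illustration of the density comparison in the abstract, the minimal sketch below projects a LiDAR point cloud onto the camera image plane to form a sparse depth image, measures its density as the fraction of pixels carrying a depth value, and applies a bilateral filter as a crude stand-in for the up-sampling baseline. The camera intrinsics `K`, the image size, and the filter parameters are assumptions for illustration, not values from the paper.

```python
import numpy as np
import cv2


def project_to_sparse_depth(points_xyz, K, image_size):
    """Project LiDAR points (assumed already in the camera frame) onto the
    image plane, producing a sparse depth image (0 = no measurement)."""
    h, w = image_size
    depth = np.zeros((h, w), dtype=np.float32)
    pts = points_xyz[points_xyz[:, 2] > 0]      # keep points in front of the camera
    uv = (K @ pts.T).T
    uv = uv[:, :2] / uv[:, 2:3]                 # pinhole projection to pixel coordinates
    u = np.round(uv[:, 0]).astype(int)
    v = np.round(uv[:, 1]).astype(int)
    inside = (u >= 0) & (u < w) & (v >= 0) & (v < h)
    depth[v[inside], u[inside]] = pts[inside, 2]
    return depth


def depth_density(depth):
    """Fraction of pixels with a depth value; a simple way to compare how
    dense two depth images are (as in the 1.832x comparison above)."""
    return np.count_nonzero(depth) / depth.size


def upsample_bilateral(sparse_depth):
    """Crude bilateral-filter up-sampling baseline (parameters are illustrative)."""
    return cv2.bilateralFilter(sparse_depth, d=9, sigmaColor=75, sigmaSpace=75)
```

A typical use would be to compute `depth_density` for the depth image produced by the proposed pipeline and for `upsample_bilateral(project_to_sparse_depth(...))`, and compare the two ratios.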

Highlights

  • Three-dimensional (3D) digital environments are provided to take advantage of various platforms such as remotely controlled unmanned aerial vehicles and autonomous vehicles [1,2,3,4,5,6]

  • A depth image provides the boundary of the user’s motion; it is generated from the collected 3D point cloud and the difference image computed from the background image and the collected red green blue (RGB) image (a minimal sketch follows this list)

  • An improved encoder–decoder model is trained to extract 3D landmarks from the generated RGB depth (RGBD) images of the user. 3D landmarks for the user’s motions are then extracted from the RGBD images with the trained encoder model
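The sketch below illustrates the depth-image generation step described in the second highlight, under stated assumptions: the user region is obtained by differencing the background RGB image and the current frame, and the sparse depth image is masked so that it follows the boundary of the user’s motion. The threshold value and morphology kernel are illustrative assumptions, not parameters from the paper.

```python
import numpy as np
import cv2


def user_mask_from_difference(background_rgb, frame_rgb, threshold=30):
    """Build a binary mask of the user by differencing the background image
    and the current RGB frame (the threshold is an illustrative value)."""
    diff = cv2.absdiff(frame_rgb, background_rgb)
    gray = cv2.cvtColor(diff, cv2.COLOR_BGR2GRAY)
    _, mask = cv2.threshold(gray, threshold, 255, cv2.THRESH_BINARY)
    # Close small holes so the mask covers the whole body region.
    kernel = np.ones((5, 5), np.uint8)
    return cv2.morphologyEx(mask, cv2.MORPH_CLOSE, kernel)


def masked_depth(sparse_depth, mask):
    """Keep depth only inside the user region given by the difference mask,
    so the resulting depth image follows the boundary of the user's motion."""
    return np.where(mask > 0, sparse_depth, 0.0).astype(np.float32)
```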


Summary

Introduction

Three-dimensional (3D) digital environments are provided to take advantage of various platforms such as remotely controlled unmanned aerial vehicles and autonomous vehicles [1,2,3,4,5,6]. This paper proposes a system that automatically extracts 3D landmarks without manual supervision. It uses RGB images and 3D point clouds collected with a vision system and a LiDAR, respectively, to recognize a user’s motions, and it does not depend on the number of points in the 3D point cloud. By utilizing the difference between the background RGB image and the RGB image that captures the user’s motion, a depth image of the user’s motion is generated by correcting the disparities in the 3D point cloud. Based on these depth images, 3D landmarks of the user’s motions are extracted automatically with an improved encoder–decoder model; the 3D landmarks are generated by the trained encoder.
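A minimal sketch of the encoder–decoder idea is given below, under stated assumptions: the encoder maps an RGBD image to a set of 3D landmarks, and the decoder reconstructs the depth image from those landmarks, so the reconstruction loss supplies the training signal without manually annotated landmarks. The number of landmarks, layer sizes, and the 64 x 64 input size are assumptions for illustration, not the authors’ architecture.

```python
import torch
import torch.nn as nn


class LandmarkEncoder(nn.Module):
    """Encoder mapping an RGBD image (4 channels) to N 3D landmarks.
    Layer sizes and the number of landmarks are illustrative assumptions."""
    def __init__(self, num_landmarks=17):
        super().__init__()
        self.num_landmarks = num_landmarks
        self.features = nn.Sequential(
            nn.Conv2d(4, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
        )
        self.head = nn.Linear(128, num_landmarks * 3)

    def forward(self, rgbd):
        z = self.features(rgbd).flatten(1)
        return self.head(z).view(-1, self.num_landmarks, 3)


class DepthDecoder(nn.Module):
    """Decoder reconstructing a 64 x 64 depth image from the predicted
    landmarks; the reconstruction loss makes training unsupervised."""
    def __init__(self, num_landmarks=17):
        super().__init__()
        self.fc = nn.Linear(num_landmarks * 3, 128 * 4 * 4)
        self.deconv = nn.Sequential(
            nn.ConvTranspose2d(128, 64, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(32, 1, 4, stride=4),
        )

    def forward(self, landmarks):
        z = self.fc(landmarks.flatten(1)).view(-1, 128, 4, 4)
        return self.deconv(z)


# Training sketch: only the depth reconstruction drives learning,
# so no manually labelled landmarks are required.
encoder, decoder = LandmarkEncoder(), DepthDecoder()
opt = torch.optim.Adam(list(encoder.parameters()) + list(decoder.parameters()), lr=1e-3)
rgbd = torch.randn(8, 4, 64, 64)       # dummy batch of RGBD crops of the user
target_depth = rgbd[:, 3:4]            # depth channel as the reconstruction target
loss = nn.functional.mse_loss(decoder(encoder(rgbd)), target_depth)
opt.zero_grad(); loss.backward(); opt.step()
```

At inference time only the trained `LandmarkEncoder` is kept, and its output is taken as the 3D landmarks of the user’s motion.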

Landmark Extraction Using Supervised Learning
Landmark Extraction Using Unsupervised Learning
Depth Image Generation Using Up-Sampling
Overview
Encoder–Decoder Model Training for 3D Landmark Extraction Phase
Trained Encoder Model-Based 3D Landmark Extraction Phase
Encoder–Decoder Model Training for 3D Landmark Extraction Phase Results
Findings
Conclusions