Generating Bird’s Eye View from Egocentric RGB Videos

Vanita Jain, Shivam Grover, Gopal Chaudhary, Qiming Wu, Qiaozhi Hua, Kshitij Sidana, San Hlaing Myint

doi:10.1155/2021/7479473

Abstract

In this paper, we present a method for generating bird’s eye video from egocentric RGB videos. Working with egocentric views is tricky since such a view is highly warped and prone to occlusions. A bird’s eye view, on the other hand, has consistent scaling in at least the two dimensions it shows. Moreover, most state-of-the-art systems for tasks such as path prediction are built for bird’s eye views of the subjects. We present a deep learning-based approach that translates egocentric RGB images captured by a car’s dashcam into a bird’s eye view. This is a view translation task, and we perform two experiments: the first uses an image-to-image translation method, and the second uses video-to-video translation. We compare our results with homographic transformation: our SSIM values are better by a margin of 77% and 14.4%, and our RMSE errors are lower by 40% and 14.6%, for image-to-image translation and video-to-video translation, respectively. We also show visually the efficacy and limitations of each method, with insights for future research. Compared to previous works that use homography, or LIDAR for 3D point clouds, our work is more generalizable and does not require any expensive equipment.
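To make the comparison in the abstract concrete, the following is a minimal sketch of the two ingredients it names: a homographic transformation applied to pixel coordinates, and the RMSE between two images. It is not the authors' code. The function names and the matrix `H_EXAMPLE` are hypothetical and chosen only for illustration; the matrix is not calibrated to any camera. SSIM is omitted because it involves local windowed statistics that do not fit a short example.

```python
import math


def apply_homography(H, x, y):
    """Map pixel (x, y) through a 3x3 homography H given as nested lists."""
    xh = H[0][0] * x + H[0][1] * y + H[0][2]
    yh = H[1][0] * x + H[1][1] * y + H[1][2]
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    if w == 0:
        raise ValueError("point maps to infinity under H")
    # Divide by the homogeneous coordinate to get back to pixel coordinates.
    return xh / w, yh / w


def rmse(img_a, img_b):
    """Root-mean-square error between two equally sized grayscale images.

    Images are lists of rows of numeric pixel values.
    """
    same_rows = len(img_a) == len(img_b)
    if not same_rows or any(len(ra) != len(rb) for ra, rb in zip(img_a, img_b)):
        raise ValueError("images must have the same shape")
    squared = [(a - b) ** 2
               for ra, rb in zip(img_a, img_b)
               for a, b in zip(ra, rb)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical homography: illustrative values only (assumption).
# The nonzero bottom-row entry makes the scale depend on the image row,
# which is what lets a homography undo perspective on a flat ground plane.
H_EXAMPLE = [[1.0, 0.0, 0.0],
             [0.0, 1.0, 0.0],
             [0.0, 0.002, 1.0]]

if __name__ == "__main__":
    print(apply_homography(H_EXAMPLE, 100.0, 500.0))
    print(rmse([[0, 0], [0, 0]], [[3, 4], [0, 0]]))
```

A homography of this kind is exact only for points on a single plane, such as a flat road surface, which is one reason a learned view translation may generalize better to scenes with vehicles and other raised objects.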

Highlights

Egocentric videos, commonly referred to as first-person videos, are captured from the POV of a subject.
We present an end-to-end method for translating egocentric views from RGB cameras, such as those installed on vehicles, into bird’s eye views of the environment around the subject vehicle.
One of the biggest hurdles is that egocentric views are highly distorted by perspective, whereas a bird’s eye view has consistent scaling.

Summary

Introduction

Egocentric videos, commonly referred to as first-person videos, are captured from the POV of a subject (in our case, from the POV of an autonomous vehicle). Egocentric videos are easy to capture and are accessible to the vehicle in real time. However, they are deceptively hard for a computer to comprehend and work with. This is because egocentric videos are prone to occlusions, and there is a significant warping effect due to perspective, which causes objects closer to the camera to look inflated. Another drawback of the egocentric view is the nonlinear nature of objects in motion. With advancements in self-driving autonomous vehicle technology, it becomes important to devise a way to overcome the shortcomings of the egocentric perspective and put its accessibility to use [1,2,3].
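The perspective warping described above can be illustrated with an ideal pinhole camera, under which apparent size falls off inversely with distance, while a bird’s eye view keeps a fixed scale. This is a minimal sketch under that assumption and is not taken from the paper; the function names, focal length, object width, and metres-per-pixel value are all hypothetical.

```python
def projected_width_px(real_width_m, depth_m, focal_px):
    """Apparent width in pixels of an object seen by an ideal pinhole camera."""
    if depth_m <= 0:
        raise ValueError("depth must be positive")
    return focal_px * real_width_m / depth_m


def bev_width_px(real_width_m, metres_per_pixel):
    """Width in pixels of the same object in a top-down view with fixed scale."""
    if metres_per_pixel <= 0:
        raise ValueError("scale must be positive")
    return real_width_m / metres_per_pixel


FOCAL_PX = 1000.0        # hypothetical focal length, in pixels
CAR_WIDTH_M = 2.0        # hypothetical object width, in metres
METRES_PER_PIXEL = 0.05  # hypothetical bird's eye view scale

if __name__ == "__main__":
    for depth_m in (5.0, 10.0, 40.0):
        ego = projected_width_px(CAR_WIDTH_M, depth_m, FOCAL_PX)
        bev = bev_width_px(CAR_WIDTH_M, METRES_PER_PIXEL)
        print(f"depth {depth_m:5.1f} m: egocentric {ego:6.1f} px, bird's eye {bev:5.1f} px")
```

In this idealized model, the same object spans eight times as many pixels at 5 m as at 40 m in the egocentric view, while its width in the bird’s eye view does not depend on distance.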

Journal: Wireless Communications and Mobile Computing
Publication Date: Jan 1, 2021
Citations: 2
License type: CC BY 4.0
