Learning Compressible 360 ° Video Isomers.

Yu-Chuan Su,Kristen Grauman

doi:10.1109/tpami.2020.2974472

Abstract

Standard video encoders developed for conventional narrow field-of-view video are widely applied to 360° video as well, with reasonable results. However, while this approach commits arbitrarily to a projection of the spherical frames, we observe that some orientations of a 360° video, once projected, are more compressible than others. We introduce an approach to predict the sphere rotation that will yield the maximal compression rate. Given video clips in their original encoding, a convolutional neural network learns the association between a clip's visual content and its compressibility at different rotations of a cubemap projection. Given a novel video, our learning-based approach efficiently infers the most compressible direction in one shot, without repeated rendering and compression of the source video. We validate our idea on thousands of video clips and multiple popular video codecs. The results show that this untapped dimension of 360° compression has substantial potential-"good" rotations are typically 8-18 percent more compressible than bad ones, and our learning approach can predict them reliably 78 percent of the time.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence	Publication Date: Jan 1, 2020
Citations: 2	License type: publisher-specific-oa

R Discovery Prime

R Discovery Prime

Learning Compressible 360 ° Video Isomers.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence

Lead the way for us

Similar Papers

Learning Compressible 360° Video Isomers
Yu-Chuan Su ... Kristen Grauman
-
Yu-Chuan Su, et. al.Yu-Chuan Su ... Kristen Grauman
01 Jun 2018
01 Jun 2018

Supporting physiology learning: the development of interactive concept-based video clips
Richard Guy ... Bruce Byrne
Advances in Physiology Education | VOL. 38
Richard Guy, et. al.Richard Guy ... Bruce Byrne
01 Mar 2014
Advances in Physiology Education | VOL. 38

Predicting brand confusion in imagery markets based on deep learning of visual advertisement content
Atsuho Nakayama ... Daniel Baier
Advances in Data Analysis and Classification | VOL. 14
Atsuho Nakayama, et. al.Atsuho Nakayama ... Daniel Baier
19 Nov 2020
Advances in Data Analysis and Classification | VOL. 14

Probabilistic Skimlets Fusion for Summarizing Multiple Consumer Landmark Videos
Luming Zhang ... Richang Hong
IEEE Transactions on Multimedia | VOL. 17
Luming Zhang, et. al.Luming Zhang ... Richang Hong
01 Jan 2015
IEEE Transactions on Multimedia | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning Compressible 360 ° Video Isomers.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence