Abstract

Because of the capacity of capturing both the spatial and angular information of the light rays simultaneously, light field images (LFIs) contain richer scene information compared with conventional images, but at the cost of huge volume. This paper proposes a novel LFI sparse compression framework driven by convolutional neural network (CNN). The epipolar plane image (EPI) super-resolution is for compensating the information loss caused by sparse sampling and the decoder-side sub-aperture images (SAIs) quality enhancement is for compensating the information loss caused by lossy compression. Specifically, we choose those SAIs both in odd rows and odd columns as our key SAIs and compress them using standard video encoder. For those non-key SAIs, we predict them using decompressed key SAIs by taking advantage of the special structure of EPI. The low-resolution EPIs generated from the sparse SAIs are super-resolved by a CNN and the outputs, high-resolution EPIs, are used to rebuild the dense SAIs. Moreover, in order to improve the quality of the predicted SAI, we add decoder-side quality enhancement before prediction. We propose a multi-scale dense residual network (MSDRN) to implement both EPI super-resolution and quality enhancement. Transfer learning strategies are used to improve the training performance of quality enhancement. The experimental results show the superior performance of the proposed framework over existing methods in terms of rate-distortion performance.

Highlights

  • Light Field (LF) is becoming the most recent research focus due to its extraordinary advantages in terms of free viewpoint visualization

  • DATASET To verify the effectiveness of our proposed light field images (LFIs) compression framework and show the superiority of the designed network architecture, some LFIs are tested as the objects

  • The specific 12 LFIs are chosen because most LF compression methods, such as [20], [39], [40], take them as test images and it is for the convenience of comparison

Read more

Summary

Introduction

Light Field (LF) is becoming the most recent research focus due to its extraordinary advantages in terms of free viewpoint visualization It is regarded as a promising technique for the future 3D TV, because different from conventional stereoscopic technologies, LF is able to provide high quality 3D images with full parallax and continuous viewing angle without any need of additional devices such as dedicated glasses. The 7D function is denoted as L (x, y, z, θ, ∅, γ , t), where (x, y, z) denotes spatial coordinate, (θ, ∅) denotes emitting angle, γ denotes wavelength and t denotes time. Such high dimensional data are difficult to record and handle in practice. An oriented light ray defined in the system travels and intersects the

Objectives
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call