Abstract

Joint photographic experts group (JPEG) compression is lossy compression, and degradation of image quality worsens at high compression ratios. Therefore, a reconstruction process is required for a visually pleasant image. In this paper, we propose an end-to-end deep learning architecture for restoring JPEG images with high compression ratios. The proposed architecture changes a core principle of the squeeze and excitation network for low-level vision tasks where pixel-level accuracy is important. Instead of extracting global features, our network extracts locally embedded features and fine-tunes each feature value by using depthwise convolution. To reduce the computational complexity and parameters with large receptive fields, we use a combination of the recursive structure and feature map down- and up-scaling processes. We also propose a compact version of the proposed model by decreasing the number of filters and simplifying the network, which has about one-twentieth of the parameters of the baseline model. Experimental results reveal that our network outperforms conventional networks quantitatively, and the restored images are clear with sharp edges and smooth blocking boundaries. Furthermore, the compact model shows higher objective results while maintaining a low number of parameters. In addition, at a high compression ratio, the overall information, including details in the blocks, are lost owing to high quantization errors. We apply a generative adversarial network structure to restore these highly damaged blocks, and the results reveal that the image produced has details similar to those of the ground truth.

Highlights

  • Joint photographic experts group (JPEG) compression is a popular standard for still image compression, and it is a lossy compression technique due to the quantization of the discrete cosine transformation (DCT) coefficients

  • Compared with ResBlock, the proposed block has a 2% increase in the number of parameters and a 0.03 dB increase in the peak signal-to-noise ratio (PSNR). These results indicate that our local excitation block is more effective in removing JPEG artifacts than are other existing blocks developed in high-level vision tasks

  • We have proposed a baseline model (LEJR) that maximizes objective performances, a compact model (LEJR_compact) that greatly reduces the number of parameters, and successfully applied a generative adversarial network (GAN)-based model (LEJR_GAN) that gives restored images which are more realistic

Read more

Summary

INTRODUCTION

JPEG compression is a popular standard for still image compression, and it is a lossy compression technique due to the quantization of the discrete cosine transformation (DCT) coefficients. The DnCNN [21] used global residual learning (GRL) and successfully trained a deeper network with a depth of 20 to enhance performance, and MemNet [15] adopts a recursive unit that reuses the weights of the convolution layer several times and a densely-connected block structure, thereby improving the performance while maintaining a few parameters These methods train the network using a loss function based on the l2 norm, which yeilds a high peak signal-to-noise ratio (PSNR); the resulting image tends to be blurred. Application of the GAN structure to remove JPEG artifacts to restore more realistic images at a high compression ratio

RELATED WORKS
EXPERIMENTAL RESULTS
IMPLEMENTATION DETAILS
CONCLUSION
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call