Lossy image compression is likely to produce blurred images which leads to erroneous image-level understanding. The human visual system (HVS) is focused on the area of interest present in the image. Motivated by this fact, we propose a compression-decompression algorithm focusing on the important content present in different parts of the image. Using the concept of the Guided Grad-CAM (Gradient-weighted Class Activation Mapping) technique to produce heat maps with the help of a heat map generator trained on ResNet-50, a saliency-guided encoding–decoding algorithm is developed. A wider multi-scale saliency guided convolutional neural network (MsSG-CNN) is designed, in which the notion of convolution with different size filters helps to obtain unique but different features. The feature extraction followed by multilevel fusion of features helps deep neural network (DNN) to capture contextual information and obtain high quality, good resolution, visually pleasing images with fine details. The proposed algorithm is tested on the Kodak benchmark dataset, CLIC 2019 challenging dataset, and FDDB facial images dataset. At low bit rates, the MS-SSIM of the proposed algorithm is found to be superior to JPEG, JPEG2000, BPG, WebP, and Minnen’s approaches with approximately up to 60%, 24.80%, 11.43%, 23.08% & 75% gains respectively, which is quite a significant improvement, when tested on the Kodak dataset. Similarly, at high bit rates, the improvement in MS-SSIM is approximately up to 41.67%, 37.30%, 23.90% 34.21% & 13.33% when compared with JPEG, JPEG2000, BPG, WebP, and Minnen’s approaches respectively. The improvement in PSNR at low and high bit rates is approximately up to 11.32%, 5%, 5.26% and 5.6%, 10.29%, 8.7% as compared to JPEG, Balle’s, and Lee’s algorithms respectively. The PSNR-HVS has been improved by approximately up to 27.27%, 19.15%, and 28.33%, 28% as compared to JPEG and Toderici’s algorithms respectively at low and high bit-rates. A similar type of improvement is obtained with FDDB and CLIC 2019 datasets also, which is discussed in the paper.
Read full abstract