WavNet — Visual saliency detection using Discrete Wavelet Convolutional Neural Network

Reshmi Sasibhooshan,Suresh Kumaraswamy,Santhoshkumar Sasidharan

doi:10.1016/j.jvcir.2021.103236

Reshmi Sasibhooshan, Suresh Kumaraswamy + Show 1 more

Open Access

https://doi.org/10.1016/j.jvcir.2021.103236

Copy DOI

Abstract

In the recent advancements in image and video analysis, the detection of salient regions in the image becomes the initial step. This plays a crucial role in deciding the performance of such algorithms. In this work, a Multi-Resolution Feature Extraction (MRFE) technique that makes use of Discrete Wavelet Convolutional Neural Network (DWCNN) for generating features is employed. An Enhanced Feature Extraction (EFE) module extracts additional features from the high level features of the DWCNN, which are used to frame both channel as well as spatial attention models for yielding contextual attention maps. A new hybrid loss function is also proposed, which is a combination of Balanced Cross Entropy (BCE) loss and Edge based Structural Similarity (ESSIM) loss that effectively identifies and segments the salient regions with clear boundaries. The method is tested exhaustively with five different benchmark datasets and is proved superior to the existing state-of-the-art methods with a minimum Mean Absolute error (MAE) of 0.03 and F-measure of 0.956.

Full Text