WaveletFormerNet: A Transformer-based wavelet network for real-world non-homogeneous and dense fog removal

Shengli Zhang, Zhiyong Tao, Sen Lin

doi:10.1016/j.imavis.2024.105014

Abstract

Although deep convolutional neural networks have achieved remarkable success in removing synthetic fog, practical systems must also handle images taken under complex real-world conditions, such as dense or non-homogeneous fog. The haze distribution in the real world is complex, and as downsampling reduces the resolution of an image or feature map, it can cause color distortion or loss of detail in the output. Moreover, over-stacking convolutional blocks increases model complexity. Sufficient training data is also hard to obtain, and the resulting overfitting limits a model's generalization ability and hinders its use in real-world scenarios. To address these issues, this paper proposes a Transformer-based wavelet network (WaveletFormerNet) for real-world foggy image recovery. We embed the discrete wavelet transform into the Vision Transformer by proposing the WaveletFormer and IWaveletFormer blocks, which aim to alleviate the texture detail loss and color distortion caused by downsampling. We introduce parallel convolution in the Transformer block, which captures multi-frequency information with a lightweight mechanism. This structure reduces computational cost and improves the effectiveness of the network. Additionally, we implement a feature aggregation module (FAM) to maintain image resolution and enhance the feature extraction capacity of the model. Extensive experiments on real-world fog datasets show that WaveletFormerNet outperforms state-of-the-art methods in both quantitative and qualitative evaluations while keeping model complexity low. Results on real-world dust removal and on downstream application tests further indicate that WaveletFormerNet generalizes better than existing state-of-the-art methods in computer vision-related applications, supporting the effectiveness and robustness of the proposed approach. Our code is available at https://github.com/shengli666666/WaveletFormerNet.
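The abstract's point that a wavelet transform can stand in for ordinary downsampling without discarding detail can be illustrated with a toy example. The sketch below is not taken from the WaveletFormerNet code. It assumes a single-level orthonormal Haar wavelet (the abstract does not say which wavelet the network uses), and the function names `haar_dwt2` and `haar_idwt2` are hypothetical. It halves the resolution of a small grid by splitting it into four sub-bands, then shows that the inverse transform recovers the input exactly.

```python
# Illustrative only: a single-level 2D Haar transform on a plain list-of-lists
# grid. Assumption: Haar wavelet; this is not the authors' implementation.

def haar_dwt2(img):
    """Split an even-sized 2D grid into four half-resolution sub-bands."""
    rows, cols = len(img), len(img[0])
    assert rows % 2 == 0 and cols % 2 == 0, "toy example needs even dimensions"
    approx, horiz, vert, diag = [], [], [], []
    for i in range(0, rows, 2):
        a_row, h_row, v_row, d_row = [], [], [], []
        for j in range(0, cols, 2):
            # The 2x2 block being transformed:  a b
            #                                   c d
            a, b = img[i][j], img[i][j + 1]
            c, d = img[i + 1][j], img[i + 1][j + 1]
            a_row.append((a + b + c + d) / 2)  # scaled block sum (low-pass)
            h_row.append((a - b + c - d) / 2)  # left column minus right column
            v_row.append((a + b - c - d) / 2)  # top row minus bottom row
            d_row.append((a - b - c + d) / 2)  # diagonal difference
        approx.append(a_row)
        horiz.append(h_row)
        vert.append(v_row)
        diag.append(d_row)
    return approx, horiz, vert, diag


def haar_idwt2(approx, horiz, vert, diag):
    """Invert haar_dwt2, rebuilding the full-resolution grid."""
    rows, cols = len(approx), len(approx[0])
    out = [[0.0] * (2 * cols) for _ in range(2 * rows)]
    for i in range(rows):
        for j in range(cols):
            s, h, v, g = approx[i][j], horiz[i][j], vert[i][j], diag[i][j]
            out[2 * i][2 * j] = (s + h + v + g) / 2
            out[2 * i][2 * j + 1] = (s - h + v - g) / 2
            out[2 * i + 1][2 * j] = (s + h - v - g) / 2
            out[2 * i + 1][2 * j + 1] = (s - h - v + g) / 2
    return out


image = [[1, 2, 3, 4],
         [5, 6, 7, 8],
         [9, 10, 11, 12],
         [13, 14, 15, 16]]
bands = haar_dwt2(image)               # four 2x2 sub-bands
restored = haar_idwt2(*bands)          # back to the 4x4 grid
assert restored == image               # nothing was lost
```

Strided convolution or pooling would keep only something like the `approx` band. Keeping the three difference bands alongside it is what makes the resolution reduction invertible, which is the property the abstract appeals to when it pairs the WaveletFormer and IWaveletFormer blocks.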
