Abstract

We endeavor on a rarely explored task named thermal infrared video denoising. Perception in the thermal infrared significantly enhances the capabilities of machine vision. Nonetheless, noise in imaging systems is one of the factors that hampers the large-scale application of equipment. Existing thermal infrared denoising methods, primarily focusing on the image level, inadequately utilize time-domain information and insufficiently conduct investigation of system-level mixed noise, presenting the inferior ability in the video-recorded era; while video denoising methods, commonly applied to RGB cameras, exhibit uncertain effectiveness owing to substantial dissimilarities in the noise models and modalities between RGB and thermal infrared images. In sight of this, we initially revisit the imaging mechanism, while concurrently introducing a physics-inspired noise generator based on the sources and characteristics of system noise. Subsequently, a thermal infrared video denoising dataset consisting of 518 real-world videos is constructed. Lastly, we propose a denoising model called multi-domain infrared video denoising network, capable of concentrating features from the time, space, and frequency domains to restore high-fidelity videos. Extensive experiments demonstrate that the proposed method achieves state-of-the-art denoising quality and can be successfully applied to commercial cameras and downstream vision tasks, providing a new avenue for clear videography in the thermal infrared world. The dataset and code will be available.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call