Joint learning of frequency and spatial domains for dense image prediction

Shaocheng Jia,Wei Yao

doi:10.1016/j.isprsjprs.2022.11.001

Abstract

Current artificial neural networks mainly conduct the learning process in the spatial domain but neglect the frequency domain learning. However, the learning course performed in the frequency domain can be more efficient than that in the spatial domain. In this paper, we fully explore frequency domain learning and propose a joint learning paradigm of frequency and spatial domains. This paradigm can take full advantage of the combined preponderances of frequency learning and spatial learning; specifically, frequency and spatial domain learning can effectively capture intrinsic global and local information, respectively. To achieve this, an innovative but effective linear learning block is proposed to conduct the learning process directly in the frequency domain. Together with the prevailing spatial learning operation, i.e., convolution, a powerful and scalable joint learning framework is further proposed. Exhaustive experiments on the diverse Benchmark datasets — KITTI, Make3D, and Cityscapes demonstrate the effectiveness and superiority of the proposed joint learning paradigm in dense image prediction tasks, including self-supervised depth estimation, ego-motion estimation, and semantic segmentation. In particular, the proposed model can achieve performance competitive to those of state-of-the-art methods in all three tasks, even without pretraining. Moreover, the proposed model reduces the number of parameters by over 78% for self-supervised depth estimation on the KITTI dataset while retaining the time complexity on par with other state-of-the-art methods; this provides a great chance to develop real-world applications. We hope that the proposed method can encourage more research in cross-domain learning. The codes are publicly available at https://github.com/shaochengJia/FSLNet.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Joint learning of frequency and spatial domains for dense image prediction

Abstract

Talk to us

Similar Papers

More From: ISPRS Journal of Photogrammetry and Remote Sensing

Lead the way for us

Journal: ISPRS Journal of Photogrammetry and Remote Sensing	Publication Date: Nov 16, 2022
Citations: 7

Similar Papers

Invisible watermarking schemes in spatial and frequency domains
Saba Riaz ... M Younus Javed
-
Saba Riaz, et. al.Saba Riaz ... M Younus Javed
01 Oct 2008
01 Oct 2008

Benchmarking Image Processing Algorithms for Unmanned Aerial System-Assisted Crack Detection in Concrete Structures
Sattar Dorafshan ... Robert J Thomas
Infrastructures | VOL. 4
Sattar Dorafshan, et. al.Sattar Dorafshan ... Robert J Thomas
30 Apr 2019
Infrastructures | VOL. 4

Combined analysis of tunable phase mask within spatial and frequency domain
Zhou Liang ... Liu Zhao-Hui
Acta Physica Sinica | VOL. 64
Zhou Liang, et. al. Zhou Liang ... Liu Zhao-Hui
01 Jan 2015
Acta Physica Sinica | VOL. 64

Secure Optical Image Communication Using Double Random Transformation and Memristive Chaos
Heping Wen ... Jiahao Wu
IEEE Photonics Journal | VOL. 15
Heping Wen, et. al.Heping Wen ... Jiahao Wu
01 Feb 2023
IEEE Photonics Journal | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Joint learning of frequency and spatial domains for dense image prediction

Abstract

Talk to us

Similar Papers

More From: ISPRS Journal of Photogrammetry and Remote Sensing