Deep M2CDL: Deep Multi-Scale Multi-Modal Convolutional Dictionary Learning Network.

Xin Deng, Mai Xu, Fangyuan Gao, Jingyi Xu, Xiancheng Sun

doi:10.1109/tpami.2023.3334624

Abstract

In multi-modal image processing, network interpretability is essential because of the complicated dependencies across modalities. A promising recent direction for building interpretable networks is to incorporate dictionary learning into deep learning through an unfolding strategy. However, existing multi-modal dictionary learning models are both single-layer and single-scale, which restricts their representation ability. In this paper, we first introduce a multi-scale multi-modal convolutional dictionary learning (M2CDL) model, which operates in a multi-layer fashion to associate different image modalities in a coarse-to-fine manner. We then propose a unified framework, Deep M2CDL, derived from the M2CDL model, for both multi-modal image restoration (MIR) and multi-modal image fusion (MIF) tasks. The network architecture of Deep M2CDL fully matches the optimization steps of the M2CDL model, so each network module has a clear interpretation. Unlike handcrafted priors, both the dictionary and the sparse feature priors are learned by the network. Deep M2CDL is evaluated on a wide variety of MIR and MIF tasks, where it outperforms many state-of-the-art methods both quantitatively and qualitatively. In addition, we visualize the multi-modal sparse features and dictionary filters learned by the network, which demonstrates the good interpretability of Deep M2CDL.
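As a rough illustration of the unfolding strategy mentioned in the abstract, the sketch below unrolls a fixed number of ISTA iterations for plain sparse coding, with each iteration playing the role of one network layer. It is not the M2CDL model or the Deep M2CDL architecture: it assumes a single modality, a single scale, and a dense (non-convolutional) dictionary, and the names `unfolded_ista`, `soft_threshold`, `lam`, and `num_layers` are hypothetical. In a learned network of this kind, the dictionary, step size, and thresholds would typically become trainable parameters rather than fixed values.

```python
import numpy as np

def soft_threshold(v, theta):
    # Proximal operator of the L1 norm: shrinks each entry toward zero by theta.
    return np.sign(v) * np.maximum(np.abs(v) - theta, 0.0)

def unfolded_ista(y, D, lam=0.1, num_layers=50):
    # Each loop iteration corresponds to one "layer" of the unrolled network.
    # Assumption: D is a fixed dense dictionary; a learned variant would
    # train D (and the thresholds) per layer instead.
    L = np.linalg.norm(D, 2) ** 2  # Lipschitz constant of the data-term gradient
    x = np.zeros(D.shape[1])
    for _ in range(num_layers):
        residual = y - D @ x
        x = soft_threshold(x + (D.T @ residual) / L, lam / L)
    return x

def lasso_objective(y, D, x, lam):
    # Sparse coding objective: 0.5 * ||y - D x||^2 + lam * ||x||_1
    return 0.5 * np.sum((y - D @ x) ** 2) + lam * np.sum(np.abs(x))

# Toy problem: a signal built from three dictionary atoms.
rng = np.random.default_rng(0)
D = rng.standard_normal((20, 50))
D /= np.linalg.norm(D, axis=0)  # unit-norm atoms
x_true = np.zeros(50)
x_true[[3, 17, 41]] = [1.5, -2.0, 1.0]
y = D @ x_true

x_hat = unfolded_ista(y, D, lam=0.05, num_layers=200)
```

Because every layer is one step of a known optimization algorithm, each intermediate `x` can be read as the current sparse code estimate, which is the sense in which unfolded networks are usually called interpretable.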


Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence
Publication Date: May 1, 2024
Citations: 1

Similar Papers

Multimodal medical image fusion towards future research: A review
Sajid Ullah Khan ... Muhammad Javed
Journal of King Saud University - Computer and Information Sciences | VOL. 35
29 Aug 2023

A review on multimodal medical image fusion: Compendious analysis of medical modalities, multimodal databases, fusion techniques and quality metrics
Muhammad Adeel Azam ... Amir H Gandomi
Computers in Biology and Medicine | VOL. 144
03 Feb 2022

Person Re-Identification by Cross-View Multi-Level Dictionary Learning.
Sheng Li ... Yun Fu
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 40
26 Oct 2017

Designing CNNs for Multimodal Image Restoration and Fusion via Unfolding the Method of Multipliers
Iman Marivani ... Nikos Deligiannis
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 32
01 Sep 2022
