A ViT-AMC Network With Adaptive Model Fusion and Multiobjective Optimization for Interpretable Laryngeal Tumor Grading From Histopathological Images.

Pan Huang, Peng He, Jing Qin, Antonella Santone, Mingrui Ma, Francesco Mercaldo, Peng Feng, Hualiang Xiao, Sukun Tian

doi:10.1109/tmi.2022.3202248

Abstract

Tumor grading of laryngeal cancer pathological images needs to be both accurate and interpretable. Deep learning models based on the attention mechanism-integrated convolution (AMC) block have a good inductive bias but poor interpretability, whereas models based on the vision transformer (ViT) block have good interpretability but a weak inductive bias. Therefore, we propose an end-to-end ViT-AMC network (ViT-AMCNet) with adaptive model fusion and multiobjective optimization that integrates and fuses the ViT and AMC blocks. However, existing model fusion methods often suffer from negative fusion, for two reasons: (1) there is no guarantee that the ViT and AMC blocks will simultaneously have good feature representation capability; and (2) the feature representations learned by the ViT and AMC blocks do not differ markedly, so the two representations contain much redundant information. Accordingly, we first prove the feasibility of fusing the ViT and AMC blocks based on Hoeffding's inequality. Then, we propose a multiobjective optimization method to address the problem that the ViT and AMC blocks cannot simultaneously have good feature representations. Finally, we propose an adaptive model fusion method that integrates the metrics block and the fusion block to increase the differences between feature representations and improve the deredundancy capability. Our methods improve the fusion ability of ViT-AMCNet, and experimental results demonstrate that ViT-AMCNet significantly outperforms state-of-the-art methods. Importantly, the visualized interpretive maps are closer to the regions of interest that concern pathologists, and the generalization ability is also excellent. Our code is publicly available at https://github.com/Baron-Huang/ViT-AMCNet.
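The abstract does not reproduce the Hoeffding-based feasibility argument. As a rough illustration only, the sketch below uses the textbook majority-vote setting, which may differ from the authors' derivation: if T classifiers err independently, each with error rate ε < 1/2, Hoeffding's inequality bounds the majority-vote error by exp(-T(1 - 2ε)²/2). The function names `majority_vote_error` and `hoeffding_bound`, the independence assumption, and the choice to count ties as errors are assumptions made for this example and do not come from the paper or its repository.

```python
import math


def majority_vote_error(num_models: int, error_rate: float) -> float:
    """Exact probability that at most half of the models are correct.

    Assumes (hypothetically) that the models err independently with the
    same error rate; a tie is counted as an error.
    """
    p_correct = 1.0 - error_rate
    return sum(
        math.comb(num_models, k) * p_correct**k * error_rate ** (num_models - k)
        for k in range(num_models // 2 + 1)
    )


def hoeffding_bound(num_models: int, error_rate: float) -> float:
    """Hoeffding upper bound exp(-T * (1 - 2*eps)^2 / 2), valid for eps < 0.5."""
    return math.exp(-0.5 * num_models * (1.0 - 2.0 * error_rate) ** 2)


if __name__ == "__main__":
    # Compare the exact error with the bound for a few ensemble sizes.
    for t in (2, 5, 21):
        exact = majority_vote_error(t, 0.3)
        bound = hoeffding_bound(t, 0.3)
        print(f"T={t:2d}  exact={exact:.4f}  bound={bound:.4f}")
```

The bound shrinks as more weakly accurate, independent models are combined. This is the general intuition behind fusing two complementary branches, although real branches trained on the same images are unlikely to be fully independent.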

Journal: IEEE Transactions on Medical Imaging
Publication Date: Jan 1, 2023
Citations: 44
