DeMT: Deformable Mixer Transformer for Multi-Task Learning of Dense Prediction

Yangyang Xu,Yibo Yang,Lefei Zhang

doi:10.1609/aaai.v37i3.25411

Abstract

Convolution neural networks (CNNs) and Transformers have their own advantages and both have been widely used for dense prediction in multi-task learning (MTL). Most of the current studies on MTL solely rely on CNN or Transformer. In this work, we present a novel MTL model by combining both merits of deformable CNN and query-based Transformer for multi-task learning of dense prediction. Our method, named DeMT, is based on a simple and effective encoder-decoder architecture (i.e., deformable mixer encoder and task-aware transformer decoder). First, the deformable mixer encoder contains two types of operators: the channel-aware mixing operator leveraged to allow communication among different channels (i.e., efficient channel location mixing), and the spatial-aware deformable operator with deformable convolution applied to efficiently sample more informative spatial locations (i.e., deformed features). Second, the task-aware transformer decoder consists of the task interaction block and task query block. The former is applied to capture task interaction features via self-attention. The latter leverages the deformed features and task-interacted features to generate the corresponding task-specific feature through a query-based Transformer for corresponding task predictions. Extensive experiments on two dense image prediction datasets, NYUD-v2 and PASCAL-Context, demonstrate that our model uses fewer GFLOPs and significantly outperforms current Transformer- and CNN-based competitive models on a variety of metrics. The code is available at https://github.com/yangyangxu0/DeMT.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DeMT: Deformable Mixer Transformer for Multi-Task Learning of Dense Prediction

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence	Publication Date: Jun 26, 2023
Citations: 7

Similar Papers

A Novel Multi-Task Learning Model with PSAE Network for Simultaneous Estimation of Surface Quality and Tool Wear in Milling of Nickel-Based Superalloy Haynes 230.
Minghui Cheng ... Jie Sun
Sensors (Basel, Switzerland) | VOL. 22
Minghui Cheng, et. al.Minghui Cheng ... Jie Sun
30 Jun 2022
Sensors (Basel, Switzerland) | VOL. 22

An overview of multi-task learning
Yu Zhang ... Qiang Yang
National Science Review | VOL. 5
Yu Zhang, et. al.Yu Zhang ... Qiang Yang
01 Sep 2017
National Science Review | VOL. 5

Multi-task Neural Networks Convolutional Learning Model for Maize Disease Identification
Diane Niyomwungere ... Waweru Mwangi
-
Diane Niyomwungere, et. al.Diane Niyomwungere ... Waweru Mwangi
16 May 2022
16 May 2022

Multi-population genomic prediction using a multi-task Bayesian learning model.
Liuhong Chen ... Flavio Schenkel
BMC genetics | VOL. 15
Liuhong Chen, et. al.Liuhong Chen ... Flavio Schenkel
01 Jan 2014
BMC genetics | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DeMT: Deformable Mixer Transformer for Multi-Task Learning of Dense Prediction

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence