Learning Conflict-Noticed Architecture for Multi-Task Learning

Zhixiong Yue,Yu Zhang,Jie Liang

doi:10.1609/aaai.v37i9.26312

Abstract

Multi-task learning has been widely used in many applications to enable more efficient learning by sharing part of the architecture across multiple tasks. However, a major challenge is the gradient conflict when optimizing the shared parameters, where the gradients of different tasks could have opposite directions. Directly averaging those gradients will impair the performance of some tasks and cause negative transfer. Different from most existing works that manipulate gradients to mitigate the gradient conflict, in this paper, we address this problem from the perspective of architecture learning and propose a Conflict-Noticed Architecture Learning (CoNAL) method to alleviate the gradient conflict by learning architectures. By introducing purely-specific modules specific to each task in the search space, the CoNAL method can automatically learn when to switch to purely-specific modules in the tree-structured network architectures when the gradient conflict occurs. To handle multi-task problems with a large number of tasks, we propose a progressive extension of the CoNAL method. Extensive experiments on computer vision, natural language processing, and reinforcement learning benchmarks demonstrate the effectiveness of the proposed methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning Conflict-Noticed Architecture for Multi-Task Learning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Jun 26, 2023
Citations: 2

Similar Papers

An overview of multi-task learning
Yu Zhang ... Qiang Yang
National Science Review | VOL. 5
Yu Zhang, et. al.Yu Zhang ... Qiang Yang
01 Sep 2017
National Science Review | VOL. 5

TaskFusion: An Efficient Transfer Learning Architecture with Dual Delta Sparsity for Multi-Task Natural Language Processing
Zichen Fan ... Hun-Seok Kim
-
Zichen Fan, et. al.Zichen Fan ... Hun-Seok Kim
17 Jun 2023
17 Jun 2023

Multitask Policy Adversarial Learning for Human-Level Control With Large State Spaces
Jun Ping Wang ... Ian Thomas
IEEE Transactions on Industrial Informatics | VOL. 15
Jun Ping Wang, et. al.Jun Ping Wang ... Ian Thomas
01 Apr 2019
IEEE Transactions on Industrial Informatics | VOL. 15

Multi-task Semi-supervised Semantic Feature Learning for Classification
Changying Du ... Fuzhen Zhuang
-
Changying Du, et. al.Changying Du ... Fuzhen Zhuang
01 Dec 2012
01 Dec 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning Conflict-Noticed Architecture for Multi-Task Learning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence