Amalgamating Multi-Task Models with Heterogeneous Architectures

Jidapa Thadajarassiri, Walter Gerych, Elke Rundensteiner, Xiangnan Kong

doi:10.1609/aaai.v38i14.29459

Abstract

Multi-task learning (MTL) is essential for real-world applications that handle multiple tasks simultaneously, such as self-driving cars. MTL methods improve performance on all tasks by using information across tasks to learn a robust shared representation. However, acquiring sufficient labeled data tends to be extremely expensive, especially when many tasks must be supported. Recently, Knowledge Amalgamation (KA) has emerged as an effective strategy for addressing the lack of labels by learning directly from pretrained models (teachers) instead. KA learns one unified multi-task student that masters all tasks across all teachers. Existing KA methods for MTL are limited to teachers with identical architectures and therefore rely on layer-to-layer approaches. In practice, however, teachers may have heterogeneous architectures: their layers may not be aligned, and their dimensionalities or scales may be incompatible. Amalgamating multi-task teachers with heterogeneous architectures remains an open problem. To address it, we design the Versatile Common Feature Consolidator (VENUS), the first solution to this problem. VENUS fuses knowledge from the shared representations of each teacher into one unified, generalized representation for all tasks. Specifically, we design the Feature Consolidator network, which leverages an array of teacher-specific trainable adaptors. These adaptors enable the student to learn from multiple teachers even if their learned representations are incompatible. We demonstrate that VENUS outperforms five alternative methods on numerous benchmark datasets across a broad spectrum of experiments.
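The idea of teacher-specific trainable adaptors can be illustrated with a small NumPy sketch. This is not the VENUS implementation: the abstract does not specify the adaptor form, the mapping direction, or the loss, so the linear adaptors, the student-to-teacher projection, the mean-squared-error objective, and all names below (`LinearAdaptor`, `consolidation_loss`, `adaptor_step`) are assumptions chosen only to show how one student representation might be compared against teachers whose feature dimensions differ.

```python
import numpy as np

rng = np.random.default_rng(0)


class LinearAdaptor:
    """Hypothetical teacher-specific adaptor (an assumption, not the paper's design).

    Projects student features into one teacher's feature space so the two
    can be compared even when their dimensionalities differ.
    """

    def __init__(self, student_dim, teacher_dim):
        self.W = rng.normal(scale=0.1, size=(student_dim, teacher_dim))
        self.b = np.zeros(teacher_dim)

    def __call__(self, h):
        return h @ self.W + self.b


def consolidation_loss(student_feats, teacher_feats, adaptors):
    """Sum over teachers of the MSE between adapted student features
    and that teacher's shared representation (assumed objective)."""
    total = 0.0
    for t_feats, adaptor in zip(teacher_feats, adaptors):
        total += np.mean((adaptor(student_feats) - t_feats) ** 2)
    return total


def adaptor_step(adaptor, student_feats, t_feats, lr=0.1):
    """One gradient-descent step on a single adaptor's MSE term.

    The student features are held fixed here; a full training loop would
    also update the student network that produces them.
    """
    err = adaptor(student_feats) - t_feats      # shape: (batch, teacher_dim)
    scale = 2.0 / err.size                      # derivative of the mean of squares
    adaptor.W -= lr * scale * (student_feats.T @ err)
    adaptor.b -= lr * scale * err.sum(axis=0)


# Toy setup: one student representation, two teachers with different widths.
batch, student_dim = 8, 32
teacher_dims = [64, 128]                        # heterogeneous teachers
student_feats = rng.normal(size=(batch, student_dim))
teacher_feats = [rng.normal(size=(batch, d)) for d in teacher_dims]
adaptors = [LinearAdaptor(student_dim, d) for d in teacher_dims]

loss = consolidation_loss(student_feats, teacher_feats, adaptors)
for adaptor, t_feats in zip(adaptors, teacher_feats):
    adaptor_step(adaptor, student_feats, t_feats)
loss_after = consolidation_loss(student_feats, teacher_feats, adaptors)
```

Because each teacher gets its own adaptor, no layer-to-layer alignment between architectures is needed in this sketch: only the shapes of the shared representations have to be known.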

Published in: Proceedings of the AAAI Conference on Artificial Intelligence
