Abstract

With the success of deep learning in a wide variety of areas, many deep multi-task learning (MTL) models have been proposed, claiming performance improvements obtained by sharing learned structure across several related tasks. However, the dynamics of multi-task learning in deep neural networks are still not well understood at either the theoretical or the experimental level. In particular, the usefulness of different task pairs is not known a priori. In practice, this means that properly combining the losses of different tasks becomes a critical issue in multi-task learning, as different methods may yield different results. In this paper, we benchmark different multi-task learning approaches that use a shared-trunk architecture with task-specific branches across three MTL datasets. On the first dataset, Multi-MNIST (Modified National Institute of Standards and Technology database), we thoroughly test several weighting strategies, including simply adding the task-specific cost functions together, dynamic weight average (DWA), and uncertainty weighting, each with varying amounts of training data per task. We find that multi-task learning typically does not improve performance for a user-defined combination of tasks. Further experiments on diverse tasks, network architectures, and datasets suggest that multi-task learning requires careful selection of both task pairs and weighting strategies to equal or exceed the performance of single-task learning.
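The abstract refers to a shared-trunk architecture with task-specific branches. Below is a minimal PyTorch sketch of such a model for a two-digit Multi-MNIST setup; the layer sizes and the `SharedTrunkMTL` name are illustrative assumptions, not the paper's exact configuration.

```python
# Minimal sketch (not the authors' exact model) of a shared-trunk network with
# two task-specific classification heads, as commonly used for Multi-MNIST,
# where each image contains two digits and each head predicts one of them.
import torch
import torch.nn as nn

class SharedTrunkMTL(nn.Module):
    def __init__(self, num_classes_per_task=10):
        super().__init__()
        # Shared trunk: parameters reused by every task.
        self.trunk = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Flatten(),
        )
        feat_dim = 64 * 7 * 7  # 28x28 input after two 2x2 poolings
        # Task-specific branches: one classifier head per task.
        self.head_left = nn.Linear(feat_dim, num_classes_per_task)
        self.head_right = nn.Linear(feat_dim, num_classes_per_task)

    def forward(self, x):
        feats = self.trunk(x)
        return self.head_left(feats), self.head_right(feats)
```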

Highlights

  • The goal of multi-task learning (MTL) is to learn multiple different yet related tasks simultaneously [1]

  • Our contributions are as follows: a) To the best of our knowledge, this is the first meta-analysis that extensively compares different weighting approaches for combining multiple loss functions in the context of both heterogeneous and homogeneous MTL. b) The key observation is that MTL approaches which enforce shared data representations can be more effective when training samples are scarce. c) We find that many results obtained with an arbitrarily chosen set of tasks, which we refer to as user-defined tasks, may not achieve performance gains over single-task learning (STL), which calls for more rigorous theoretical analysis from the deep learning community

  • Experiments: We developed a basic experiment on the aforementioned Multi-MNIST dataset to demonstrate the effect of MTL (a sketch of one training step is shown after this list)
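
The experiment highlight above uses a two-head Multi-MNIST setup; a minimal training-step sketch under the simplest strategy of adding the task-specific losses together might look like the following. The `train_step` helper and its argument names are hypothetical.

```python
# Hypothetical training step: equal weighting simply sums the per-task losses.
import torch
import torch.nn.functional as F

def train_step(model, optimizer, images, labels_left, labels_right):
    optimizer.zero_grad()
    logits_left, logits_right = model(images)          # shared trunk, two heads
    loss_left = F.cross_entropy(logits_left, labels_left)
    loss_right = F.cross_entropy(logits_right, labels_right)
    total_loss = loss_left + loss_right                # uniform combination
    total_loss.backward()
    optimizer.step()
    return loss_left.item(), loss_right.item()
```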


Summary

Introduction

The goal of multi-task learning (MTL) is to learn multiple different yet related tasks simultaneously [1]. The weighting approaches we evaluated include a uniform combination of losses from different tasks, dynamic weight average (DWA) [10], and uncertainty weighting methods [11] [12], each with varying amounts of training data per task. We conduct experiments comparing STL and MTL with two classification heads on the Multi-MNIST dataset.
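For the two adaptive strategies named above, a rough sketch follows. DWA weights each task by the ratio of its losses over the last two epochs, passed through a softmax at a temperature (T = 2 is a common choice, assumed here); uncertainty weighting learns one log-variance per task. Both are commonly used formulations rather than the paper's exact implementation.

```python
# Sketches of the two adaptive weighting schemes compared in the paper; the
# hyperparameters (e.g. the DWA temperature) are assumptions, not reported settings.
import math
import torch
import torch.nn as nn

def dwa_weights(prev_losses, prev_prev_losses, temperature=2.0):
    """Dynamic Weight Average: up-weight tasks whose loss is shrinking slowly."""
    num_tasks = len(prev_losses)
    ratios = [l1 / l2 for l1, l2 in zip(prev_losses, prev_prev_losses)]
    exps = [math.exp(r / temperature) for r in ratios]
    denom = sum(exps)
    return [num_tasks * e / denom for e in exps]

class UncertaintyWeighting(nn.Module):
    """Homoscedastic-uncertainty weighting: learn one log-variance per task."""
    def __init__(self, num_tasks=2):
        super().__init__()
        self.log_vars = nn.Parameter(torch.zeros(num_tasks))

    def forward(self, task_losses):
        total = 0.0
        for i, loss in enumerate(task_losses):
            # Down-weight high-variance tasks; the additive log-variance term
            # penalizes inflating the variance arbitrarily.
            total = total + torch.exp(-self.log_vars[i]) * loss + self.log_vars[i]
        return total
```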
