Abstract
Advanced neural network models are generally built as stacks of multiple layers, which model complex functions and capture linguistic structure at different levels of abstraction [1]. However, only the topmost layer of a deep network is typically passed to subsequent processing, which misses the opportunity to exploit the useful information embedded in the other layers. In this work, we propose to expose all of these embedded signals through two types of mechanisms: deep connections and iterative routing. While deep connections allow better information and gradient flow across layers, iterative routing directly combines the layer representations into a final output via an iterative routing-by-agreement mechanism. Experimental results on both machine translation and language representation tasks demonstrate the effectiveness and universality of the proposed approaches, indicating the value of exploiting deep representations for natural language processing tasks. While each strategy boosts performance on its own, combining them yields further improvement.
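The routing-by-agreement idea can be illustrated with a minimal sketch: each layer's representation "votes" for a combined output, and coupling coefficients are iteratively sharpened toward the layers that agree with the current consensus. This is a simplified, hypothetical NumPy rendering in the style of capsule-network dynamic routing, not the paper's exact formulation; all function names and the squashing nonlinearity are assumptions for illustration.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def route_layers(layers, iterations=3):
    """Combine per-layer representations into one output vector.

    layers: array of shape [L, d] -- one d-dim representation per layer.
    Returns a single [d] vector produced by routing-by-agreement
    (simplified sketch, not the paper's exact algorithm).
    """
    L, d = layers.shape
    logits = np.zeros(L)                       # routing logits, one per layer
    for _ in range(iterations):
        c = softmax(logits)                    # coupling coefficients sum to 1
        s = (c[:, None] * layers).sum(axis=0)  # weighted combination of layers
        n = np.linalg.norm(s)
        v = (n**2 / (1.0 + n**2)) * (s / (n + 1e-9))  # squash: keeps ||v|| < 1
        logits += layers @ v                   # reward layers that agree with v
    return v

# Usage: four layer outputs of width 8 are fused into one vector.
rng = np.random.default_rng(0)
layer_reps = rng.normal(size=(4, 8))
fused = route_layers(layer_reps)
```

The agreement update `logits += layers @ v` is what distinguishes routing from a plain learned weighted sum: layers whose representations align with the running consensus receive larger coupling coefficients on the next iteration.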