Structurally Comparative Hinge Loss for Dependency-Based Neural Text Representation

Kexin Wang,Chengqing Zong,Shaonan Wang,Jiajun Zhang,Yu Zhou

doi:10.1145/3387633

Abstract

Dependency-based graph convolutional networks (DepGCNs) are proven helpful for text representation to handle many natural language tasks. Almost all previous models are trained with cross-entropy (CE) loss, which maximizes the posterior likelihood directly. However, the contribution of dependency structures is not well considered by CE loss. As a result, the performance improvement gained by using the structure information can be narrow due to the failure in learning to rely on this structure information. To face the challenge, we propose the novel structurally comparative hinge (SCH) loss function for DepGCNs. SCH loss aims at enlarging the margin gained by structural representations over non-structural ones. From the perspective of information theory, this is equivalent to improving the conditional mutual information of model decision and structure information given text. Our experimental results on both English and Chinese datasets show that by substituting SCH loss for CE loss on various tasks, for both induced structures and structures from an external parser, performance is improved without additional learnable parameters. Furthermore, the extent to which certain types of examples rely on the dependency structure can be measured directly by the learned margin, which results in better interpretability. In addition, through detailed analysis, we show that this structure margin has a positive correlation with task performance and structure induction of DepGCNs, and SCH loss can help model focus more on the shortest dependency path between entities. We achieve the new state-of-the-art results on TACRED, IMDB, and Zh. Literature datasets, even compared with ensemble and BERT baselines.

Full Text

Published Version

Check institute access

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Structurally Comparative Hinge Loss for Dependency-Based Neural Text Representation

Abstract

Published Version

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing

Lead the way for us

Journal: ACM Transactions on Asian and Low-Resource Language Information Processing	Publication Date: May 18, 2020
Citations: 1

Similar Papers

Direction-sensitive relation extraction using Bi-SDP attention model
Hailin Wang ... Guisong Liu
Knowledge-Based Systems | VOL. 198
Hailin Wang, et. al.Hailin Wang ... Guisong Liu
24 Apr 2020
Knowledge-Based Systems | VOL. 198

Tree kernel-based protein–protein interaction extraction from biomedical literature
Longhua Qian ... Guodong Zhou
Journal of Biomedical Informatics | VOL. 45
Longhua Qian, et. al.Longhua Qian ... Guodong Zhou
25 Feb 2012
Journal of Biomedical Informatics | VOL. 45

BiLSTM-SSVM: Training the BiLSTM with a Structured Hinge Loss for Named-Entity Recognition
Hanieh Poostchi ... Massimo Piccardi
IEEE Transactions on Big Data | VOL. 8
Hanieh Poostchi, et. al.Hanieh Poostchi ... Massimo Piccardi
01 Feb 2022
IEEE Transactions on Big Data | VOL. 8

Continual Pre-Training of Language Models for Concept Prerequisite Learning with Graph Neural Networks
Xin Tang ... Kunjia Liu
Mathematics | VOL. 11
Xin Tang, et. al.Xin Tang ... Kunjia Liu
20 Jun 2023
Mathematics | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Structurally Comparative Hinge Loss for Dependency-Based Neural Text Representation

Abstract

Published Version

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing