Multi-Head Self-Attention Gated-Dilated Convolutional Neural Network for Word Sense Disambiguation

Chun-Xiang Zhang,Xue-Yao Gao,Yu-Long Zhang

doi:10.1109/access.2023.3243574

Chun-Xiang Zhang, Xue-Yao Gao + Show 1 more

Open Access

https://doi.org/10.1109/access.2023.3243574

Copy DOI

Journal: IEEE Access	Publication Date: Jan 1, 2023
Citations: 5	License type: CC BY 4.0

Affiliation: Harbin University of Science and Technology

Abstract

Word sense disambiguation (WSD) is to determine correct sense of ambiguous word based on its context. WSD is widely used in text classification, machine translation and information retrieval and so on. In order to improve accuracy of simplified Chinese WSD, a WSD model based on multi-head self-attention and gated-dilated convolutional neural network(AGDCNN) is proposed. Ambiguous word is viewed as the center and 4 adjacent lexical units are extracted successively toward the left and right side. Words, parts of speech, and semantic categories in 4 adjacent lexical units are vectorized and the vectorized results are input into gated-dilated convolutional neural network to get discriminative features. Then, multi-head self-attention is adopted to learn the difference and connection among discriminative features fully. Finally, classification weights are output from adaptive average pooling layer. Experiments are conducted on SemEval-2007: Task#5 and SemEval-2021: Task#2. Experimental results show that AGDCNN model has higher accuracy compared with other methods. Our goal is to improve the quality of simplified Chinese WSD as much as possible based on current linguistic resources and machine learning methods. The challenge we face is to extract effective discriminative features and design disambiguation model in high quality. Our novelty lies in that gated-dilated convolution is combined with multi-head self-attention to extract effective discriminative features, and learn their difference and connection from word form, parts of speech, and semantic categories.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multi-Head Self-Attention Gated-Dilated Convolutional Neural Network for Word Sense Disambiguation

Abstract

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Graph Convolutional Network for Word Sense Disambiguation
Chun-Xiang Zhang ... Rui Liu
Discrete Dynamics in Nature and Society | VOL. 2021
Chun-Xiang Zhang, et. al.Chun-Xiang Zhang ... Rui Liu
30 Sep 2021
Discrete Dynamics in Nature and Society | VOL. 2021

Word Sense Disambiguation Based on Semantic Knowledge
Rui-Yan Liang ... Chun-Xiang Zhang
-
Rui-Yan Liang, et. al.Rui-Yan Liang ... Chun-Xiang Zhang
01 Jan 2019
01 Jan 2019

Contextual word sense tuning and disambiguation
Roberto Basili ... Maria Teresa Pazienza
Applied Artificial Intelligence | VOL. 11
Roberto Basili, et. al.Roberto Basili ... Maria Teresa Pazienza
01 Apr 1997
Applied Artificial Intelligence | VOL. 11

Machine learning and word sense disambiguation in the biomedical domain: design and evaluation issues.
Hua Xu ... Rositsa Dimova
BMC Bioinformatics | VOL. 7
Hua Xu, et. al.Hua Xu ... Rositsa Dimova
05 Jul 2006
BMC Bioinformatics | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-Head Self-Attention Gated-Dilated Convolutional Neural Network for Word Sense Disambiguation

Abstract

Talk to us

Similar Papers

More From: IEEE Access