Abstract

Many Chinese NER models focus only on lexical and radical information, ignoring the fact that the pronunciation of Chinese entities also follows certain rules. In this paper, we propose VisPhone, which incorporates Chinese characters’ phonetic features into the Transformer encoder along with lattice and visual features. We present the common pronunciation rules of Chinese entities and explore the most suitable way to encode them. VisPhone uses two identical cross-transformer encoders to fuse the visual and phonetic features of the input characters with the text embedding, and a selective fusion module produces the final representation. We conducted experiments on four well-known Chinese NER benchmark datasets: OntoNotes 4.0, MSRA, Resume, and Weibo, achieving F1 scores of 82.63%, 96.07%, 96.26%, and 70.79%, respectively, and improving performance by 0.79%, 0.32%, 0.39%, and 3.47%. Our ablation experiments further demonstrate the effectiveness of VisPhone.
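To make the selective fusion step concrete, below is a minimal sketch (not the authors' code) of a gate-based fusion of text, visual, and phonetic representations as described above; the layer sizes and the exact gating form are assumptions for illustration.

```python
# Minimal sketch of a selective fusion module: learned gates decide how much
# visual (glyph) and phonetic information to mix into the text representation.
# The gating form and dimensions are illustrative assumptions, not VisPhone's code.
import torch
import torch.nn as nn


class SelectiveFusion(nn.Module):
    def __init__(self, hidden_size: int):
        super().__init__()
        # One gate per auxiliary modality, conditioned on all three views.
        self.gate_vis = nn.Linear(3 * hidden_size, hidden_size)
        self.gate_pho = nn.Linear(3 * hidden_size, hidden_size)

    def forward(self, h_text, h_vis, h_pho):
        # h_*: (batch, seq_len, hidden_size)
        concat = torch.cat([h_text, h_vis, h_pho], dim=-1)
        g_vis = torch.sigmoid(self.gate_vis(concat))   # weight for visual features
        g_pho = torch.sigmoid(self.gate_pho(concat))   # weight for phonetic features
        return h_text + g_vis * h_vis + g_pho * h_pho


if __name__ == "__main__":
    fusion = SelectiveFusion(hidden_size=768)
    x = torch.randn(2, 16, 768)  # toy batch: 2 sentences, 16 characters each
    out = fusion(x, torch.randn_like(x), torch.randn_like(x))
    print(out.shape)  # torch.Size([2, 16, 768])
```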
