Knowledge enhancement BERT based on domain dictionary mask

Xianglin Cao,Wenchao Jiang,Hong Xiao

doi:10.3233/jhs-222013

Abstract

Semantic matching is one of the critical technologies for intelligent customer service. Since Bidirectional Encoder Representations from Transformers (BERT) is proposed, fine-tuning on a large-scale pre-training language model becomes a general method to implement text semantic matching. However, in practical application, the accuracy of the BERT model is limited by the quantity of pre-training corpus and proper nouns in the target domain. An enhancement method for knowledge based on domain dictionary to mask input is proposed to solve the problem. Firstly, for modul input, we use keyword matching to recognize and mask the word in domain. Secondly, using self-supervised learning to inject knowledge of the target domain into the BERT model. Thirdly, we fine-tune the BERT model with public datasets LCQMC and BQboost. Finally, we test the model’s performance with a financial company’s user data. The experimental results show that after using our method and BQboost, accuracy increases by 12.12% on average in practical applications.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Knowledge enhancement BERT based on domain dictionary mask

Abstract

Talk to us

Similar Papers

More From: Journal of High Speed Networks

Lead the way for us

Similar Papers

Engineering Document Summarization Using Sentence Representations Generated by Bidirectional Language Model
Yunjian Qiu ... Yan Jin
-
Yunjian Qiu, et. al.Yunjian Qiu ... Yan Jin
17 Aug 2021
17 Aug 2021

Oversampling effect in pretraining for bidirectional encoder representations from transformers (BERT) to localize medical BERT and enhance biomedical BERT
Shoya Wada ... Yasushi Matsumura
Artificial Intelligence In Medicine | VOL. 153
Shoya Wada, et. al.Shoya Wada ... Yasushi Matsumura
05 May 2024
Artificial Intelligence In Medicine | VOL. 153

Bert model fine-tuning for text classification in knee OA radiology reports
L Chen ... V Pedoia
Osteoarthritis and Cartilage | VOL. 28
L Chen, et. al.L Chen ... V Pedoia
01 Apr 2020
Osteoarthritis and Cartilage | VOL. 28

Contextual semantic embeddings based on fine-tuned AraBERT model for Arabic text multi-class categorization
Fatima-Zahra El-Alami ... Noureddine En Nahnahi
Journal of King Saud University - Computer and Information Sciences | VOL. 34
Fatima-Zahra El-Alami, et. al.Fatima-Zahra El-Alami ... Noureddine En Nahnahi
18 Feb 2021
Journal of King Saud University - Computer and Information Sciences | VOL. 34

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Knowledge enhancement BERT based on domain dictionary mask

Abstract

Talk to us

Similar Papers

More From: Journal of High Speed Networks