Abstract

To cope with the challenges posed by the complex linguistic structure and lexical polysemy of ancient texts, this study proposes a two-stage translation model. First, we combine GujiBERT, a GCN, and an LSTM to categorize ancient texts into historical and non-historical categories, laying the foundation for the subsequent translation task. To improve the efficiency of word vector generation and overcome the limitations of the traditional Word2Vec model, we integrate the entropy weight method into the Skip-gram training process and concatenate the resulting word vectors with GujiBERT embeddings. Through dependency weighting, this Entropy-SkipBERT method also strengthens the model's ability to accurately represent lexical polysemy and grammatical structure in ancient documents. When training the translation model, we use a separate dataset for each text category, which significantly improves translation accuracy. Experimental results show that our categorization model improves accuracy by 5% over GujiBERT, while Entropy-SkipBERT improves BLEU scores by 0.7 and 0.4 on the historical and non-historical datasets, respectively. Overall, the proposed two-stage model improves BLEU scores by 2.7 over the baseline model.
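The entropy-weight fusion step described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: the helper names, toy statistics, and vector dimensions are all assumptions. It shows the standard entropy weight method (features that vary more across samples receive larger weights) applied to Skip-gram-style vectors, which are then concatenated with a contextual (GujiBERT-style) embedding.

```python
import numpy as np

def entropy_weights(X, eps=1e-12):
    """Entropy weight method: for a samples-by-features matrix X,
    features with higher dispersion (lower entropy) get larger weights."""
    X = np.asarray(X, dtype=float)
    n = X.shape[0]
    # Column-normalise each feature into a probability distribution.
    P = X / (X.sum(axis=0, keepdims=True) + eps)
    # Shannon entropy per feature, scaled to [0, 1] by ln(n).
    E = -(P * np.log(P + eps)).sum(axis=0) / np.log(n)
    d = 1.0 - E                      # degree of divergence
    return d / d.sum()               # weights sum to 1

def fuse(skipgram_vec, bert_vec, weights):
    """Scale the Skip-gram vector by the entropy weights, then
    concatenate with the contextual embedding (illustrative fusion)."""
    return np.concatenate([skipgram_vec * weights, bert_vec])

# Toy co-occurrence statistics: 4 "words" x 3 features (made-up numbers).
X = np.array([[3., 1., 4.],
              [1., 5., 9.],
              [2., 6., 5.],
              [3., 5., 8.]])
w = entropy_weights(X)
fused = fuse(np.ones(3), np.ones(5), w)   # 3-dim static + 5-dim contextual
```

In this sketch the fused vector simply has the two embeddings' dimensions added together; how the paper weights dependencies and sizes the concatenation is not specified in the abstract.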
