Abstract

The requirements placed on cultural dissemination and display methods keep rising as science and technology advance, especially in the AIGC era. Traditional methods of cultural dissemination are no longer adequate, and the need to digitize intangible cultural heritage (ICH) is increasingly urgent. This paper analyzes neural machine translation methods and constructs a back-translation model by synthesizing a corpus and generating parameters. BLEU and other evaluation metrics are used to optimize the factors that affect the computation of synthetic data. An enhanced synthetic corpus is created by using the kNN algorithm to construct a monolingual translation memory. A semantic information-sharing strategy is used to generate the embedding-layer parameters of the ICH translation model, and an encoding layer is introduced to complete the overall construction of the model. Comparative experiments and empirical analyses are set up to study the heterogeneity of the quality and dissemination effect of English translations of ICH. The experiments show that, in the first set of translation tasks, the BLEU scores of the proposed model are 1 and 1.4 points higher than those of the wait-k model at k = 5 and k = 7, respectively, indicating better translation quality. In the case where samples are not established, the coefficients of the primary (linear) and secondary (quadratic) terms for the English translation of ICH are 14.974 and -3.071, respectively, which indicates an inverted U-shaped relationship.
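
As a worked note on the last result: assuming the coefficients 14.974 and -3.071 come from a quadratic specification of the form below, the negative quadratic coefficient is what makes the curve an inverted U, and the turning point follows directly. The symbols y, x, and the betas are placeholders introduced here, since the abstract names neither the variables nor the intercept.

```latex
\begin{align*}
y &= \beta_0 + \beta_1 x + \beta_2 x^2,
  \qquad \beta_1 = 14.974,\quad \beta_2 = -3.071 \\
\frac{dy}{dx} &= \beta_1 + 2\beta_2 x = 0
  \;\Longrightarrow\;
  x^{*} = -\frac{\beta_1}{2\beta_2} = \frac{14.974}{2 \times 3.071} \approx 2.44
\end{align*}
```

Whether this turning point falls inside the observed range of the data cannot be determined from the abstract alone.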
