Abstract

With the increasing application of the perceptron genetic algorithm neural network in Chinese-English two-way translation, there are many translation problems to be solved. In order to solve the translation problem of Chinese-English parallel corpus, the multilayer perceptron method, genetic word alignment model (GA), language model, and neural network method (including the translation model and bilingual pretraining model) are designed, which are combined into the ga-mlp-nn combination model to measure the parallelism of Chinese and English sentences from different emphases. The results show that the ga-mlp-nn model has good performance in filtering high-quality parallel corpus. The final experimental results show that compared with a single system, the improved multisystem fusion method based on weight multiplication has achieved better results in the test set. In the last five groups of evaluation results, the system submitted in this paper ranks second and first in multiple datasets, which has a certain reference value for the research of corpus filtering.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call