Abstract

With the rapid development of artificial intelligence in the current era of big data, the construction of translation corpus has become a key factor in effectively achieving a highly intelligent translation. In the era of big data, the data sources and data types of translation corpus are becoming more and more diversified, which will inevitably bring about a new revolution in the construction of translation corpus. The construction of the translation corpus in the era of big data can fully rely on third-party open source data, crowd-sourcing translation, machine closed-loop, human-machine collaboration and other multiple modes to comprehensively improve the quality of translation corpus construction to better serve translation practice.

Highlights

  • Driven by the full power of information technology, the construction of translation corpus has become an important foundation for the current development of AI translation and will gradually become the main body of language services. (Chai Mingying, 2016) It has become an urgent call for the times development to scientifically conform to the characteristics of translation corpus in the context of current big data, fully consider the inherent relationship between translation corpus and AI translation, and

  • The rapid development of information technology will inevitably facilitate a high degree of integration of information extraction and knowledge, integrate the data that has not been connected, promote a trend of knowledge mapping in the translation corpus, and realize the effective connection of massive fragment translation data information

  • The current existing online translation systems and translation memory systems provide a solid foundation for obtaining parallel corpus data, which can be seen from Google, Youdao, Baidu and other translation systems

Read more

Summary

INTRODUCTION

Actively construct new translation corpus projects that meet the requirements of big data. The rapid development of big data technology in the current society has fully promoted machine translation to the stage of intelligent translation of neural networks. The advent of the big data era has fully triggered a new wave of big data translation and has greatly reignited people’s hope for machine translation and the construction of a new translation model that is closely integrated with high-tech and adapted to the characteristics of the big data era. (Chai Mingying, 2016) It has become an urgent call for the times development to scientifically conform to the characteristics of translation corpus in the context of current big data, fully consider the inherent relationship between translation corpus and AI translation, and Driven by the full power of information technology, the construction of translation corpus has become an important foundation for the current development of AI translation and will gradually become the main body of language services. (Chai Mingying, 2016) It has become an urgent call for the times development to scientifically conform to the characteristics of translation corpus in the context of current big data, fully consider the inherent relationship between translation corpus and AI translation, and

Examples of types of Translation Corpus in the Context of Big Data
Data Sources of Translation Corpus in the Context of Big Data
The Characteristics of Translation Corpus in the Context of Big Data
DISCUSSION
To Fully Rely on A Third-party Comparison Model of Open Source Data
Sharing Mode of Crowdsourced Translation Corpus Data
Self-closed-loop Learning Model Based on Machine Translation
Collaboration Model Based on HumanMachine Cooperation
CONCLUSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.