Abstract

The structure of English long sentences is generally complicated, with complicated logical levels, many parallel elements, many modifiers and conjunctions, and long post attributives. Moreover, pronouns in English long sentences often need to be judged according to the context. Complex English long sentences are very common in English and even run through the whole English article. The analysis results of these complex long sentences will seriously affect the quality and readability of machine translation. In this paper, HNC (hierarchical network of concepts) method is improved to realize the segmentation of long sentences, so as to simplify sentence patterns. According to the characteristics of professional literature, this paper puts forward a translation optimization method of professional literature combining HNC with statistics, which greatly improves the recognition efficiency of unknown words by extracting professional terms. The results show that the word segmentation method based on HNC and statistics proposed in this paper has achieved a good word segmentation effect in the open test environment, with an accuracy rate of 93.38% and a recall rate of 94.51%. The conclusion shows that our method can make full use of the knowledge of the source tree database, thereby improving the accuracy of the syntactic model on the target tree database.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.