Study on Chinese-English corpus construction toward multiple-domain resources

Xiao-Guang Li

doi:10.3724/sp.j.1087.2008.00146


Journal: Journal of Computer Applications

Publication Date: Jul 10, 2008

#Alignment Model #Features Of Regularity


Abstract

Considering the open, multi-domain nature and regular layout of bilingual resources on the Web, a mixture probabilistic alignment model is proposed to capture the domain-specific and position-specific characteristics of the texts being aligned. Compared with the traditional length-based alignment model, the proposed model achieves improvements of 37% in precision and 40.4% in recall in extensive experiments.
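For readers unfamiliar with the length-based baseline mentioned in the abstract, the following is a minimal sketch of the general idea: sentences are aligned by dynamic programming using only their lengths. It is not the model proposed in the paper, and it is a simplification rather than a faithful reproduction of any published length-based aligner. The names `BEADS`, `length_cost` and `align_by_length`, the bead penalties, and the cost function are all assumptions chosen for illustration.

```python
from math import inf

# Hypothetical alignment "beads": (source sentences used, target sentences used)
# mapped to an illustrative fixed penalty. The values are assumptions.
BEADS = {(1, 1): 0.0, (1, 0): 4.0, (0, 1): 4.0, (2, 1): 2.0, (1, 2): 2.0}


def length_cost(src_len, tgt_len, ratio=1.0):
    """Penalty that grows as tgt_len departs from ratio * src_len."""
    expected = src_len * ratio
    scale = max(expected + tgt_len, 1)
    return (tgt_len - expected) ** 2 / scale


def align_by_length(src_lens, tgt_lens, ratio=1.0):
    """Return the lowest-cost list of (source indices, target indices) beads."""
    n, m = len(src_lens), len(tgt_lens)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    back = [[None] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0

    # Forward dynamic programming over prefixes of both texts.
    for i in range(n + 1):
        for j in range(m + 1):
            if cost[i][j] == inf:
                continue
            for (di, dj), penalty in BEADS.items():
                ni, nj = i + di, j + dj
                if ni > n or nj > m:
                    continue
                step = length_cost(sum(src_lens[i:ni]), sum(tgt_lens[j:nj]), ratio)
                total = cost[i][j] + penalty + step
                if total < cost[ni][nj]:
                    cost[ni][nj] = total
                    back[ni][nj] = (i, j)

    # Trace the best path back from the end of both texts.
    beads = []
    i, j = n, m
    while (i, j) != (0, 0):
        pi, pj = back[i][j]
        beads.append((list(range(pi, i)), list(range(pj, j))))
        i, j = pi, pj
    beads.reverse()
    return beads


# Lengths in characters; the first two source sentences merge into one target.
print(align_by_length([40, 12, 30], [52, 30]))
```

Because such a baseline sees only sentence lengths, it cannot use the domain or the position of a passage on the page, which are the two kinds of information the abstract says the mixture model adds.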
