The Use of MSVM and HMM for Sentence Alignment

Mohamed Abdel Fattah

doi:10.3745/jips.2012.8.2.301

Abstract

In this paper, two new approaches to align English-Arabic sentences in bilingual parallel corpora based on the Multi-Class Support Vector Machine (MSVM) and the Hidden Markov Model (HMM) classifiers are presented. A feature vector is extracted from the text pair that is under consideration. This vector contains text features such as length, punctuation score, and cognate score values. A set of manually prepared training data was assigned to train the Multi-Class Support Vector Machine and Hidden Markov Model. Another set of data was used for testing. The results of the MSVM and HMM outperform the results of the length based approach. Moreover these new approaches are valid for any language pairs and are quite flexible since the feature vector may contain less, more, or different features, such as a lexical matching feature and Hanzi characters in Japanese-Chinese texts, than the ones used in the current research.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

The Use of MSVM and HMM for Sentence Alignment

Abstract

Talk to us

Similar Papers

More From: Journal of Information Processing Systems

Lead the way for us

Journal: Journal of Information Processing Systems	Publication Date: Jun 30, 2012
Citations: 11

Similar Papers

Sentence alignment using P-NNT and GMM
Mohamed Abdel Fattah ... Shingo Kuroiwa
Computer Speech & Language | VOL. 21
Mohamed Abdel Fattah, et. al.Mohamed Abdel Fattah ... Shingo Kuroiwa
04 Feb 2007
Computer Speech & Language | VOL. 21

Malayalam POS Tagger—A Comparison Using SVM and HMM
K Usha ... S Lakshmana Pandian
-
K Usha, et. al.K Usha ... S Lakshmana Pandian
09 Sep 2020
09 Sep 2020

HMM Mixtures (HMM2) for Robust Speech Recognition

-

01 Jan 2003
01 Jan 2003

Classifying G-protein coupled receptors with support vector machines.
Rachel Karchin ... David Haussler
Bioinformatics | VOL. 18
Rachel Karchin, et. al.Rachel Karchin ... David Haussler
01 Jan 2002
Bioinformatics | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The Use of MSVM and HMM for Sentence Alignment

Abstract

Talk to us

Similar Papers

More From: Journal of Information Processing Systems