Hybrid method for modeless Japanese input using N-gram based binary classification and dictionary

Yukino Ikegami,Setsuo Tsuruta

doi:10.1007/s11042-013-1805-1

Abstract

The rapid growth of globalization requires handling a large number of multilingual documents, where Japanese input co-exist with English and other languages, which use the Roman alphabet. Conventional methods for Japanese input require Japanese users to switch the input mode between Japanese and the Latin alphabet. As current solution, there is a modeless Japanese input method that automatically switches the input mode. However, those need training with a large amount of text data for improving the performance. This paper proposes a hybrid modeless Japanese input method that is based on the non-Japanese word dictionary and n-gram character sequence features to decide whether to convert and switch to Kana input or not. The aim of using the non-Japanese word dictionary is decreasing false positive against non-Japanese language words. This dictionary is composed by text data available on the Web. The n-gram based discriminative model are learned by a Support Vector Machine from a balanced corpus, which contains various domain texts. The evaluation of our method has shown that its statistical accuracy according to F-measure for prediction of non-Kana characters improves 7.7 % compared to n-gram only based method. In addition, the real user test has shown the average value of inputted time was agreeside for our method, against disagree side for conventional Japanese input method that requires switching input mode.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Hybrid method for modeless Japanese input using N-gram based binary classification and dictionary

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications

Lead the way for us

Journal: Multimedia Tools and Applications	Publication Date: Jan 11, 2014
Citations: 10

Similar Papers

Modeless Japanese Input Method Using Multiple Character Sequence Features
Y Ikegami ... Y Sakurai
-
Y Ikegami, et. al.Y Ikegami ... Y Sakurai
01 Nov 2012
01 Nov 2012

Switchable dual dial: a Japanese text input method for VR contents based on flick input by using two VR controllers
Tomoya Hirabayashi ... Tokiichiro Takahashi
-
Tomoya Hirabayashi, et. al.Tomoya Hirabayashi ... Tokiichiro Takahashi
26 Mar 2023
26 Mar 2023

R&D of the Japanese Input Method using an eye-controlled communication device for users with disabilities and evaluation with NIRS
Shinji Kotani ... Tetsu Komasaki
-
Shinji Kotani, et. al.Shinji Kotani ... Tetsu Komasaki
01 Oct 2010
01 Oct 2010

Lung disease classification using machine learning algorithms
Murat Aykanat ... Özkan Kiliç
International Journal of Applied Mathematics Electronics and Computers | VOL. 8
Murat Aykanat, et. al.Murat Aykanat ... Özkan Kiliç
31 Dec 2020
International Journal of Applied Mathematics Electronics and Computers | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hybrid method for modeless Japanese input using N-gram based binary classification and dictionary

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications