Modeless Japanese Input Method Using Multiple Character Sequence Features

Y Ikegami,S Tsuruta,Y Sakurai

doi:10.1109/sitis.2012.93

Abstract

Recently, the rapid growth of globalization requires writing a large number of multilingual texts. However, Japanese PC users need to switch the input mode between Japanese and the Latin alphabet on conventional Japanese input method. That is cumbersome. Meanwhile, the solution system using a dictionary is hard to maintain because new words are created every year with high frequency. This paper proposes a modeless Japanese input method which automatically switches the input mode without using a dictionary. Using the model called "multiple character sequence features", this method discriminates whether to convert alphabet into Kana or not. There are multiple character sequence features, namely, character surface features and character type features both based on n-gram. These model features are learned by a Support Vector Machine from corpora especially from those of a large number of living words on Web. The evaluation of this method showed that the statistical accuracy by F-measure for both chatting texts and news texts was over 90% (mostly over 99%).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Modeless Japanese Input Method Using Multiple Character Sequence Features

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Hybrid method for modeless Japanese input using N-gram based binary classification and dictionary
Yukino Ikegami ... Setsuo Tsuruta
Multimedia Tools and Applications | VOL. 74
Yukino Ikegami, et. al.Yukino Ikegami ... Setsuo Tsuruta
11 Jan 2014
Multimedia Tools and Applications | VOL. 74

Using Chou's 5-Step Rule to Predict DNA-Protein Binding with Multi-scale Complementary Feature.
Xiuquan Du ... Jiajia Hu
Journal of proteome research | VOL. 20
Xiuquan Du, et. al.Xiuquan Du ... Jiajia Hu
01 Feb 2021
Journal of proteome research | VOL. 20

Specific emitter identification based on multiple sequence feature learning.
Dong Yi ... Yanyun Wang
PLOS ONE | VOL. 19
Dong Yi, et. al.Dong Yi ... Yanyun Wang
15 May 2024
PLOS ONE | VOL. 19

Genome-wide identification and comparative analyses of key genes involved in C4 photosynthesis in five main gramineous crops.
Liang Chen ... Qiumei Lu
Frontiers in plant science | VOL. 14
Liang Chen, et. al.Liang Chen ... Qiumei Lu
13 Mar 2023
Frontiers in plant science | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Modeless Japanese Input Method Using Multiple Character Sequence Features

Abstract

Talk to us

Similar Papers