Deep Learning-Based Context-Sensitive Spelling Typing Error Correction

Jung-Hun Lee,Minho Kim,Hyuk-Chul Kwon

doi:10.1109/access.2020.3014779

Jung-Hun Lee, Minho Kim + Show 1 more

Open Access

PDF Available

https://doi.org/10.1109/access.2020.3014779

Copy DOI

Export

Save

Cite

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

This study aims to solve the context-sensitive spelling error problem for English documents. There are two types of spelling errors in English: non-word spelling errors and context-sensitive spelling errors. Non-word spelling errors are simple to correct because they can only be detected by matching the words in sentences with those in a dictionary; however, context-sensitive spelling errors entail increased difficulty of correction because the relationship between the word to be corrected and the surrounding context must be known. Spelling errors are considered noise in every field that uses text information, and preprocessing via document correction is necessary to minimize this problem. Context-sensitive spelling errors include homophone errors (which arise from the incorrect use of words that sound the same but are spelled differently), typographical errors (caused by striking an incorrect key on a keyboard), grammatical errors (which occur when the user does not know the correct grammatical rules), and cross word boundary errors (which arise from incorrect spacing between words). This study focuses on typographical errors. The context-sensitive spelling error problem is solved using the deep learning method, which is not an existing statistical method. The deep learning language model-based correction approach is divided into four parts, namely, correction based on word embedding information, contextual embedding information, an auto-regressive (AR) language model, and an auto-encoding (AE) language model. In this study, the best correction performance was obtained for the AE language model-based approach, and we verified its performance through a detailed correction test.

Highlights

Spelling errors can be classified into two categories: nonword and context-sensitive spelling errors
This paper is structured as follows: Section 2 presents related research, Section 3 discusses the context-sensitive spelling errors considered in this study, Section 4 elucidates the correctional language model, Section 5 presents an analysis of the experiment and results, and Section 6 presents the conclusion and future research
In the context-sensitive spelling error correction process, it is difficult to obtain correct answers to spelling errors for all words; we chose a deep learning language model based on unsupervised learning

Summary

INTRODUCTION

Spelling errors can be classified into two categories: nonword and context-sensitive spelling errors The former occur when a word is spelt with a non-conventional spelling, such as ‘‘fron.’’ it is easy to detect these errors by analyzing a word morphologically. The methods used to correct context-sensitive spelling errors can be separated into three categories: rule-based, statistical, and deep learning-based method. We apply various recently developed deep learning language models to context-sensitive spelling error correction and suggest the direction of a correction experiment. This paper is structured as follows: Section 2 presents related research, Section 3 discusses the context-sensitive spelling errors considered in this study, Section 4 elucidates the correctional language model, Section 5 presents an analysis of the experiment and results, and Section 6 presents the conclusion and future research

RELATED RESEARCH

CONTEXT-SENSITIVE SPELLING CORRECTION TECHNIQUE

COMPARISON OF EMBEDDING-BASED CORRECTION PERFORMANCE

Findings

CONCLUSION

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2020
Citations: 26	License type: CC BY 4.0

R Discovery Prime

Deep Learning-Based Context-Sensitive Spelling Typing Error Correction

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

ANALISIS KESALAHAN BERBAHASA DALAM RUBRIK “FOKUS” MAJALAH PENDAPA TAMANSISWA
Yosephus Dominikus Fernandez ... Mukhlish Mukhlish
Caraka: Jurnal Ilmu Kebahasaan, Kesastraan, dan Pembelajarannya | VOL. 4
Yosephus Dominikus Fernandez, et. al.Yosephus Dominikus Fernandez ... Mukhlish Mukhlish
15 Jun 2018
Caraka: Jurnal Ilmu Kebahasaan, Kesastraan, dan Pembelajarannya | VOL. 4

“Without the spelling errors I would have shortlisted her…”: The impact of spelling errors on recruiters’ choice during the personnel selection process
Christelle Martin‐Lacroux
International Journal of Selection and Assessment | VOL. 25
Christelle Martin‐LacrouxChristelle Martin‐Lacroux
04 Aug 2017
International Journal of Selection and Assessment | VOL. 25

Grammatical versus Spelling Error Correction: An Investigation into the Responsiveness of Transformer-Based Language Models Using BART and MarianMT
Rohit Raju ... Sa Gandheesh
Journal of Information & Knowledge Management | VOL. -
Rohit Raju, et. al.Rohit Raju ... Sa Gandheesh
21 Mar 2024
Journal of Information & Knowledge Management | VOL. -

Is Chinese Spelling Check ready? Understanding the correction behavior in real-world scenarios
Liner Yang ... Erhong Yang
AI Open | VOL. 4
Liner Yang, et. al.Liner Yang ... Erhong Yang
01 Jan 2023
AI Open | VOL. 4

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Deep Learning-Based Context-Sensitive Spelling Typing Error Correction

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: IEEE Access