The article focuses on the distinction between typos (accidental mechanical errors) and spelling or conceptual errors that arise from insufficient knowledge of language rules. Modern typo detection methods are analyzed, and the advantages and disadvantages of each are highlighted. The Levenshtein method is one of the most common algorithms for detecting and correcting errors in text: it effectively identifies and corrects errors in short words, where only a few edit operations are needed to convert the erroneous word into the correct one, but it does not consider the context in which the word is used, which can lead to incorrect corrections. The keyboard layout-based method analyzes probable errors caused by the proximity of keys on the keyboard; it is simple to implement and to integrate into existing spell-checking systems, but it likewise ignores the context of word usage. The contextual analysis method relies on contextual information to identify and correct errors in text, and it requires significant computational resources and a large, diverse corpus of texts for effective model training. Deep models such as BERT or GPT consider the context of entire sentences or even larger text blocks, which allows high accuracy in typo detection, but they require substantial computational resources for training and inference, as well as large volumes of high-quality training data. Machine learning methods such as n-grams and Bayesian classifiers show significant potential due to their simplicity and efficiency, but they may not capture complex dependencies between words and context, which reduces their accuracy. The study highlights the importance of accurate error detection in student assessment systems, where typos can affect final grades and the relevance of answers.
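To make the edit-distance idea concrete, the following is a minimal Python sketch of a Levenshtein-based corrector of the kind the abstract describes; the function names `levenshtein` and `correct`, the `max_distance` threshold, and the toy vocabulary are illustrative assumptions, not the article's implementation.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    and substitutions needed to turn string `a` into string `b`."""
    # Classic dynamic-programming formulation: `prev` holds the
    # distances for the previous prefix of `a`, `curr` for the current one.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def correct(word: str, vocabulary: list[str], max_distance: int = 2) -> str | None:
    """Return the closest vocabulary word within `max_distance` edits,
    or None if nothing is close enough. Note the limitation the article
    points out: the choice ignores the surrounding context entirely."""
    best = min(vocabulary, key=lambda w: levenshtein(word, w))
    return best if levenshtein(word, best) <= max_distance else None


print(levenshtein("teh", "the"))                            # 2 (a transposition costs two edits)
print(correct("studnet", ["student", "study", "stupid"]))   # "student"
```

As the example shows, a transposed pair such as "teh" already costs two plain edits, which is one reason short words with small edit distances are the method's strongest case.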
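The keyboard layout-based idea can likewise be sketched by weighting substitution cost by key adjacency. The snippet below is a hedged illustration assuming a standard QWERTY layout; the `QWERTY_ROWS` table and the `key_position`, `is_adjacent`, and `substitution_cost` helpers are hypothetical names introduced for this example, and the grid ignores the physical stagger of keyboard rows.

```python
# Hypothetical QWERTY adjacency model: two keys count as "adjacent"
# if they sit within one row and one column of each other.
QWERTY_ROWS = ["qwertyuiop", "asdfghjkl", "zxcvbnm"]

def key_position(ch: str) -> tuple[int, int] | None:
    for row, keys in enumerate(QWERTY_ROWS):
        col = keys.find(ch.lower())
        if col != -1:
            return row, col
    return None

def is_adjacent(a: str, b: str) -> bool:
    pa, pb = key_position(a), key_position(b)
    if pa is None or pb is None:
        return False
    return abs(pa[0] - pb[0]) <= 1 and abs(pa[1] - pb[1]) <= 1

def substitution_cost(a: str, b: str) -> float:
    # A substitution between adjacent keys is a likely "fat-finger"
    # typo, so it is charged less than an arbitrary substitution.
    if a == b:
        return 0.0
    return 0.5 if is_adjacent(a, b) else 1.0

print(is_adjacent("g", "h"))                                 # True: home-row neighbours
print(substitution_cost("e", "r"), substitution_cost("e", "p"))  # 0.5 1.0
```

Such a cost function can be plugged into the edit-distance recurrence above in place of the uniform substitution cost, which is one simple way the two methods are combined; as the abstract notes, this still leaves the context of word usage out of the decision.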