Dynamic decoding and dual synthetic data for automatic correction of grammar in low-resource scenario.

Ahmad Musyafa,Ahmad Musyafa,Ying Gao,Aiman Solyman,Siraj Khan,Wentian Cai,Muhammad Faizan Khan

doi:10.7717/peerj-cs.2122

Abstract

Grammar error correction systems are pivotal in the field of natural language processing (NLP), with a primary focus on identifying and correcting the grammatical integrity of written text. This is crucial for both language learning and formal communication. Recently, neural machine translation (NMT) has emerged as a promising approach in high demand. However, this approach faces significant challenges, particularly the scarcity of training data and the complexity of grammar error correction (GEC), especially for low-resource languages such as Indonesian. To address these challenges, we propose InSpelPoS, a confusion method that combines two synthetic data generation methods: the Inverted Spellchecker and Patterns+POS. Furthermore, we introduce an adapted seq2seq framework equipped with a dynamic decoding method and state-of-the-art Transformer-based neural language models to enhance the accuracy and efficiency of GEC. The dynamic decoding method is capable of navigating the complexities of GEC and correcting a wide range of errors, including contextual and grammatical errors. The proposed model leverages the contextual information of words and sentences to generate a corrected output. To assess the effectiveness of our proposed framework, we conducted experiments using synthetic data and compared its performance with existing GEC systems. The results demonstrate a significant improvement in the accuracy of Indonesian GEC compared to existing methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Dynamic decoding and dual synthetic data for automatic correction of grammar in low-resource scenario.

Abstract

Talk to us

Similar Papers

More From: PeerJ. Computer science

Lead the way for us

Similar Papers

Grammar Correction for Multiple Errors in Chinese Based on Prompt Templates
Zhici Wang ... Zhiyong Hu
Applied Sciences | VOL. 13
Zhici Wang, et. al.Zhici Wang ... Zhiyong Hu
31 Jul 2023
Applied Sciences | VOL. 13

Automatic Arabic Grammatical Error Correction based on Expectation-Maximization routing and target-bidirectional agreement
Aiman Solyman ... Zeinab Mahmoud
Knowledge-Based Systems | VOL. 241
Aiman Solyman, et. al.Aiman Solyman ... Zeinab Mahmoud
13 Jan 2022
Knowledge-Based Systems | VOL. 241

Analysis of the Application of Feedback Filtering and Seq2Seq Model in English Grammar
Aizhen Zhang
Wireless Communications and Mobile Computing | VOL. 2022
Aizhen ZhangAizhen Zhang
24 Mar 2022
Wireless Communications and Mobile Computing | VOL. 2022

Synthetic data with neural machine translation for automatic correction in arabic grammar
Aiman Solyman ... Zeinab Aleibeid
Egyptian Informatics Journal | VOL. 22
Aiman Solyman, et. al.Aiman Solyman ... Zeinab Aleibeid
24 Dec 2020
Egyptian Informatics Journal | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dynamic decoding and dual synthetic data for automatic correction of grammar in low-resource scenario.

Abstract

Talk to us

Similar Papers

More From: PeerJ. Computer science