Abstract

Grammar error correction (GEC) refers to the proper correction of grammatical errors in a given sentence. Important factors to consider in GEC are not only the grammatical correction of the sentence, but also the recognition of a correct sentence in which no changes are required. However, GEC approaches in which deep learning recently started being used consider only the former aspect, which leads to overcorrection, whereby changes are made to a correct sentence unnecessarily. Because this bias is also reflected in performance metrics, conventional performance metrics consider only part of the important factors in GEC. This study proposes a new metric to consider both important aspects in GEC and to provide a new viewpoint for the GEC task. To the best of the authors knowledge, this study is the first to deal with comprehensively considering the correction performance and overcorrection problem in GEC. The experimental results demonstrate that the model performance ranking was reversed when evaluating the performance with the proposed metric compared to the General Language Understanding Evaluation benchmark [21], which only considers the correction performance. This indicates that the high performance of the correction does not result in less problems with the overcorrection and that the overcorrection problem should also be considered when evaluating the model performance. Moreover, we found that the copy mechanism [14] helps to alleviate the problem of overcorrection.

Highlights

  • In recent years, artificial neural networks and deep learning methods have been applied to different and varied fields [24], [25]

  • neural MT (NMT)-based grammar error correction (GEC)? If overcorrection occurs, does the metric reflecting overcorrection affect the performance ranking? this study aims to measure the performance when a correct sentence is entered as input and is output without any modification

  • This paper suggests a new metric for the GEC task that can identify the correction and overcorrection performance simultaneously

Read more

Summary

Introduction

Artificial neural networks and deep learning methods have been applied to different and varied fields [24], [25]. Among these fields, natural language processing (NLP) based on deep learning and machine learning has become a topic of substantial interest. In NLP, grammar error correction (GEC) refers to the proper correction of spelling and grammatical errors that appear in a given sentence. There are two important factors in GEC: correcting grammatical or spelling errors in sentences, and making no changes to a sentence with no errors. The majority of recent studies have considered only the first factor.

Objectives
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call