Abstract

Previously, neural methods in grammatical error correction (GEC) did not reach state-of-the-art results compared to phrase-based statistical machine translation (SMT) baselines. We demonstrate parallels between neural GEC and low-resource neural MT and successfully adapt several methods from low-resource MT to neural GEC. We further establish guidelines for trustable results in neural GEC and propose a set of model-independent methods for neural GEC that can be easily applied in most GEC settings. Proposed methods include adding source-side noise, domain-adaptation techniques, a GEC-specific training objective, transfer learning with monolingual data, and ensembling of independently trained GEC models and language models. Combined, these methods yield neural GEC models that exceed the state of the art, outperforming the previously best neural GEC systems by more than 10% M² on the CoNLL-2014 benchmark and by 5.9% on the JFLEG test set. Non-neural state-of-the-art systems are outperformed by more than 2% on the CoNLL-2014 benchmark and by 4% on JFLEG.
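To make the source-side noise idea concrete, the sketch below applies source-word dropout during training by zeroing entire source word embeddings with a fixed per-token probability. The function name, array layout, and probability value are illustrative assumptions, not details of the paper's implementation.

    import numpy as np

    def source_word_dropout(src_embeddings, p_src=0.2, rng=None):
        # src_embeddings: (src_len, emb_dim) matrix of source word embeddings
        # p_src: per-token dropout probability (illustrative value, not from the paper)
        rng = rng or np.random.default_rng()
        # One keep/drop decision per source token: the whole embedding vector
        # is zeroed, rather than individual dimensions as in standard dropout.
        keep = (rng.random(src_embeddings.shape[0]) >= p_src).astype(src_embeddings.dtype)
        return src_embeddings * keep[:, None]

    # Example: corrupt a toy sequence of 5 source embeddings of dimension 4.
    corrupted = source_word_dropout(np.random.randn(5, 4), p_src=0.2)

Applied only at training time, this kind of corruption discourages the model from relying too heavily on a clean source signal and acts as a regularizer on the relatively small GEC training data.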

Highlights

  • Most successful approaches to automated grammatical error correction (GEC) are based on methods from statistical machine translation (SMT), especially the phrase-based variant

  • If we look at recent MT work with this in mind, we find one area where phrase-based SMT dominates over neural machine translation (NMT): low-resource machine translation

  • Current state-of-the-art GEC systems based on SMT all include large-scale in-domain language models, either following the steps outlined in Junczys-Dowmunt and Grundkiewicz (2016) or directly re-using their domain-adapted Common Crawl language model

Summary

Introduction

Most successful approaches to automated grammatical error correction (GEC) are based on methods from statistical machine translation (SMT), especially the phrase-based variant. The Cambridge Learner Corpus (CLC) by Nicholls (2003), probably the best available resource, is non-public, and we would strongly discourage reporting results that include it as training data, as this makes comparisons difficult. Current state-of-the-art GEC systems based on SMT all include large-scale in-domain language models, either following the steps outlined in Junczys-Dowmunt and Grundkiewicz (2016) or directly re-using their domain-adapted Common Crawl language model. It seems that the current state of neural methods in GEC reflects the behavior of NMT systems trained on smaller data sets. We recommend a model-independent toolbox for neural GEC.
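As a rough illustration of how such an in-domain language model can be combined with a neural system at decoding time, one common formulation is a log-linear combination of the ensembled neural models and the language model; the interpolation weight \lambda below is an assumed tunable hyper-parameter, not a value reported in the paper:

    \mathrm{score}(y \mid x) = \sum_{i=1}^{N} \log P_i(y \mid x) + \lambda \, \log P_{\mathrm{LM}}(y)

Here each P_i is one of the N independently trained neural GEC models and P_LM is the language model; \lambda would typically be tuned on a development set.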

A trustable baseline for neural GEC
Training and test data
Preprocessing and sub-words
Model and training procedure
Optimizer instability
Ensembling of independent models
Adaptations for GEC
Source-word dropout as corruption
Domain adaptation
Error adaptation
Tied embeddings
Edit-weighted MLE objective
Transfer learning for GEC
Pre-training embeddings
Pre-training decoder parameters
Results for transfer learning
Ensembling with language models
Deeper NMT models
Architectures
Training settings
Pre-training deep models
Results
A standard tool set for neural GEC