Abstract

In English, high-quality sentence compression models that compress by deleting words have been trained on large, automatically created training datasets. We address Japanese sentence compression with a similar approach. To create a large Japanese training dataset, we modify a method for building an English training dataset according to the characteristics of the Japanese language. The resulting dataset is used to train Japanese sentence compression models based on recurrent neural networks.

Highlights

  • Sentence compression is the task of shortening a sentence while preserving its important information and grammaticality

  • A high-quality English sentence compression model that deletes words was trained on a large training dataset (Filippova and Altun, 2013; Filippova et al., 2015)

  • The first model is Filippova et al.’s original model, an encoder-decoder model with long short-term memory (LSTM), which we extend in this paper to obtain two further models that can control the output length (Kikuchi et al., 2016), because controlling the output length makes the compressed sentence more informative under a desired length, as sketched below
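
The following is a minimal sketch, not the authors' code, of the general idea of length control in an LSTM-based model in the spirit of Kikuchi et al. (2016), adapted here to deletion-based compression: the remaining "keep budget" is embedded and fed to the LSTM at each step so the output length can be steered. All class, parameter, and variable names are illustrative assumptions.

```python
import torch
import torch.nn as nn

KEEP, DELETE = 1, 0

class LengthControlledTagger(nn.Module):
    def __init__(self, vocab_size, emb_dim=128, len_dim=16, hidden_dim=256, max_budget=100):
        super().__init__()
        self.word_emb = nn.Embedding(vocab_size, emb_dim)
        self.len_emb = nn.Embedding(max_budget + 1, len_dim)   # embedding of remaining keep budget
        self.cell = nn.LSTMCell(emb_dim + len_dim, hidden_dim)
        self.out = nn.Linear(hidden_dim, 2)                    # KEEP / DELETE logits

    def step(self, word_id, budget, state):
        # Concatenate the word embedding with the embedding of the remaining budget.
        x = torch.cat([self.word_emb(word_id), self.len_emb(budget)], dim=-1)
        h, c = self.cell(x, state)
        return self.out(h), (h, c)

# Usage: walk over the source words, decrementing the budget whenever a word is kept.
model = LengthControlledTagger(vocab_size=5000)
state = (torch.zeros(1, 256), torch.zeros(1, 256))
budget = 10                            # desired compressed length in words
source_word_ids = [17, 42, 7, 99, 3]   # toy ids for a 5-word sentence
kept = []
for wid in source_word_ids:
    logits, state = model.step(torch.tensor([wid]), torch.tensor([budget]), state)
    if budget > 0 and logits.argmax(dim=-1).item() == KEEP:
        kept.append(wid)
        budget -= 1
```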


Summary

Introduction

Sentence compression is the task of shortening a sentence while preserving its important information and grammaticality. One advantage of compression by deleting words, as opposed to abstractive compression, lies in the small search space. Another is that the compressed sentence is more likely to be free of incorrect information not mentioned in the source sentence. A high-quality English sentence compression model that deletes words was trained on a large training dataset (Filippova and Altun, 2013; Filippova et al., 2015). In their dataset-creation method, nouns, verbs, adjectives, and adverbs (i.e., content words) shared by the source sentence S and its headline H are identified by matching word lemmas, and a rooted dependency subtree that contains all the shared content words is regarded as the compression C. Their method is designed for English and cannot be applied to Japanese as is. We train three models on the created Japanese dataset: the first is Filippova et al.’s original model, an encoder-decoder model with long short-term memory (LSTM), which we extend in this paper to obtain two further models that can control the output length (Kikuchi et al., 2016), because controlling the output length makes the compressed sentence more informative under a desired length.
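
The sketch below illustrates the dataset-creation idea described above: content words shared by the source sentence S and the headline H are found by lemma matching, and the smallest rooted dependency subtree covering them is taken as the compression C. It is a simplified illustration under assumed data structures (tokens carrying id, lemma, pos, and head fields), not the authors' implementation.

```python
CONTENT_POS = {"NOUN", "VERB", "ADJ", "ADV"}

def shared_content_ids(sentence, headline_lemmas):
    """Ids of content words in S whose lemma also appears in H."""
    return {t["id"] for t in sentence
            if t["pos"] in CONTENT_POS and t["lemma"] in headline_lemmas}

def minimal_rooted_subtree(sentence, target_ids):
    """Union of root-to-token paths: the smallest rooted subtree containing target_ids."""
    head = {t["id"]: t["head"] for t in sentence}   # head == 0 marks the root
    keep = set()
    for tid in target_ids:
        while tid != 0 and tid not in keep:
            keep.add(tid)
            tid = head[tid]
    return keep

# Toy English example (the same idea would apply to a Japanese dependency tree).
S = [
    {"id": 1, "lemma": "the",       "pos": "DET",  "head": 2},
    {"id": 2, "lemma": "company",   "pos": "NOUN", "head": 3},
    {"id": 3, "lemma": "announce",  "pos": "VERB", "head": 0},
    {"id": 4, "lemma": "a",         "pos": "DET",  "head": 6},
    {"id": 5, "lemma": "new",       "pos": "ADJ",  "head": 6},
    {"id": 6, "lemma": "product",   "pos": "NOUN", "head": 3},
    {"id": 7, "lemma": "yesterday", "pos": "ADV",  "head": 3},
]
H_lemmas = {"company", "announce", "product"}
C_ids = minimal_rooted_subtree(S, shared_content_ids(S, H_lemmas))
compressed = [t["lemma"] for t in S if t["id"] in C_ids]   # ['company', 'announce', 'product']
```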

Creating training dataset for Japanese
Identification of shared content words
Transformation of a dependency tree
Extraction of the minimum rooted subtree
Conditions imposed on news articles
Sentence compression with LSTM
Experiments
Automatic evaluation
Human evaluation
Findings
Conclusion

