Multiattentive Recurrent Neural Network Architecture for Multilingual Readability Assessment

Ion Madrazo Azpiazu,Maria Soledad Pera

doi:10.1162/tacl_a_00278

Ion Madrazo Azpiazu, Maria Soledad Pera

Open Access

https://doi.org/10.1162/tacl_a_00278

Copy DOI

Abstract

We present a multiattentive recurrent neural network architecture for automatic multilingual readability assessment. This architecture considers raw words as its main input, but internally captures text structure and informs its word attention process using other syntax- and morphology-related datapoints, known to be of great importance to readability. This is achieved by a multiattentive strategy that allows the neural network to focus on specific parts of a text for predicting its reading level. We conducted an exhaustive evaluation using data sets targeting multiple languages and prediction task types, to compare the proposed model with traditional, state-of-the-art, and other neural network strategies.

Highlights

Readability assessment has been used by diverse stakeholders–from educators to public institutions—for determining the complexity of texts (Benjamin, 2012)
Xmij always contains all possible morphological tags considered for the language, assigning a Not applicable (NA) value when the label cannot be applied to the token—for example, tense would have a value of NA for all nouns
We describe the strategies considered in our assessment, including traditional formulas, stateof-the-art tools based on extensive feature engineering, and neural network structures intended for an ablation study on major components of Vec2Read

Summary

Introduction

Readability assessment has been used by diverse stakeholders–from educators to public institutions—for determining the complexity of texts (Benjamin, 2012). To improve the quality of automatic readability assessment, researchers turned to more sophisticated techniques that go beyond examining shallow features These techniques, typically based on supervised machine learning, incorporate hundreds (even thousands) of features that describe a text from multiple perspectives: syntax, morphology, cohesion, discourse structure, and subject matter (Dell’Orletta et al, 2011; Francois and Fairon, 2012; Denning et al, 2016; Arfeet al., 2018). The dependency on these numerous features, has made readability assessment tools too complex to deploy and apply to languages beyond the one for which they were originally designed. Feature and language dependency, along with lack of homogeneity in terms of readability scales, often prevent researchers from comparing new strategies with state-of-the-art counterparts, preventing community consensus on which features are the most beneficial for capturing text complexity (De Clercq and Hoste, 2016)

Objectives

Methods

Findings

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Transactions of the Association for Computational Linguistics	Publication Date: Nov 1, 2019
Citations: 53	License type: cc-by

R Discovery Prime

R Discovery Prime

Multiattentive Recurrent Neural Network Architecture for Multilingual Readability Assessment

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics

Lead the way for us

Similar Papers

Evolutionary Neural Network Based on Immune Continuous Ant Colony Algorithm
Gao Wei
-
Gao WeiGao Wei
01 Jan 2010
01 Jan 2010

Forecast of Waterway Cargo Turnover Volume Based on Genetic Algorithm to Optimize Neural Network Parameters
Rong Ma
Journal of Physics: Conference Series | VOL. 2083
Rong MaRong Ma
01 Nov 2021
Journal of Physics: Conference Series | VOL. 2083

A fuzzy intelligent approach to the classification problem in gene expression data analysis
Mehdi Khashei ... Mehdi Bijari
Knowledge-Based Systems | VOL. 27
Mehdi Khashei, et. al.Mehdi Khashei ... Mehdi Bijari
29 Oct 2011
Knowledge-Based Systems | VOL. 27

Calculating the Synthetic Efficiency of Hydroturbine Based on the BP Neural Network and Elman Neural Network
Lin Zhang ... Yi Min Wang
Applied Mechanics and Materials | VOL. 457-458
Lin Zhang, et. al.Lin Zhang ... Yi Min Wang
01 Oct 2013
Applied Mechanics and Materials | VOL. 457-458

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multiattentive Recurrent Neural Network Architecture for Multilingual Readability Assessment

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics