Integrating LSA-based hierarchical conceptual space and machine learning methods for leveling the readability of domain-specific texts

Hou-Chiang Tseng, Tao-Hsing Chang, Berlin Chen, Yao-Ting Sung

doi:10.1017/s1351324919000093

Abstract

Text readability assessment is a challenging interdisciplinary endeavor with rich practical implications. It has long drawn the attention of researchers internationally, and the readability models developed since have been widely applied in various fields. Previous readability models have relied only on linguistic features designed for general text analysis and have not been sufficiently accurate when used to gauge domain-specific texts. In view of this, this study proposes a hierarchical conceptual space constructed with latent semantic analysis (LSA) that can be used to train a readability model to assess domain-specific texts accurately. Compared with a baseline using a traditional model, the new model improves accuracy by 13.88% to reach 68.98% when leveling social science texts, and by 24.61% to reach 73.96% when leveling natural science texts. When the readability features developed for this study are combined with general linguistic features, the accuracy for social science texts improves by 31.58% to reach 86.68%, and that for natural science texts by 26.56% to reach 75.91%. These results indicate that the readability features developed in this study can be used on their own to train a readability model for leveling domain-specific texts, and can also be combined with more common linguistic features to improve the model further. Future research can extend the generalizability of the model by applying the proposed method to texts from other fields and grade levels, thereby broadening its practical applications.
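The abstract does not state the baseline accuracies directly, but they can be recovered from the reported figures. The short sketch below assumes that each reported improvement is an absolute difference in percentage points (an interpretation, not something the abstract says explicitly). The names `results` and `implied_baseline` are illustrative only.

```python
# Accuracy figures quoted in the abstract, in percent.
# Each value is (reported improvement over baseline, resulting accuracy).
results = {
    ("social science", "LSA features only"): (13.88, 68.98),
    ("natural science", "LSA features only"): (24.61, 73.96),
    ("social science", "LSA + general linguistic features"): (31.58, 86.68),
    ("natural science", "LSA + general linguistic features"): (26.56, 75.91),
}


def implied_baseline(gain, accuracy):
    """Baseline accuracy, ASSUMING `gain` is an absolute gap in percentage points."""
    return round(accuracy - gain, 2)


baselines = {key: implied_baseline(*vals) for key, vals in results.items()}

for (domain, features), base in baselines.items():
    print(f"{domain:15s} | {features:35s} | implied baseline {base:.2f}%")
```

Under this reading, both social science comparisons imply the same baseline, and so do both natural science comparisons. That agreement is consistent with the improvements being absolute percentage-point differences measured against a single baseline per domain.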

Journal: Natural Language Engineering
Publication Date: Apr 5, 2019
Citations: 17

Similar Papers

Do words matter: Investigating the association between linguistic features of accounting examinations and marks
Juan Mendelsohn Ontong
South African Journal of Education | VOL. 44
31 May 2024

Sentiment Analysis of Korean Using Effective Linguistic Features and Adjustment of Word Senses
Hayeun Jang ... Hyopil Shin
Language and Information | VOL. 14
31 Dec 2010

You've got style
Erica L Snow ... Cecile A Perret
16 Mar 2015

Incorporating linguistic theories of pronunciation variation into speech–recognition models
Mari Ostendorf
Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences | VOL. 358
15 Apr 2000
