A Machine Learning–Based Readability Model for Gujarati Texts

Chandrakant K Bhogayata

doi:10.1145/3637826

Abstract

This study aims to develop a machine learning–based model to predict the readability of Gujarati texts. The dataset comprised 50 prose passages from Gujarati literature. Fourteen lexical and syntactic text features related to readability were extracted from the dataset using a unigram part-of-speech tagger (a machine learning algorithm) and three Python scripts. Two samples of native Gujarati-speaking students, one from secondary education and one from higher education, rated the readability of the texts on a 10-point scale from “easy” to “difficult”, with interrater agreement among the raters. After dimensionality reduction, seven text features served as the independent variables and the mean readability rating as the dependent variable for training the readability model. Because the students' level of education and gender were related to their readability ratings, four readability models (for school students, university students, male students, and female students) were trained with backward stepwise multiple linear regression, a supervised machine learning algorithm. The trained models are comparable across the rater groups. The best model is the one based on the university students’ readability ratings. This model was cross-validated: it explains 91% and 88% of the variance in readability ratings in training and cross-validation, respectively, with a large effect size and high statistical power.
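
The modeling step described in the abstract (backward stepwise multiple linear regression followed by cross-validation) can be illustrated with a minimal Python sketch. It is not the author's code. The data are synthetic (50 passages, 7 made-up features, ratings generated from two of them), all function names are hypothetical, and the elimination criterion is an assumption: the abstract does not say which criterion was used, so this sketch drops a feature whenever doing so does not lower adjusted R^2.

```python
import random

def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x

def fit_ols(rows, y, cols):
    """Least-squares coefficients (intercept first) using only the given feature columns."""
    design = [[1.0] + [row[c] for c in cols] for row in rows]
    k = len(cols) + 1
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(k)] for i in range(k)]
    xty = [sum(d[i] * t for d, t in zip(design, y)) for i in range(k)]
    return solve(xtx, xty)

def predict(beta, row, cols):
    return beta[0] + sum(b * row[c] for b, c in zip(beta[1:], cols))

def r_squared(y, y_hat):
    mean = sum(y) / len(y)
    ss_res = sum((a - b) ** 2 for a, b in zip(y, y_hat))
    ss_tot = sum((a - mean) ** 2 for a in y)
    return 1.0 - ss_res / ss_tot

def adjusted_r2(rows, y, cols):
    beta = fit_ols(rows, y, cols)
    r2 = r_squared(y, [predict(beta, row, cols) for row in rows])
    n, p = len(y), len(cols)
    return 1.0 - (1.0 - r2) * (n - 1) / (n - p - 1)

def backward_stepwise(rows, y, cols):
    """Assumed criterion: drop one feature at a time while adjusted R^2 does not fall."""
    cols = list(cols)
    best = adjusted_r2(rows, y, cols)
    while len(cols) > 1:
        score, drop = max(
            (adjusted_r2(rows, y, [c for c in cols if c != d]), d) for d in cols
        )
        if score < best:
            break
        best = score
        cols.remove(drop)
    return cols

def cross_validated_r2(rows, y, cols, folds=5):
    """R^2 computed from out-of-fold predictions (interleaved k-fold split)."""
    y_hat = [0.0] * len(y)
    for f in range(folds):
        train = [i for i in range(len(y)) if i % folds != f]
        beta = fit_ols([rows[i] for i in train], [y[i] for i in train], cols)
        for i in range(f, len(y), folds):
            y_hat[i] = predict(beta, rows[i], cols)
    return r_squared(y, y_hat)

# Synthetic stand-in data: 50 "passages", 7 "features", ratings driven by features 0 and 3.
rng = random.Random(0)
rows = [[rng.gauss(0, 1) for _ in range(7)] for _ in range(50)]
ratings = [5.0 + 1.5 * r[0] - 1.0 * r[3] + rng.gauss(0, 0.3) for r in rows]

selected = backward_stepwise(rows, ratings, range(7))
beta = fit_ols(rows, ratings, selected)
train_r2 = r_squared(ratings, [predict(beta, r, selected) for r in rows])
cv_r2 = cross_validated_r2(rows, ratings, selected)
print("selected features:", selected)
print("training R^2:", round(train_r2, 3), "cross-validated R^2:", round(cv_r2, 3))
```

Comparing the training R^2 with the cross-validated R^2 mirrors the abstract's pairing of 91% and 88% explained variance: a small gap between the two suggests the selected features generalize beyond the passages used for fitting. The numbers printed here describe only the synthetic data and say nothing about the study's results.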


Journal: ACM Transactions on Asian and Low-Resource Language Information Processing
Publication Date: Feb 8, 2024
Citations: 1

