Abstract

Reading is a foundational skill and the focus of school-level education efforts across countries. The assessment of linguistic competence from oral reading has long been the subject of scientific studies linking the reader’s comprehension of the text to various measures of oral reading fluency. Because such assessment is time- and resource-intensive, it is of interest to automate the prediction of reading fluency from audio recordings using the same pedagogical rubrics. In light of recent findings about the importance of prosody to the communicative purpose of reading aloud, we discuss new approaches to modeling it reliably for the automatic assessment task. We present a new data set of children’s oral reading, screened for minimum word decoding skill and rated for comprehensibility by two human experts. We develop a system for the automatic prediction of rater scores that also facilitates insights about the complementarity and inter-dependence of computed lexical accuracy, rate, and prosodic features, as corroborated by multiple performance measures. The achieved correlation and agreement values surpass the corresponding inter-rater measures, and we show that text-dependent prosodic features, informed by speech rate and speaking style, contribute prominently to system performance.
