Abstract
An extension to a rating system for tracking the evolution of parameters over time using continuous variables is introduced. The proposed rating system assumes a distribution for the continuous responses, which is agnostic to the origin of the continuous scores and thus can be used for applications as varied as continuous scores obtained from language testing to scores derived from accuracy and response time from elementary arithmetic learning systems. Large-scale, high-stakes, online, anywhere anytime learning and testing inherently comes with a number of unique problems that require new psychometric solutions. These include (1) the cold start problem, (2) problem of change, and (3) the problem of personalization and adaptation. We outline how our proposed method addresses each of these problems. Three simulations are carried out to demonstrate the utility of the proposed rating system.
Highlights
Large-scale, high-stakes, online, anywhere anytime learning and testing inherently comes with a number of unique problems that require new psychometric solutions
We shall see that the differences in discrimination that derive from the dyadic expansion of the continuous response variables in the continuous Rasch (CR) model translate into differences in the stakes of the game
We have proposed a new method to analyze data generated by massive online learning systems, such as Duolingo English Test (DET) or Math Garden, based on the CR model and the Urnings ratings system
Summary
Large-scale, high-stakes, online, anywhere anytime learning and testing inherently comes with a number of unique problems that require new psychometric solutions. There is the problem of change: learner and item properties change as a cohort of learners progresses through its education While such changes are intended, they are not handled by traditional psychometrics developed to assess student’s ability at a single time point. There is the problem of personalization and adaptation: to optimally support learning, each learner follows her own path at her own pace This will give rise to sparse, incomplete data that are not analyzed using likelihood-based methods. Online learning systems, such as Duolingo, for foreign languages, and Math Garden, for elementary arithmetic, generate large data sets with large number of item responses per learner as learners practice with many items over extended periods of time. Throughout, the Duolingo English Test (DET; Wagner and Kunnan, 2015; LaFlair and Settles, 2019; Maris, 2020), and Math Garden (Klinkenberg et al, 2011) will serve as motivating examples
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.