Contrasting Classical and Machine Learning Approaches in the Estimation of Value-Added Scores in Large-Scale Educational Data.

Jessica Levy,Ulrich Keller,Antoine Fischbach,Dominic Mussack,Pedro Cardoso-Leite,Martin Brunner

doi:10.3389/fpsyg.2020.02190

Jessica Levy, Ulrich Keller + Show 4 more

Open Access

https://doi.org/10.3389/fpsyg.2020.02190

Copy DOI

Abstract

There is no consensus on which statistical model estimates school value-added (VA) most accurately. To date, the two most common statistical models used for the calculation of VA scores are two classical methods: linear regression and multilevel models. These models have the advantage of being relatively transparent and thus understandable for most researchers and practitioners. However, these statistical models are bound to certain assumptions (e.g., linearity) that might limit their prediction accuracy. Machine learning methods, which have yielded spectacular results in numerous fields, may be a valuable alternative to these classical models. Although big data is not new in general, it is relatively new in the realm of social sciences and education. New types of data require new data analytical approaches. Such techniques have already evolved in fields with a long tradition in crunching big data (e.g., gene technology). The objective of the present paper is to competently apply these “imported” techniques to education data, more precisely VA scores, and assess when and how they can extend or replace the classical psychometrics toolbox. The different models include linear and non-linear methods and extend classical models with the most commonly used machine learning methods (i.e., random forest, neural networks, support vector machines, and boosting). We used representative data of 3,026 students in 153 schools who took part in the standardized achievement tests of the Luxembourg School Monitoring Program in grades 1 and 3. Multilevel models outperformed classical linear and polynomial regressions, as well as different machine learning models. However, it could be observed that across all schools, school VA scores from different model types correlated highly. Yet, the percentage of disagreements as compared to multilevel models was not trivial and real-life implications for individual schools may still be dramatic depending on the model type used. Implications of these results and possible ethical concerns regarding the use of machine learning methods for decision-making in education are discussed.

Highlights

Value-added (VA) models are statistical models designed to estimate school effectiveness based on students’ achievement
We focused on how the choice of the statistical model affects VA scores, which are used to evaluate the effectiveness of schools, as VA scores and measures of school effectiveness may vary greatly depending on the statistical method used (e.g., Sloat et al, 2018)
We aim to extend the study from Levy et al (2020) by examining different model types for the estimation of school VA scores, and the study from Schiltz et al (2018) by using a different data set with population data, by adding multilevel models, by adding non-linear “classical” models, and by adding different types of machine learning methods to the comparison

Summary

Introduction

Value-added (VA) models are statistical models designed to estimate school (or teacher) effectiveness based on students’ achievement. The use of VA models is highly consequential because VA scores are often used for accountability and high-stakes decisions to allocate financial or personal resources to schools (for an overview from a more economical point of view, see, Hanushek, 2019) These high stakes can make estimating VA scores a politically charged topic, especially in the US, where many states have implemented VA-based evaluation systems (AmreinBeardsley and Holloway, 2017; Kurtz, 2018). In recent years, the consequential use of VA models seems to be decreasing again in many states (Close et al, 2020) Despite both the practical and political relevance, there is currently no consensus on how to best estimate VA scores (Everson, 2017; Levy et al, 2019).

Objectives

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Frontiers in psychology	Publication Date: Aug 21, 2020
Citations: 10	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Contrasting Classical and Machine Learning Approaches in the Estimation of Value-Added Scores in Large-Scale Educational Data.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in psychology

Lead the way for us

Similar Papers

Sensors support machine learning
-
Food Science and Technology | VOL. 33
--
01 Dec 2019
Food Science and Technology | VOL. 33

A review of machine learning applications in wildfire science and management
Piyush Jain ... Sean C.P Coogan
Environmental Reviews | VOL. 28
Piyush Jain, et. al.Piyush Jain ... Sean C.P Coogan
28 Jul 2020
Environmental Reviews | VOL. 28

Comparison of Statistical and Machine Learning models for pipe failure modeling in Water Distribution Networks (WDN)
Mónica Giraldo ... Juan Rodríguez
-
Mónica Giraldo, et. al.Mónica Giraldo ... Juan Rodríguez
12 Nov 2019
12 Nov 2019

A review of machine learning applications in life cycle assessment studies
Xiaobo Xue Romeiko ... Callie Babbitt
Science of The Total Environment | VOL. 912
Xiaobo Xue Romeiko, et. al.Xiaobo Xue Romeiko ... Callie Babbitt
28 Nov 2023
Science of The Total Environment | VOL. 912

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Contrasting Classical and Machine Learning Approaches in the Estimation of Value-Added Scores in Large-Scale Educational Data.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in psychology