Abstract

Measures of cognitive or socio-emotional skills from large-scale assessment surveys (LSAS) are often based on advanced statistical models and scoring techniques unfamiliar to applied researchers. Consequently, applied researchers working with data from LSAS may be uncertain about the assumptions and computational details of these statistical models and scoring techniques and about how to best incorporate the resulting skill measures in secondary analyses. The present paper is intended as a primer for applied researchers. After a brief introduction to the key properties of skill assessments, we give an overview of the three principal methods with which secondary analysts can incorporate skill measures from LSAS in their analyses: (1) as test scores (i.e., point estimates of individual ability), (2) through structural equation modeling (SEM), and (3) in the form of plausible values (PVs). We discuss the advantages and disadvantages of each method based on three criteria: fallibility (i.e., control for measurement error and unbiasedness), usability (i.e., ease of use in secondary analyses), and immutability (i.e., consistency of test scores, PVs, or measurement model parameters across different analyses and analysts). We show that although none of the methods are optimal under all criteria, methods that result in a single point estimate of each respondent’s ability (i.e., all types of “test scores”) are rarely optimal for research purposes. Instead, approaches that avoid or correct for measurement error, especially PV methodology, stand out as the method of choice. We conclude with practical recommendations for secondary analysts and data-producing organizations.
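In practice, the PV approach mentioned above requires running each secondary analysis once per plausible value and pooling the results. The following Python sketch is a rough, hypothetical illustration (not taken from the paper) of pooling a regression slope across five PVs with Rubin's combining rules; the DataFrame `df`, the column names `pv1`-`pv5` and `parental_edu`, and the unweighted OLS setup are all assumptions for illustration only. Real LSAS analyses would additionally use survey weights and replicate-based variance estimation.

import numpy as np
import statsmodels.api as sm

def pooled_slope(df, pv_cols, predictor):
    """Regress each plausible value on the predictor and pool the slope.

    df        -- pandas DataFrame holding the PV columns and the predictor
    pv_cols   -- list of plausible-value column names (e.g., ["pv1", ..., "pv5"])
    predictor -- name of the predictor column
    """
    estimates, variances = [], []
    for pv in pv_cols:
        X = sm.add_constant(df[predictor])          # intercept + predictor
        fit = sm.OLS(df[pv], X).fit()               # one analysis per PV
        estimates.append(fit.params[predictor])
        variances.append(fit.bse[predictor] ** 2)

    m = len(pv_cols)
    q_bar = np.mean(estimates)                      # pooled point estimate
    u_bar = np.mean(variances)                      # within-imputation variance
    b = np.var(estimates, ddof=1)                   # between-imputation variance
    total_var = u_bar + (1 + 1 / m) * b             # Rubin's total variance
    return q_bar, np.sqrt(total_var)

# Illustrative call (column names are hypothetical, not from any specific LSAS):
# slope, se = pooled_slope(df, [f"pv{i}" for i in range(1, 6)], "parental_edu")

The key point of the sketch is that the uncertainty in the pooled estimate reflects both the sampling variance within each PV analysis and the between-PV variance that captures measurement uncertainty, which a single test score would ignore.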

Highlights

  • In the last two decades, large-scale assessment surveys (LSAS) have expanded considerably in number and scope

  • We review the three principal options that applied researchers have at their disposal to incorporate skill measures from LSAS in their secondary analyses: test scores, structural equation modeling (SEM), and plausible values (PVs)

  • There are good reasons for secondary analysts to gradually move away from using test scores—or at least to be mindful of the shortcomings of deceptively simple test scores in the context of LSAS, where the interest is in population quantities


Introduction

In the last two decades, large-scale assessment surveys (LSAS) have expanded considerably in number and scope. National and international LSAS, such as PISA, TIMSS, PIAAC, NEPS, or NAEP, provide a wealth of data on cognitive and socio-emotional (or “non-cognitive”) skills of children, youth, and adults. This increasing data availability has led to a veritable surge in investigations in economics, psychology, and sociology on issues such as skill formation, inequality in skills, or labor market returns to skills. The skill measures provided by LSAS, however, typically rest on advanced statistical models and scoring techniques, and there is uncertainty among applied researchers about the statistical assumptions and computational details behind these models and scoring techniques, their respective pros and cons, and how to best incorporate the resulting skill measures in secondary analyses. Less-than-optimal practices may result in faulty analyses and erroneous substantive conclusions.
