Abstract

Test-based accountability, value-added assessments, and much experimental and quasi-experimental research in education rely on achievement tests to measure student skills and knowledge. Yet we know little about fundamental properties of these tests, an important example being the extent of measurement error and its implications for educational policy and practice. While test vendors provide estimates of split-test reliability, these measures do not account for potentially important day-to-day differences in student performance. In this article, we demonstrate a credible, low-cost approach for estimating the overall extent of measurement error that can be applied when students take three or more tests in the subject of interest (e.g., state assessments in consecutive grades). Our method generalizes the test–retest framework by allowing for (a) growth or decay in knowledge and skills between tests, (b) tests being neither parallel nor vertically scaled, and (c) the degree of measurement error varying across tests. The approach maintains relatively unrestrictive, testable assumptions regarding the structure of student achievement growth. Estimation requires only descriptive statistics (e.g., test-score correlations). With student-level data, the extent and pattern of measurement-error heteroscedasticity can also be estimated. In turn, one can compute Bayesian posterior means of achievement and achievement gains given observed scores—estimators having statistical properties superior to those of the observed score (score gain). We employ math and English language arts test-score data from New York City to demonstrate these methods and estimate that the overall extent of test measurement error is at least twice as large as that reported by the test vendor.
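The identification idea behind a three-test design can be illustrated with a small simulation. In a first-order (quasi-simplex) model—true achievement in each grade a linear function of the prior grade's true achievement, with observed scores adding independent error—the middle test's reliability is identified from the three pairwise score correlations as r12·r23/r13. This is a minimal sketch of that classic result, not the paper's actual estimator; all parameter values (slopes, error variances, sample size) are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200_000  # illustrative sample size

# Latent achievement follows a first-order (simplex) growth process;
# slopes differ from 1, so tests need not be parallel or vertically scaled.
T1 = rng.normal(0.0, 1.0, n)
T2 = 0.9 * T1 + rng.normal(0.0, 0.5, n)
T3 = 1.1 * T2 + rng.normal(0.0, 0.5, n)

# Observed scores add measurement error whose variance differs across tests.
X1 = T1 + rng.normal(0.0, 0.40, n)
X2 = T2 + rng.normal(0.0, 0.55, n)
X3 = T3 + rng.normal(0.0, 0.50, n)

# Only descriptive statistics are needed: pairwise score correlations.
r12 = np.corrcoef(X1, X2)[0, 1]
r23 = np.corrcoef(X2, X3)[0, 1]
r13 = np.corrcoef(X1, X3)[0, 1]

# Under the simplex assumption, cov(T1,T3) = cov(T1,T2)*cov(T2,T3)/var(T2),
# so r12*r23/r13 recovers var(T2)/var(X2): the middle test's reliability.
rel2_hat = r12 * r23 / r13
rel2_true = np.var(T2) / np.var(X2)
print(round(rel2_hat, 3), round(rel2_true, 3))
```

Because the middle test's reliability is recovered from correlations alone, any day-to-day variation in performance is absorbed into the error term—unlike split-test reliability, which reflects only within-sitting inconsistency.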
