A Comparison of Classical and Modern Measures of Internal Consistency.

Pasquale Anselmi,Daiana Colledani,Egidio Robusto

doi:10.3389/fpsyg.2019.02714

Pasquale Anselmi, Daiana Colledani + Show 1 more

Open Access

https://doi.org/10.3389/fpsyg.2019.02714

Copy DOI

Journal: Frontiers in psychology	Publication Date: Dec 4, 2019
Citations: 62	License type: CC BY 4.0

Affiliation: University of Padua

Abstract

Three measures of internal consistency – Kuder-Richardson Formula 20 (KR20), Cronbach’s alpha (α), and person separation reliability (R) – are considered. KR20 and α are common measures in classical test theory, whereas R is developed in modern test theory and, more precisely, in Rasch measurement. These three measures specify the observed variance as the sum of true variance and error variance. However, they differ for the way in which these quantities are obtained. KR20 uses the error variance of an “average” respondent from the sample, which overestimates the error variance of respondents with high or low scores. Conversely, R uses the actual average error variance of the sample. KR20 and α use respondents’ test scores in calculating the observed variance. This is potentially misleading because test scores are not linear representations of the underlying variable, whereas calculation of variance requires linearity. Contrariwise, if the data fit the Rasch model, the measures estimated for each respondent are on a linear scale, thus being numerically suitable for calculating the observed variance. Given these differences, R is expected to be a better index of internal consistency than KR20 and α. The present work compares the three measures on simulated data sets with dichotomous and polytomous items. It is shown that all the estimates of internal consistency decrease with the increasing of the skewness of the score distribution, with R decreasing to a larger extent. Thus, R is more conservative than KR20 and α, and prevents test users from believing a test has better measurement characteristics than it actually has. In addition, it is shown that Rasch-based infit and outfit person statistics can be used for handling data sets with random responses. Two options are described. The first one implies computing a more conservative estimate of internal consistency. The second one implies detecting individuals with random responses. When there are a few individuals with a consistent number of random responses, infit and outfit allow for correctly detecting almost all of them. Once these individuals are removed, a “cleaned” data set is obtained that can be used for computing a less biased estimate of internal consistency.

Highlights

The present work deals with internal consistency, which expresses the degree to which the items of a test produce similar scores
Compared with Kuder-Richardson Formula 20 (KR20) and α, R is expected to be a better index of internal consistency as the numerical values are linear rather than non-linear, and the actual average error variance of the sample is used instead on the error variance of an “average” respondent
In the case of a symmetric score distribution, the error variance estimated by KR20 and α largely resembles that resulting from R

Summary

Introduction

The present work deals with internal consistency, which expresses the degree to which the items of a test produce similar scores. Three measures of internal consistency are considered, namely Kuder-Richardson Formula 20 (KR20; Kuder and Richardson, 1937), Cronbach’s α (Cronbach, 1951), and person separation reliability (R; Wright and Masters, 1982). KR20 and α are well-known measures in classical test theory, where they are widely used to evaluate the internal consistency of cognitive and personality tests. The derivations of KR20 and α used continuous random variables for item scores (Sijtsma, 2009). As such, they include dichotomous scoring (e.g., correct/incorrect; yes/no) and ordered polytomous scoring (e.g., never/sometimes/often/always; very difficult/difficult/easy/very easy) as special cases. When all items are scored 1 or 0, the formula for KR20 reduces to that for α (Cronbach, 1951)

Objectives

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Comparison of Classical and Modern Measures of Internal Consistency.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in psychology

Lead the way for us

Similar Papers

Development of the Paranormal and Supernatural Beliefs Scale using classical and modern test theory
Charlotte E Dean ... Richard Wiseman
BMC Psychology | VOL. 9
Charlotte E Dean, et. al.Charlotte E Dean ... Richard Wiseman
23 Jun 2021
BMC Psychology | VOL. 9

Evaluating the Psychometric Properties of the 7-Item Persian Game Addiction Scale for Iranian Adolescents.
Chung-Ying Lin ... Anders Broström
Frontiers in Psychology | VOL. 10
Chung-Ying Lin, et. al.Chung-Ying Lin ... Anders Broström
05 Feb 2019
Frontiers in Psychology | VOL. 10

Psychometric Evaluation of the Persian eHealth Literacy Scale (eHEALS) Among Elder Iranians With Heart Failure.
Chung-Ying Lin ... Amir H Pakpour
Evaluation & the Health Professions | VOL. 43
Chung-Ying Lin, et. al.Chung-Ying Lin ... Amir H Pakpour
11 Feb 2019
Evaluation & the Health Professions | VOL. 43

The accuracy and consistency of mastery for each content domain using the Rasch and deterministic inputs, noisy “and” gate diagnostic classification models: a simulation study and a real-world analysis using data from the Korean Medical Licensing Examination
Dong Gi Seo ... Jae Kum Kim
Journal of educational evaluation for health professions | VOL. 18
Dong Gi Seo, et. al.Dong Gi Seo ... Jae Kum Kim
05 Jul 2021
Journal of educational evaluation for health professions | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Comparison of Classical and Modern Measures of Internal Consistency.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in psychology