Seeking the real item difficulty: bias-corrected item difficulty and some consequences in Rasch and IRT modeling

Jari Metsämuuronen

doi:10.1007/s41237-022-00169-9

Abstract

When the response pattern in a test item deviates from the deterministic pattern, the percentage of correct answers (p) is shown to be a biased estimator for the latent item difficulty (π). This is specifically true with the items of medium item difficulty. Four elements of impurities in p are formalized in the binary settings and four new estimators of π are proposed and studied. Algebraic reasons and a simulation suggest that, except the case of deterministic item discrimination, the real item difficulty is almost always more extreme than what p indicates. This characteristic of p to be biased toward a medium-leveled item difficulty has a strict consequence to item response theory (IRT) and Rasch modeling. Because the classical estimator of item difficulty p is a biased estimator of the latent difficulty level, the item parameters A and B and the person parameter θ within IRT modeling are, consequently, biased estimators of item discrimination and item difficulty as well as ability levels of the test takers.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Behaviormetrika	Publication Date: Jun 17, 2022
Citations: 1	License type: open-access

R Discovery Prime

R Discovery Prime

Seeking the real item difficulty: bias-corrected item difficulty and some consequences in Rasch and IRT modeling

Abstract

Talk to us

Similar Papers

More From: Behaviormetrika

Lead the way for us

Similar Papers

Using the GLIMMIX Procedure in SAS 9.3 to Fit a Standard Dichotomous Rasch and Hierarchical 1-PL IRT Model
Ryan A Black ... Stephen F Butler
Applied Psychological Measurement | VOL. 36
Ryan A Black, et. al.Ryan A Black ... Stephen F Butler
25 Apr 2012
Applied Psychological Measurement | VOL. 36

Mixture IRT Model With a Higher-Order Structure for Latent Traits.
Hung-Yu Huang
Educational and Psychological Measurement | VOL. 77
Hung-Yu HuangHung-Yu Huang
11 Jul 2016
Educational and Psychological Measurement | VOL. 77

Classical and modern measurement theories, patient reports, and clinical outcomes

Contemporary Clinical Trials | VOL. 31

01 Jan 2009
Contemporary Clinical Trials | VOL. 31

A Multilevel Mixture IRT Framework for Modeling Response Times as Predictors or Indicators of Response Engagement in IRT Models.
Gabriel Nagy ... Esther Ulitzsch
Educational and Psychological Measurement | VOL. 82
Gabriel Nagy, et. al.Gabriel Nagy ... Esther Ulitzsch
13 Sep 2021
Educational and Psychological Measurement | VOL. 82

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Seeking the real item difficulty: bias-corrected item difficulty and some consequences in Rasch and IRT modeling

Abstract

Talk to us

Similar Papers

More From: Behaviormetrika