Item Response Theory for Evaluating Regression Algorithms

Joao V C Moraes,Ricardo B C Prudencio,Jessica T S Reinaldo,Telmo M Silva Filho

doi:10.1109/ijcnn48605.2020.9207030

https://doi.org/10.1109/ijcnn48605.2020.9207030

Abstract

Item Response Theory (IRT) is a tool developed in psychometrics to measure the latent abilities of human respondents based on their responses to items with different levels of difficulty. Recently, IRT has been applied to evaluation in AI by treating algorithms as respondents and AI tasks as items. In machine learning in particular, IRT has been applied to evaluate classifiers based on their predictions for each test instance. Based on a matrix of responses (classifiers vs instances), the IRT model estimates the latent difficulty and discrimination of each instance, as well as the ability of each classifier, such that a classifier receives a high ability value when it tends to correctly classify the most difficult instances. The IRT models previously adopted for evaluating classifiers cannot be directly applied to regression, since they rely on dichotomous responses (i.e., a response has to be either correct or incorrect). In this paper we propose a new IRT model, specifically designed to deal with nonnegative unbounded responses, which is suitable for modelling the absolute errors of regression algorithms. In the proposed model, responses follow a gamma distribution, parameterised according to respondents’ abilities and items’ difficulty and discrimination parameters. The proposed parameterisation results in item characteristic curves with more flexible shapes than the logistic curves widely adopted in IRT. The proposed model was evaluated with diverse regression algorithms and two benchmark datasets, one synthetic and one real. Useful insights were derived by inspecting regions in these datasets that present different levels of difficulty and discrimination.
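
To make the setup in the abstract concrete, here is a minimal sketch of the two ingredients it describes: a response matrix of absolute errors (regressors vs instances) and a gamma likelihood for those responses. The abstract does not state the paper's parameterisation, so the link between ability, difficulty, discrimination and the expected error used here is an illustrative assumption, and the function names and the fixed `shape` parameter are hypothetical.

```python
import numpy as np
from math import lgamma


def absolute_error_matrix(predictions, targets):
    """Build the response matrix: rows are regressors, columns are test instances."""
    predictions = np.asarray(predictions, dtype=float)  # (n_regressors, n_instances)
    targets = np.asarray(targets, dtype=float)          # (n_instances,)
    return np.abs(predictions - targets[np.newaxis, :])


def expected_error(ability, difficulty, discrimination):
    """ASSUMED link, not taken from the paper:
    mean error = exp(-discrimination_j * (ability_i - difficulty_j)).
    Higher ability relative to difficulty gives a smaller expected error.
    """
    ability = np.asarray(ability, dtype=float)[:, np.newaxis]
    difficulty = np.asarray(difficulty, dtype=float)[np.newaxis, :]
    discrimination = np.asarray(discrimination, dtype=float)[np.newaxis, :]
    return np.exp(-discrimination * (ability - difficulty))


def gamma_log_likelihood(errors, ability, difficulty, discrimination, shape=2.0):
    """Log-likelihood of the error matrix under a gamma distribution whose
    mean follows the assumed link above (shape is a hypothetical fixed value)."""
    mean = expected_error(ability, difficulty, discrimination)
    scale = mean / shape                                # gamma mean = shape * scale
    x = np.maximum(np.asarray(errors, dtype=float), 1e-12)  # guard against log(0)
    log_pdf = (
        (shape - 1.0) * np.log(x)
        - x / scale
        - shape * np.log(scale)
        - lgamma(shape)
    )
    return float(log_pdf.sum())
```

In a fitting procedure, a function like `gamma_log_likelihood` would be maximised over the ability, difficulty and discrimination parameters; the estimation method itself is not described in the abstract and is left out of this sketch.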

Similar Papers

Evaluating regression algorithms at the instance level using item response theory
João V.C Moraes ... Ricardo B.C Prudêncio
Knowledge-Based Systems | VOL. 240
04 Jan 2022

Classical and modern measurement theories, patient reports, and clinical outcomes
Contemporary Clinical Trials | VOL. 31
01 Jan 2009

Revisiting the Samejima–Bolfarine–Bazán IRT models: New features and extensions
Jorge Luis Bazán ... Caio L N Azevedo
Brazilian Journal of Probability and Statistics | VOL. 37
01 Mar 2023

K-means clustering of item characteristic curves and item information curves via functional principal component analysis
Francesca Fortuna ... Fabrizio Maturo
Quality & Quantity | VOL. 53
06 Mar 2018
