Bias-aware ranking from pairwise comparisons

Antonio Ferrara,Francesco Bonchi,Francesco Fabbri,Fariba Karimi,Claudia Wagner

doi:10.1007/s10618-024-01024-z

Abstract

Human feedback is often used, either directly or indirectly, as input to algorithmic decision making. However, humans are biased: if the algorithm that takes as input the human feedback does not control for potential biases, this might result in biased algorithmic decision making, which can have a tangible impact on people’s lives. In this paper, we study how to detect and correct for evaluators’ bias in the task of ranking people (or items) from pairwise comparisons. Specifically, we assume we are given pairwise comparisons of the items to be ranked produced by a set of evaluators. While the pairwise assessments of the evaluators should reflect to a certain extent the latent (unobservable) true quality scores of the items, they might be affected by each evaluator’s own bias against, or in favor, of some groups of items. By detecting and amending evaluators’ biases, we aim to produce a ranking of the items that is, as much as possible, in accordance with the ranking one would produce by having access to the latent quality scores. Our proposal is a novel method that extends the classic Bradley-Terry model by having a bias parameter for each evaluator which distorts the true quality score of each item, depending on the group the item belongs to. Thanks to the simplicity of the model, we are able to write explicitly its log-likelihood w.r.t. the parameters (i.e., items’ latent scores and evaluators’ bias) and optimize by means of the alternating approach. Our experiments on synthetic and real-world data confirm that our method is able to reconstruct the bias of each single evaluator extremely well and thus to outperform several non-trivial competitors in the task of producing a ranking which is as much as possible close to the unbiased ranking.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Bias-aware ranking from pairwise comparisons

Abstract

Talk to us

Similar Papers

More From: Data Mining and Knowledge Discovery

Lead the way for us

Journal: Data Mining and Knowledge Discovery	Publication Date: May 31, 2024
License type: CC BY 4.0

Similar Papers

Effects of Various Simulation Conditions on Latent-Trait Estimates: A Simulation Study
Hakan Koğar
International Journal of Assessment Tools in Education | VOL. 5
Hakan KoğarHakan Koğar
18 Mar 2018
International Journal of Assessment Tools in Education | VOL. 5

Item Response Modeling of Paired Comparison and Ranking Data
Alberto Maydeu-Olivares ... Anna Brown
Multivariate Behavioral Research | VOL. 45
Alberto Maydeu-Olivares, et. al.Alberto Maydeu-Olivares ... Anna Brown
30 Nov 2010
Multivariate Behavioral Research | VOL. 45

A Hierarchical Multi-Unidimensional IRT Approach for Analyzing Sparse, Multi-Group Data for Integrative Data Analysis.
Yan Huo ... Su-Young Kim
Psychometrika | VOL. 80
Yan Huo, et. al.Yan Huo ... Su-Young Kim
30 Sep 2014
Psychometrika | VOL. 80

SPECTRAL METHOD AND REGULARIZED MLE ARE BOTH OPTIMAL FOR TOP-K RANKING.
Yuxin Chen ... Kaizheng Wang
The Annals of Statistics | VOL. 47
Yuxin Chen, et. al.Yuxin Chen ... Kaizheng Wang
21 May 2019
The Annals of Statistics | VOL. 47

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bias-aware ranking from pairwise comparisons

Abstract

Talk to us

Similar Papers

More From: Data Mining and Knowledge Discovery