Abstract

The assessment of consistency in the categorical or ordinal decisions made by observers or raters is an important problem, especially in the medical field. The Fleiss Kappa, Cohen Kappa and Intra-class Correlation (ICC), which are commonly used for this purpose, are compared and a generalised approach to these measurements is presented. Differences between the Fleiss Kappa and multi-rater versions of the Cohen Kappa are explained, and it is shown how both may be applied to ordinal scoring with linear, quadratic or other weighting. The relationship between the quadratically weighted Fleiss and Cohen Kappas and pairwise ICC is clarified and generalised to multi-rater assessments. The AC coefficient is considered as an alternative measure of consistency, and the relevance of the Kappas and the AC to measuring content validity is explored.
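
The weighting mentioned in the abstract can be illustrated with the standard weighted-kappa formula, in which disagreements between ordinal scores are penalised in proportion to (linear) or the square of (quadratic) their distance. The sketch below is not taken from the paper; the function name, example scores and scale size are illustrative assumptions, and it shows a two-rater weighted Cohen's kappa of the form kappa_w = 1 - (weighted observed disagreement) / (weighted chance-expected disagreement).

```python
# Illustrative sketch (not the paper's implementation): weighted Cohen's kappa
# for two raters scoring the same subjects on an ordinal scale with k categories.
import numpy as np

def weighted_cohen_kappa(rater_a, rater_b, k, weighting="quadratic"):
    """Weighted Cohen's kappa for two raters and scores coded 0..k-1.

    weighting: "quadratic" uses w_ij = (i - j)^2, "linear" uses w_ij = |i - j|.
    """
    a = np.asarray(rater_a)
    b = np.asarray(rater_b)

    # Observed joint distribution of score pairs (k x k contingency table).
    observed = np.zeros((k, k))
    for i, j in zip(a, b):
        observed[i, j] += 1
    observed /= observed.sum()

    # Chance-expected joint distribution under independence of the raters.
    expected = np.outer(observed.sum(axis=1), observed.sum(axis=0))

    # Disagreement weights: quadratic penalises large discrepancies more heavily.
    i_idx, j_idx = np.meshgrid(np.arange(k), np.arange(k), indexing="ij")
    if weighting == "quadratic":
        w = (i_idx - j_idx) ** 2
    else:
        w = np.abs(i_idx - j_idx)

    # kappa_w = 1 - weighted observed disagreement / weighted expected disagreement
    return 1.0 - (w * observed).sum() / (w * expected).sum()


if __name__ == "__main__":
    # Hypothetical ordinal scores (0..4) from two raters on ten subjects.
    a = [0, 1, 2, 2, 3, 4, 4, 1, 0, 3]
    b = [0, 2, 2, 3, 3, 4, 3, 1, 1, 3]
    print(round(weighted_cohen_kappa(a, b, k=5), 3))
```

With quadratic weights this statistic is the form of kappa whose relationship to the pairwise ICC the paper discusses; the multi-rater generalisations considered in the full text are not reproduced here.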
