Abstract

Applications of interrater agreement (IRA) statistics for Likert scales are plentiful in research and practice. IRA may be implicated in job analysis, performance appraisal, panel interviews, and any other approach to gathering systematic observations. Any rating system involving subject-matter experts can also benefit from IRA as a measure of consensus. Further, IRA is fundamental to aggregation in multilevel research, which is becoming increasingly common as a means of addressing nesting. Although several technical descriptions of specific IRA statistics exist, this paper aims to provide a tractable orientation to common IRA indices to support application. The introductory overview is written with the intent of facilitating contrasts among IRA statistics by critically reviewing equations, interpretations, strengths, and weaknesses. Statistics considered include rwg, r′wg, rwg(p), the average deviation (AD), awg, the standard deviation (Swg), and the coefficient of variation (CVwg). Equations support quick calculation and contrasting of different agreement indices. The article also includes a “quick reference” table and three figures to help readers identify how IRA statistics differ and how interpretations of IRA depend strongly on the statistic employed. A brief consideration of recommended practices involving statistical and practical cutoff standards is presented, and conclusions are offered in light of the current literature.
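
For quick orientation, the single-item forms of several of these indices, as they are commonly defined in the IRA literature (rwg is typically attributed to James, Demaree, and Wolf, 1984, and AD to Burke, Finkelstein, and Dusig, 1999), can be sketched as follows. The notation (k raters, A response options, S²x the observed variance of the k ratings) is introduced here for illustration and may differ from the article's own:

    r_{wg} = 1 - \frac{S_x^2}{\sigma_{EU}^2}, \quad \text{where } \sigma_{EU}^2 = \frac{A^2 - 1}{12}

    AD_M = \frac{1}{k}\sum_{i=1}^{k}\lvert x_i - \bar{x}\rvert, \qquad S_{wg} = \sqrt{\frac{1}{k-1}\sum_{i=1}^{k}(x_i - \bar{x})^2}, \qquad CV_{wg} = \frac{S_{wg}}{\bar{x}}

AD is also frequently computed about the item median rather than the mean; the article's own equations should be consulted for the exact variants it reviews.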

Highlights

  • The assessment of interrater agreement (IRA) for Likert-type response scales has fundamental implications for a wide range of research and practice

  • IRA statistics are critical to justification of aggregation in multilevel research, but they are frequently applied in job analysis, performance appraisal, assessment centers, employment interviews, and so forth

  • IRA offers a perspective distinct from reliability: reliability concerns the consistency (relative pattern) of ratings, whereas agreement concerns the similarity of ratings' absolute levels (a brief numerical sketch of the distinction follows these highlights)
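
To make the reliability/agreement distinction concrete, here is a minimal numerical sketch (not drawn from the article; the data and Python code are illustrative only): two raters whose ratings of five targets are perfectly consistent, yet systematically offset in level.

    # Minimal sketch: perfect consistency, imperfect agreement.
    import numpy as np

    rater_1 = np.array([1, 2, 3, 4, 5])   # ratings of five targets on a 1-7 scale
    rater_2 = np.array([3, 4, 5, 6, 7])   # same pattern, shifted upward by 2 points

    # Consistency: the Pearson correlation across targets is perfect (r = 1.0) ...
    consistency = np.corrcoef(rater_1, rater_2)[0, 1]

    # ... yet agreement is imperfect: for each target the two ratings differ by 2,
    # so the average absolute deviation around each target's mean rating is 1.0, not 0.
    ratings_per_target = np.stack([rater_1, rater_2], axis=0)   # shape (raters, targets)
    ad_per_target = np.abs(ratings_per_target - ratings_per_target.mean(axis=0)).mean(axis=0)

    print(f"consistency (r) = {consistency:.2f}")   # 1.00
    print(f"AD per target   = {ad_per_target}")     # [1. 1. 1. 1. 1.]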


Summary

INTRODUCTION

The assessment of interrater agreement (IRA) for Likert-type response scales has fundamental implications for a wide range of research and practice. Underscoring the importance of IRA statistics is that, unlike interrater reliability and consistency statistics, IRA provides a single value of agreement for each rating target, thereby facilitating identification of units of raters that are very high or very low in agreement. This advantageous feature permits subsequent investigation of other substantive and theoretically interesting questions. LeBreton and Senter (2008) provided a seminal review of IRA and consistency statistics, but their focus was largely on the implications of these statistics for multilevel research methods and not on the many other applications of IRA (e.g., agreement in importance ratings collected in job analysis; Harvey, 1991). A comment on IRA and interrater consistency is offered.
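
As a concrete illustration of such a per-target agreement value, the sketch below (illustrative Python, not the article's own material; the uniform-null expected variance (A² − 1)/12 is the conventional default) computes a single-item rwg and the average deviation about the mean for one group of raters on a 5-point scale.

    # Hedged sketch of single-item IRA indices under a uniform null distribution.
    # Notation: k raters rate one target on an A-point Likert scale.
    import numpy as np

    def rwg_single_item(ratings, n_options):
        """rwg = 1 - S^2_x / sigma^2_EU, with sigma^2_EU = (A^2 - 1) / 12."""
        s2 = np.var(ratings, ddof=1)             # observed between-rater variance
        sigma2_eu = (n_options ** 2 - 1) / 12    # expected variance of a uniform null
        return 1 - s2 / sigma2_eu

    def average_deviation(ratings):
        """Average absolute deviation of each rating from the group mean."""
        ratings = np.asarray(ratings, dtype=float)
        return np.abs(ratings - ratings.mean()).mean()

    ratings = [4, 4, 5, 3, 4]                    # five raters, 5-point scale
    print(f"rwg = {rwg_single_item(ratings, n_options=5):.2f}")   # 0.75
    print(f"AD  = {average_deviation(ratings):.2f}")              # 0.40

Note that rwg can fall below zero when the observed variance exceeds the null variance, and AD is often taken about the item median rather than the mean; the sections on standards below address how such values are judged against cutoffs.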

General Logic
Potential Cause for Concern
AVERAGE DEVIATION INDEX
STANDARD DEVIATION
COEFFICIENT OF VARIATION
STANDARDS FOR AGREEMENT
Practical Standards
Statistical Standards
Current Best Practice in Judging Agreement Levels
Findings
CONCLUSION
