Abstract

The halo effect is raters’ undesirable tendency to assign more similar ratings across rating criteria than they should. The impacts of the halo effect on ratings have been studied in rater-mediated L2 writing assessment. Little is known, however, about the extent to which the order of rating criteria in analytic rating scales is associated with the magnitude of group- and individual-level halo effects. This study therefore examines how the magnitude of the halo effect varies with rating criteria order in analytic rating scales. To select essays untainted by the effects of rating criteria order, a balanced Latin square design was implemented with four expert raters. Eleven trained novice Korean raters then rated the 30 screened essays on four rating criteria presented in three different orders: standard, reverse, and random. A three-facet rating scale model (L2 writer ability, rater severity, criterion difficulty) was fitted to estimate the group- and individual-level halo effects. The results showed that group-level halo effects of similar magnitude were detected under the standard- and reverse-order rubrics, whereas random presentation of the rating criteria decreased the group-level halo effect. A theoretical implication of the study is the need to consider rating criteria order as a source of construct-irrelevant easiness or difficulty when developing analytic rating scales.
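
The study itself does not spell out how its balanced Latin square was constructed, so the following is a minimal sketch of one standard construction for an even number of conditions (for instance, counterbalancing the order in which four rating criteria are presented across raters or sessions); the function name and the criterion labels are illustrative, not taken from the study.

    def balanced_latin_square(n):
        """Return an n x n balanced Latin square for an even n.

        Each condition appears once in every serial position, and each
        condition immediately follows every other condition exactly once
        across rows, which is what neutralizes simple order effects.
        """
        if n % 2 != 0:
            raise ValueError("This construction requires an even number of conditions.")
        # First row alternates between the low and high ends: 0, n-1, 1, n-2, ...
        first_row, low, high = [], 0, n - 1
        for i in range(n):
            if i % 2 == 0:
                first_row.append(low)
                low += 1
            else:
                first_row.append(high)
                high -= 1
        # Every subsequent row shifts the first row by +1 (mod n).
        return [[(c + r) % n for c in first_row] for r in range(n)]

    # Illustrative use: four presentation orders for four (hypothetical) criteria.
    criteria = ["content", "organization", "vocabulary", "language use"]
    for order in balanced_latin_square(len(criteria)):
        print([criteria[i] for i in order])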

Highlights

  • Background of study: The halo effect is defined as a rater’s cognitive bias in which the judgment of one rating criterion is influenced by judgments of other related rating criteria in a test taker’s performance.

  • Although design features of analytic rating scales have received less attention than the three established sources of the halo effect, rating criteria order has been suggested as an underlying mechanism of the halo effect in rater-mediated performance assessment (Balzer & Sulsky, 1992; Fisicaro & Lance, 1990; Judd, Drake, Downing, & Krosnick, 1991; Murphy, Jako, & Anhalt, 1993).

  • Group-level halo effect: A three-facet rating scale model (L2 writer ability, rater severity, criterion difficulty) was fitted in FACETS (Ver. 3.82) to estimate the magnitude of the group-level halo effect exhibited by the four expert raters.

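In many-facet Rasch measurement, which the FACETS program implements, a three-facet rating scale model of the kind named above is conventionally written along the following lines; the study’s exact notation is not reproduced here, so the symbols should be read as illustrative.

    \log\!\left(\frac{P_{nijk}}{P_{nij(k-1)}}\right) = \theta_n - \alpha_i - \delta_j - \tau_k

Here P_{nijk} is the probability that rater i awards writer n score category k on criterion j, \theta_n is the ability of L2 writer n, \alpha_i the severity of rater i, \delta_j the difficulty of criterion j, and \tau_k the threshold for choosing category k over k-1 on the shared rating scale.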

Summary

Introduction

Background of study

The halo effect is defined as a rater’s cognitive bias in which the judgment of one rating criterion is influenced by judgments of other related rating criteria in a test taker’s performance. The sources of the halo effect are known to include a rater’s general impression, a salient rating criterion, and rater inability, mainly induced by insufficient rater training (Lance, Lapointe, & Fisicaro, 1994). Although design features of analytic rating scales have received less attention than these three sources, rating criteria order has been suggested as an underlying mechanism of the halo effect in rater-mediated performance assessment (Balzer & Sulsky, 1992; Fisicaro & Lance, 1990; Judd, Drake, Downing, & Krosnick, 1991; Murphy, Jako, & Anhalt, 1993). Regarding the effects of rating criteria order on the halo effect, Lai et al. (2015) clearly stated the need to control the sequence in which rating criteria are rated in order to identify which rating criteria are most vulnerable to the halo effect in L2 writing assessment.
