Using generalizability theory to investigate the variability and reliability of EFL composition scores by human raters and e-rater

Elif Sari,Turgay Han

doi:10.30827/portalin.vi38.18056

Abstract

ABSTRACT: Using the generalizability theory (G-theory) as a theoretical framework, this study aimed at investigating the variability and reliability of holistic scores assigned by human raters and e-rater to the same EFL essays. Eighty argumentative essays written on two different topics by tertiary level Turkish EFL students were scored holistically by e-rater and eight human raters who received a detailed rater training. The results showed that e-rater and human raters assigned significantly different holistic scores to the same EFL essays. G-theory analyses revealed that human raters assigned considerably inconsistent scores to the same EFL essays although they were given a detailed rater training and more reliable ratings were attained when e-rater was integrated in the scoring procedure. Some implications are given for EFL writing assessment practices.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Porta Linguarum Revista Interuniversitaria de Didáctica de las Lenguas Extranjeras	Publication Date: Jun 1, 2022
Citations: 1	License type: cc-by

R Discovery Prime

R Discovery Prime

Using generalizability theory to investigate the variability and reliability of EFL composition scores by human raters and e-rater

Abstract

Talk to us

Similar Papers

More From: Porta Linguarum Revista Interuniversitaria de Didáctica de las Lenguas Extranjeras

Lead the way for us

Similar Papers

The impact of essay organization and overall quality on the holistic scoring of EFL writing: Perspectives from classroom english teachers and national writing raters
Junfei Li ... Jinyan Huang
Assessing Writing | VOL. 51
Junfei Li, et. al.Junfei Li ... Jinyan Huang
01 Jan 2021
Assessing Writing | VOL. 51

ANALYTIC SCORING OF TOEFL® CBT ESSAYS: SCORES FROM HUMANS AND E‐RATER®
Yong‐Won Lee ... Robert Kantor
ETS Research Report Series | VOL. 2008
Yong‐Won Lee, et. al.Yong‐Won Lee ... Robert Kantor
01 Jun 2008
ETS Research Report Series | VOL. 2008

Toward Automated Multi-trait Scoring of Essays: Investigating Links among Holistic, Analytic, and Text Feature Scores
Robert Kantor ... Claudia Gentile
Applied Linguistics | VOL. 31
Robert Kantor, et. al.Robert Kantor ... Claudia Gentile
25 Nov 2009
Applied Linguistics | VOL. 31

Automated Essay Scoring and the Deep Learning Black Box: How Are Rubric Scores Determined?
Vivekanandan S Kumar ... David Boulanger
International Journal of Artificial Intelligence in Education | VOL. 31
Vivekanandan S Kumar, et. al.Vivekanandan S Kumar ... David Boulanger
15 Sep 2020
International Journal of Artificial Intelligence in Education | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Using generalizability theory to investigate the variability and reliability of EFL composition scores by human raters and e-rater

Abstract

Talk to us

Similar Papers

More From: Porta Linguarum Revista Interuniversitaria de Didáctica de las Lenguas Extranjeras