Abstract

This study aims to examine factors affecting the difficulty of summarization items for Japanese learners. In item development, the connection between cognitive features related to the target construct and the difficulty of test items needs to be checked empirically in order to define the abilities to be measured. Previous studies have mainly focused on local reading comprehension, whereas this study addressed summarization skills at the paragraph level. For the experiment, the study developed original items to elicit the three macrorules used to represent the gist of a paragraph or text: deletion, generalization, and integration. The study then evaluated the influences that passages, distractor characteristics central to summarization processes, and response formats have on item difficulty, using item difficulty modeling. When editing the distractors, characteristics of L2 learners’ summarization were carefully reviewed and incorporated. The participants were 150 Japanese freshmen, who were asked to answer the experimental summarization items. The results of the linear logistic test model indicated that the main source of difficulty in summarization items was distractor characteristics. In particular, summaries containing unnecessary information or lacking necessary information increased difficulty. In addition, summaries containing detailed information, such as episodes and examples, or presenting a viewpoint that differed from the author’s also increased difficulty. The effect of passage differences was found to be minimal. Differences in response format moderately affected item difficulty, with the extended-matching format being slightly more difficult than the conventional multiple-choice format. This study suggests that test developers and item writers should pay careful attention to distractor development so that distractors reflect students’ errors when measuring the summarization skills of L2 learners.
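
As background, the linear logistic test model (LLTM) used for the item difficulty modeling is conventionally written as a decomposition of each item's difficulty into a weighted sum of the cognitive features the item involves; this standard formulation is given here only for orientation and is not spelled out in the abstract itself:

β_i = Σ_k q_ik · η_k + c

where q_ik indicates whether feature k (here, a passage, distractor, or response-format characteristic) applies to item i, η_k is that feature's contribution to difficulty, and c is a normalization constant.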

Highlights

  • Automatic item generation (AIG) is a promising methodology to reduce the cost and effort of human item writers and to create test items systematically (Gierl and Lai, 2012)

  • The current study investigated the influence of English passages, distractor characteristics, and response formats on the difficulty of summarization items

  • The results of this study suggested that the extended matching (EM) format was more difficult than the conventional multiple-choice (CMC) format


Summary

Introduction

Automatic item generation (AIG) is a promising methodology for reducing the cost and effort of human item writers and for creating test items systematically (Gierl and Lai, 2012). In the AIG framework, test items are generated from an item model, which includes manipulable elements in the item stem and options. Such elements are divided into two types: radicals and incidentals (Gorin, 2005). A test item should reflect only the targeted skills and abilities and assign a score that corresponds with high or low proficiency (Wilson, 2005). Such items are considered to accomplish construct representation (Embretson, 1983). When developing valid test items, it is important to prevent construct underrepresentation and to minimize construct-irrelevant variance.
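
To make the item-model idea concrete, the following is a minimal illustrative sketch (the names, stem, and feature values are hypothetical, not materials or code from the study): radicals are the manipulable elements expected to change the cognitive demand of generated items, such as the required macrorule or the flaw built into a distractor, while incidentals only vary surface features such as the passage topic.

```python
from itertools import product

# Hypothetical item model for illustration only.
# Radicals: elements expected to change item difficulty.
# Incidentals: elements that only change surface features.
ITEM_STEM = "Read the paragraph about {topic}. Which option best summarizes it?"

RADICALS = {
    "macrorule": ["deletion", "generalization", "integration"],
    "distractor_flaw": ["adds unnecessary detail", "omits necessary information"],
}
INCIDENTALS = {
    "topic": ["recycling", "sleep research"],
}

def generate_items():
    """Enumerate one item specification per combination of radical and incidental values."""
    keys = list(RADICALS) + list(INCIDENTALS)
    values = list(RADICALS.values()) + list(INCIDENTALS.values())
    for combo in product(*values):
        spec = dict(zip(keys, combo))
        yield {
            "stem": ITEM_STEM.format(topic=spec["topic"]),
            "required_macrorule": spec["macrorule"],
            "distractor_flaw": spec["distractor_flaw"],
        }

if __name__ == "__main__":
    for item in generate_items():
        print(item)
```

In a full AIG workflow, each generated specification would then be rendered into a complete stem and option set; the sketch above only enumerates the combinations of radical and incidental values.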
