This paper explores the reliability of using ChatGPT to evaluate EFL writing by assessing its intra- and inter-rater reliability. Eighty-two compositions were randomly sampled from the Written English Corpus of Chinese Learners. These compositions were rated by three experienced raters with regard to ‘language’, ‘content’, and ‘organization’. The writing samples were also rated by ChatGPT twice, with an interval between the two rating sessions, and the average scores were calculated. An independent-samples t-test was conducted to compare the average scores given by ChatGPT and the human raters. Pearson correlation analyses were conducted between the two sets of overall scores given by ChatGPT to calculate the intra-rater reliability, and between the average scores given by ChatGPT and the human raters for the inter-rater reliability. The results of the comparative analysis show that ChatGPT may be used for evaluating EFL essays, as its scores are similar to those provided by reliable human raters. However, the results of the correlation analyses show that the intra-rater reliability of ChatGPT is not high enough to be acceptable, r = 0.575, p < 0.01, and the strength of the inter-rater reliability is only moderate, r = 0.508, p < 0.01. In addition, there is no significant relationship between the average scores given by ChatGPT and the human raters on the ‘organization’ of the writings, r = 0.181, p > 0.05. Thus, it can be concluded that ChatGPT is not a reliable tool for rating and scoring EFL writing with the prompt used in this study. One possible reason for the unreliability of ChatGPT as a rater of EFL writing appears to be its scoring of the ‘organization’ of the essays. These findings imply that while ChatGPT has potential as an evaluative tool, its current limitations, particularly in assessing organization, must be addressed before it can be reliably used in educational settings.
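To make the reliability analysis concrete, the sketch below illustrates, in Python with scipy and numpy, how the intra-rater correlation, inter-rater correlation, and independent-samples t-test described above could be computed. The score arrays are hypothetical placeholders, not the study's data; they simply stand in for ChatGPT's two rating rounds and the averaged human scores for the same essays.

```python
# Minimal sketch of the reliability analysis; the score arrays are hypothetical.
import numpy as np
from scipy import stats

# Placeholder scores for a handful of essays (not real data from the study)
chatgpt_round1 = np.array([72.0, 65.5, 80.0, 58.0, 74.5, 69.0])
chatgpt_round2 = np.array([70.0, 68.0, 78.5, 61.0, 73.0, 66.5])
human_average  = np.array([71.0, 64.0, 82.0, 60.0, 76.0, 70.5])

# Average of ChatGPT's two rating rounds, as described in the study
chatgpt_average = (chatgpt_round1 + chatgpt_round2) / 2

# Intra-rater reliability: correlation between ChatGPT's two rating rounds
r_intra, p_intra = stats.pearsonr(chatgpt_round1, chatgpt_round2)

# Inter-rater reliability: correlation between ChatGPT's average and the human average
r_inter, p_inter = stats.pearsonr(chatgpt_average, human_average)

# Independent-samples t-test comparing ChatGPT and human average scores
t_stat, p_ttest = stats.ttest_ind(chatgpt_average, human_average)

print(f"intra-rater r = {r_intra:.3f} (p = {p_intra:.3f})")
print(f"inter-rater r = {r_inter:.3f} (p = {p_inter:.3f})")
print(f"t = {t_stat:.3f} (p = {p_ttest:.3f})")
```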