L2 and L1 semantic context indices as automated measures of lexical sophistication

Kátia Monteiro,Robert-Mihai Botarleanu,Scott Crossley,Mihai Dascălu

doi:10.1177/02655322221147924

Kátia Monteiro, Robert-Mihai Botarleanu + Show 2 more

https://doi.org/10.1177/02655322221147924

Copy DOI

Abstract

Lexical frequency benchmarks have been extensively used to investigate second language (L2) lexical sophistication, especially in language assessment studies. However, indices based on semantic co-occurrence, which may be a better representation of the experience language users have with lexical items, have not been sufficiently tested as benchmarks of lexical sophistication. To address this gap, we developed and tested indices based on semantic co-occurrence from two computational methods, namely, Latent Semantic Analysis and Word2Vec. The indices were developed from one L2 written corpus (i.e., EF Cambridge Open Language Database [EF-CAMDAT]) and one first language (L1) written corpus (i.e., Corpus of Contemporary American English [COCA] Magazine). Available L1 semantic context indices (i.e., Touchstone Applied Sciences Associates [TASA] indices) were also assessed. To validate the indices, they were used to predict L2 essay quality scores as judged by human raters. The models suggested that the semantic context indices developed from EF-CAMDAT and TASA, but not the COCA Magazine indices, explained unique variance in the presence of lexical sophistication measures. This study suggests that semantic context indices based on multi-level corpora, including L2 corpora, may provide a useful representation of the experience L2 writers have with input, which may assist with automatic scoring of L2 writing.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

L2 and L1 semantic context indices as automated measures of lexical sophistication

Abstract

Talk to us

Similar Papers

More From: Language Testing

Lead the way for us

Journal: Language Testing	Publication Date: Feb 2, 2023
Citations: 1

Similar Papers

An Efficient Annotation based Image Retrieval System by Mining of Semantically Related user Queries with Improved Markovian Model
M Sangeetha ... K Anandakumar
Indian Journal of Science and Technology | VOL. 8
M Sangeetha, et. al.M Sangeetha ... K Anandakumar
09 Dec 2015
Indian Journal of Science and Technology | VOL. 8

Does Semantic Search Performs Better than Lexical Search in the Task of Assisting Legal Opinion Writing?
Daniel De Souza Costa Pedroso ... Marcelo Ladeira
-
Daniel De Souza Costa Pedroso, et. al.Daniel De Souza Costa Pedroso ... Marcelo Ladeira
01 Dec 2019
01 Dec 2019

Text Influenced Molecular Indexing (TIMI): A Literature Database Mining Approach that Handles Text and Chemistry.
Suresh B Singh ... Richard D Hull
ChemInform | VOL. 34
Suresh B Singh, et. al.Suresh B Singh ... Richard D Hull
29 Jul 2003
ChemInform | VOL. 34

Supervised Semantic Indexing Using Sub-spacing
Sadiq Sani ... Stewart Massie
-
Sadiq Sani, et. al.Sadiq Sani ... Stewart Massie
01 Jan 2014
01 Jan 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

L2 and L1 semantic context indices as automated measures of lexical sophistication

Abstract

Talk to us

Similar Papers

More From: Language Testing