Computational Modeling Of Language Research Articles

Advances in computational language models increasingly enable adaptive support for self‐regulated learning (SRL) in digital learning environments (DLEs; eg, via automated feedback). However, the accuracy of those models is a common concern for educational stakeholders (eg, policymakers, researchers, teachers and learners themselves). We compared the accuracy of four Dutch language models (ie, spaCy medium, spaCy large, FastText and ConceptNet NumberBatch) in the context of secondary school students' learning of causal relations from expository texts, scaffolded by causal diagram completion. Since machine learning relies on human‐labelled data for the best results, we used a dataset with 10,193 students' causal diagram answers, compiled over a decade of research using a diagram completion intervention to enhance students' monitoring of their text comprehension. The language models were used in combination with four popular machine learning classifiers (ie, logistic regression, random forests, support vector machine and neural networks) to evaluate their performance on automatically scoring students' causal diagrams in terms of the correctness of events and their sequence (ie, the causal structure). Five performance metrics were studied, namely accuracy, precision, recall, F1 and the area under the curve of the receiver operating characteristic (ROC‐AUC). The spaCy medium model combined with the neural network classifier achieved the best performance for the correctness of causal events in four of the five metrics, while the ConceptNet NumberBatch model worked best for the correctness of the causal sequence. These evaluation results provide a criterion for model adoption to adaptively support SRL of causal relations in DLEs. Practitioner notesWhat is already known about this topic Accurate monitoring is a prerequisite for effective self‐regulation. Students struggle to accurately monitor their comprehension of causal relations in texts. Completing causal diagrams improves students' monitoring accuracy, but there is room for further improvement. Automatic scoring could be used to provide adaptive support during diagramming. What this paper adds Comparison of four Dutch word vector models combined with four machine learning classifiers for the automatic scoring of students' causal diagrams. Five performance metrics to evaluate the above solutions. Evaluation of the word vector models for estimating the semantic similarity between student and model answers. Implications for practice and/or policy High‐quality word vector models could (em)power adaptive support during causal diagramming via automatic scoring. The evaluated solutions can be embedded in digital learning environments (DLEs). Criteria for model adoption to adaptively support SRL of causal relations in DLEs. The increased saliency of (in)correct answers via automatic scoring might help to improve students' monitoring accuracy.

Read full abstract

To support a victim of violence and establish the correct penalty for the perpetrator, it is crucial to correctly evaluate and communicate the severity of the violence. Recent data have shown these communications to be biased. However, computational language models provide opportunities for automated evaluation of the severity to mitigate the biases. We investigated whether these biases can be removed with computational algorithms trained to measure the severity of violence described. In phase 1 (P1), participants (N=71) were instructed to write some text and type 5 keywords describing an event where they experienced physical violence and 1 keyword describing an event where they experienced psychological violence in an intimate partner relationship. They were also asked to rate the severity. In phase 2 (P2), another set of participants (N=40) read the texts and rated them for severity of violence on the same scale as in P1. We also quantified the text data to word embeddings. Machine learning was used to train a model to predict the severity ratings. For physical violence, there was a greater accuracy bias for humans (r2=0.22) compared to the computational model (r2=0.31; t38=-2.37, P=.023). For psychological violence, the accuracy bias was greater for humans (r2=0.058) than for the computational model (r2=0.35; t38=-14.58, P<.001). Participants in P1 experienced psychological violence as more severe (mean 6.46, SD 1.69) than participants rating the same events in P2 (mean 5.84, SD 2.80; t86=-2.22, P=.029<.05), whereas no calibration bias was found for the computational model (t134=1.30, P=.195). However, no calibration bias was found for physical violence for humans between P1 (mean 6.59, SD 1.81) and P2 (mean 7.54, SD 2.62; t86=1.32, P=.19) or for the computational model (t134=0.62, P=.534). There was no difference in the severity ratings between psychological and physical violence in P1. However, the bias (ie, the ratings in P2 minus the ratings in P1) was highly negatively correlated with the severity ratings in P1 (r2=0.29) and in P2 (r2=0.37), whereas the ratings in P1 and P2 were somewhat less correlated (r2=0.11) using the psychological and physical data combined. The results show that the computational model mitigates accuracy bias and removes calibration biases. These results suggest that computational models can be used for debiasing the severity evaluations of violence. These findings may have application in a legal context, prioritizing resources in society and how violent events are presented in the media.

Read full abstract

Computational Modeling Of Language Research Articles

Related Topics

Articles published on Computational Modeling Of Language

A computational approach to detecting the envelope of variation

Communicative efficiency in multimodal language directed at children and adults.

Navigating the semantic space: Unraveling the structure of meaning in psychosis using different computational language models

Towards adaptive support for self‐regulated learning of causal relations: Evaluating four Dutch word vector models

Unraveling lexical semantics in the brain: Comparing internal, external, and hybrid language models.

Languages with more speakers tend to be harder to (machine-)learn

Computational Analysis of Superfood Representations in News Media

So Cloze Yet So Far: N400 Amplitude Is Better Predicted by Distributional Information Than Human Predictability Judgements

An eye-tracking-with-EEG coregistration corpus of narrative sentences

Removing Biases in Communication of Severity Assessments of Intimate Partner Violence: Model Development and Evaluation

Informational content vs. discourse orientation: experimental and computational perspectives

A synchronized multimodal neuroimaging dataset for studying brain language processing

Computational Topic Models for Theological Investigations

Process and content in decisions from memory.

Hierarchy in language interpretation: evidence from behavioural experiments and computational modelling

Linguistic Variation and Change in 250 Years of English Scientific Writing: A Data-Driven Approach.

Evaluating Computational Language Models with Scaling Properties of Natural Language

The influence of place and time on lexical behavior: A distributional analysis.

Lexical Predictability During Natural Reading: Effects of Surprisal and Entropy Reduction.

Using stochastic language models (SLM) to map lexical, syntactic, and phonological information processing in the brain.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Computational Modeling Of Language Research Articles

Related Topics

Articles published on Computational Modeling Of Language

A computational approach to detecting the envelope of variation

Communicative efficiency in multimodal language directed at children and adults.

Navigating the semantic space: Unraveling the structure of meaning in psychosis using different computational language models

Towards adaptive support for self‐regulated learning of causal relations: Evaluating four Dutch word vector models

Unraveling lexical semantics in the brain: Comparing internal, external, and hybrid language models.

Languages with more speakers tend to be harder to (machine-)learn

Computational Analysis of Superfood Representations in News Media

So Cloze Yet So Far: N400 Amplitude Is Better Predicted by Distributional Information Than Human Predictability Judgements

An eye-tracking-with-EEG coregistration corpus of narrative sentences

Removing Biases in Communication of Severity Assessments of Intimate Partner Violence: Model Development and Evaluation

Informational content vs. discourse orientation: experimental and computational perspectives

A synchronized multimodal neuroimaging dataset for studying brain language processing

Computational Topic Models for Theological Investigations

Process and content in decisions from memory.

Hierarchy in language interpretation: evidence from behavioural experiments and computational modelling

Linguistic Variation and Change in 250 Years of English Scientific Writing: A Data-Driven Approach.

Evaluating Computational Language Models with Scaling Properties of Natural Language

The influence of place and time on lexical behavior: A distributional analysis.

Lexical Predictability During Natural Reading: Effects of Surprisal and Entropy Reduction.

Using stochastic language models (SLM) to map lexical, syntactic, and phonological information processing in the brain.