Effectiveness of Pre-Trained Language Models for the Japanese Winograd Schema Challenge

Keigo Takahashi,Mamoru Komachi,Teruaki Oka

doi:10.20965/jaciii.2023.p0511

Abstract

This paper compares Japanese and multilingual language models (LMs) in a Japanese pronoun reference resolution task to determine the factors of LMs that contribute to Japanese pronoun resolution. Specifically, we tackle the Japanese Winograd schema challenge task (WSC task), which is a well-known pronoun reference resolution task. The Japanese WSC task requires inter-sentential analysis, which is more challenging to solve than intra-sentential analysis. A previous study evaluated pre-trained multilingual LMs in terms of training language on the target WSC task, including Japanese. However, the study did not perform pre-trained LM-wise evaluations, focusing on the training language-wise evaluations with a multilingual WSC task. Furthermore, it did not investigate the effectiveness of factors (e.g., model size, learning settings in the pre-training phase, or multilingualism) to improve the performance. In our study, we compare the performance of inter-sentential analysis on the Japanese WSC task for several pre-trained LMs, including multilingual ones. Our results confirm that XLM, a pre-trained LM on multiple languages, performs the best among all considered LMs, which we attribute to the amount of data in the pre-training phase.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Advanced Computational Intelligence and Intelligent Informatics	Publication Date: May 20, 2023
Citations: 1	License type: cc-by-nd

R Discovery Prime

R Discovery Prime

Effectiveness of Pre-Trained Language Models for the Japanese Winograd Schema Challenge

Abstract

Talk to us

Similar Papers

More From: Journal of Advanced Computational Intelligence and Intelligent Informatics

Lead the way for us

Similar Papers

Neural Transfer Learning For Vietnamese Sentiment Analysis Using Pre-trained Contextual Language Models
An Pha Le ... Tran Vu Pham
-
An Pha Le, et. al.An Pha Le ... Tran Vu Pham
16 Dec 2021
16 Dec 2021

Improving Pre-Trained Multilingual Model with Vocabulary Expansion
Hai Wang ... Dian Yu
-
Hai Wang, et. al.Hai Wang ... Dian Yu
01 Jan 2019
01 Jan 2019

Towards an Enhanced Understanding of Bias in Pre-trained Neural Language Models: A Survey with Special Emphasis on Affective Bias
Anoop K ... Lajish V L
-
Anoop K, et. al. Anoop K ... Lajish V L
01 Jan 2021
01 Jan 2021

Improving Cross-lingual Information Retrieval on Low-Resource Languages via Optimal Transport Distillation
Zhiqi Huang ... James Allan
-
Zhiqi Huang, et. al.Zhiqi Huang ... James Allan
27 Feb 2023
27 Feb 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Effectiveness of Pre-Trained Language Models for the Japanese Winograd Schema Challenge

Abstract

Talk to us

Similar Papers

More From: Journal of Advanced Computational Intelligence and Intelligent Informatics