Abstract
We describe the Uppsala NLP submission to SemEval-2021 Task 2 on multilingual and cross-lingual word-in-context disambiguation. We explore the usefulness of three pre-trained multilingual language models: XLM-RoBERTa (XLMR), Multilingual BERT (mBERT), and multilingual distilled BERT (mDistilBERT). We compare these three models in two setups, fine-tuning and feature extraction; in the latter case we also experiment with dependency-based information. We find that fine-tuning is better than feature extraction. XLMR performs better than mBERT in the cross-lingual setting, both with fine-tuning and with feature extraction, whereas the two models give similar performance in the multilingual setting. mDistilBERT performs poorly with fine-tuning but gives results similar to the other models when used as a feature extractor. We submitted our two best systems, fine-tuned with XLMR and mBERT.
Highlights
SemEval-2021 Task 2: Multilingual and Cross-lingual Word-in-Context Disambiguation (MCL-WiC) (Martelli et al., 2021) extends WiC (Pilehvar and Camacho-Collados, 2019), a shared task at the IJCAI-19 SemDeep workshop (SemDeep-5).
XLMR gives the best results for all cross-lingual language pairs, with an improvement over Multilingual BERT (mBERT) of 4.1–10.5 percentage points.
We found that fine-tuning the language models is preferable to using them as feature extractors, either for a multi-layer perceptron (MLP) or for logistic regression (a feature-extraction sketch follows this list).
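The feature-extraction setup in the last highlight can be sketched as follows: embed the target word in each sentence with a frozen multilingual encoder, then train a shallow classifier on the pair of embeddings. This is a minimal sketch assuming the Hugging Face transformers and scikit-learn APIs; the model name, the mean pooling over subwords, and the concatenation-plus-product feature combination are illustrative assumptions, not necessarily the paper's exact configuration.

```python
# Hedged sketch: frozen XLM-R as a feature extractor for word-in-context
# classification. Pooling and feature combination are assumptions.
import numpy as np
import torch
from sklearn.linear_model import LogisticRegression
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("xlm-roberta-base")
model = AutoModel.from_pretrained("xlm-roberta-base")
model.eval()

def target_embedding(sentence: str, start: int, end: int) -> np.ndarray:
    """Mean-pool the final-layer vectors of the subword tokens that
    overlap the target word's character span [start, end)."""
    enc = tokenizer(sentence, return_offsets_mapping=True, return_tensors="pt")
    offsets = enc.pop("offset_mapping")[0].tolist()
    with torch.no_grad():
        hidden = model(**enc).last_hidden_state[0]  # (seq_len, hidden_dim)
    # Special tokens have empty (0, 0) offsets and are excluded by e > s.
    mask = torch.tensor([s < end and e > start and e > s for s, e in offsets])
    return hidden[mask].mean(dim=0).numpy()

def pair_features(s1, span1, s2, span2) -> np.ndarray:
    """Concatenate both target embeddings with their element-wise product."""
    v1, v2 = target_embedding(s1, *span1), target_embedding(s2, *span2)
    return np.concatenate([v1, v2, v1 * v2])

# Training: X stacks pair_features for each sentence pair, y holds 1 (same
# sense) or 0 (different senses); e.g.
#   clf = LogisticRegression(max_iter=1000).fit(X, y)
# sklearn's MLPClassifier could replace LogisticRegression for the MLP
# variant mentioned in the highlight above.
```

Keeping the encoder frozen makes training cheap, which is the appeal of this setup even though, per the results above, the fine-tuned models perform better.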
Summary
SemEval-2021 Task 2: Multilingual and Cross-lingual Word-in-Context Disambiguation (MCL-WiC) (Martelli et al., 2021) extends WiC (Pilehvar and Camacho-Collados, 2019), a shared task at the IJCAI-19 SemDeep workshop (SemDeep-5). WiC was proposed as a benchmark to evaluate context-sensitive word representations. The WiC dataset consists of English sentence pairs. Each pair has a target word, and the task is to determine, as binary classification, whether the target word is used with the same meaning or with different meanings in the two sentences. MCL-WiC extends WiC to multilingual and cross-lingual datasets and covers five languages. An example cross-lingual English-French pair is: "The cat chases after the mouse." / "La souris mange le fromage." ('The mouse eats the cheese'), where the system must decide whether the target words mouse and souris carry the same meaning.
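The fine-tuning setup, by contrast, trains the whole encoder end to end on such sentence pairs as a binary classification task. Below is a minimal sketch assuming the Hugging Face AutoModelForSequenceClassification API; the model name, learning rate, and the label attached to the example pair are illustrative assumptions, not the paper's exact configuration.

```python
# Hedged sketch: fine-tuning a multilingual encoder as a binary
# sentence-pair classifier (same sense vs. different sense).
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("xlm-roberta-base")
model = AutoModelForSequenceClassification.from_pretrained(
    "xlm-roberta-base", num_labels=2
)

# The cross-lingual example pair from the summary; both sentences are
# encoded jointly. The label value (1 = same meaning, 0 = different) is
# illustrative, not taken from the dataset.
batch = tokenizer(
    ["The cat chases after the mouse."],
    ["La souris mange le fromage."],
    padding=True, truncation=True, return_tensors="pt",
)
labels = torch.tensor([1])

# One optimization step; real training iterates over the full training set.
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)
optimizer.zero_grad()
loss = model(**batch, labels=labels).loss
loss.backward()
optimizer.step()
```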