Abstract
Multilingual pre-trained models make it possible to develop natural language processing (NLP) applications for low-resource languages (LRLs) using models of resource-rich languages (RRLs). However, the structural characteristics of the target language can affect task-specific learning. In this paper, we investigate the influence of the structural diversity of languages on overall system performance. Specifically, we propose a customized approach that leverages task-specific data from low-resource language families via transfer learning from an RRL. Our findings are based on question-answering tasks using the XLM-R, mBERT, and IndicBERT transformer models and Indic languages (Hindi, Bengali, and Telugu). On the XQuAD-Hindi dataset, few-shot learning using Bengali improves the benchmark mBERT (F1/EM) score by +(10.86/7.87) and the XLM-R score by +(3.84/4.42). Few-shot learning using Telugu also improves the mBERT score by +(10.42/7.36) and the XLM-R score by +(3.04/2.72). In addition, our model demonstrates benchmark-compatible performance in a zero-shot setup with single-epoch task learning. This approach can be adapted to other NLP tasks for LRLs.