Multilingual Models Research Articles

Clinical texts and documents contain rich healthcare information and knowledge, and processing them with state-of-the-art language technology is important for building intelligent systems that support healthcare and social good. This processing includes creating language understanding models and translating resources into other natural languages to share domain-specific cross-lingual knowledge. In this work, we investigate clinical text machine translation by examining multilingual neural network models based on deep learning, such as Transformer-based structures. Furthermore, to address the language resource imbalance issue, we carry out experiments using a transfer learning methodology based on massive multilingual pre-trained language models (MMPLMs). The experimental results on three sub-tasks, (1) clinical case (CC), (2) clinical terminology (CT), and (3) ontological concept (OC), show that our models achieved top-level performance in the ClinSpEn-2022 shared task on English-Spanish clinical domain data. Our expert-based human evaluations also demonstrate that the small-sized pre-trained language model (PLM) outperformed the other two extra-large language models by a large margin in clinical domain fine-tuning, a finding that had not previously been reported in the field. Finally, the transfer learning method works well in our experimental setting, using the WMT21fb model to accommodate a new language space, Spanish, that was not seen at the pre-training stage within WMT21fb itself; this deserves further exploration for clinical knowledge transformation, e.g. by investigating more languages. These research findings can shed some light on domain-specific machine translation development, especially in clinical and healthcare fields. Further research projects can build on our work to improve healthcare text analytics and knowledge transformation.
Our data is openly available for research purposes at: https://github.com/HECTA-UoM/ClinicalNMT.

Pre-trained language models (PLMs) based on transformer neural networks, developed in the field of natural language processing (NLP), offer great opportunities to improve automatic content analysis in communication science, especially for the coding of complex semantic categories in large datasets via supervised machine learning. However, three characteristics have so far impeded the widespread adoption of these methods in the disciplines that apply them: the dominance of English language models in NLP research, the necessary computing resources, and the effort required to produce training data to fine-tune PLMs. In this study, we address these challenges by using a multilingual transformer model in combination with the adapter extension to transformers and few-shot learning methods. We test our approach on a realistic use case from communication science: automatically detecting claims and arguments, together with their stance, in the German news debate on arms deliveries to Ukraine. In three experiments, we evaluate (1) data preprocessing strategies and model variants for this task, (2) the performance of different few-shot learning methods, and (3) how well the best setup performs on varying training set sizes in terms of validity, reliability, replicability and reproducibility of the results. We find that our proposed combination of transformer adapters with pattern exploiting training provides a parameter-efficient and easily shareable alternative to fully fine-tuning PLMs. It performs on par in terms of validity while overall providing better properties for application in communication studies. The results also show that pre-fine-tuning for a task on a near-domain dataset leads to substantial improvement, in particular in the few-shot setting. Further, the results indicate that it is useful to bias the dataset away from the viewpoints of specific prominent individuals.

Articles published on Multilingual Models

CL2CM: Improving Cross-Lingual Cross-Modal Retrieval via Cross-Lingual Knowledge Transfer

CONSIDER: Commonalities and Specialties Driven Multilingual Code Retrieval Framework

Multi-class hate speech detection in the Norwegian language using FAST-RNN and multilingual fine-tuned transformers

Construction and analysis of uncertainty indices based on multilingual text representations

BioLORD-2023: semantic textual representations fusing large language models and clinical knowledge graph insights.

A Bilingual Basque–Spanish Dataset of Parliamentary Sessions for the Development and Evaluation of Speech Technology

Neural machine translation of clinical text: an empirical investigation into multilingual pre-trained language models and transfer-learning.

Multilingualism in Kazakhstan's education system: Implementation and challenges

Beyond Language Barriers: Allowing Multiple Languages in Postsecondary Chemistry Classes Through Multilingual Machine Learning

Determining the bilingualism coefficient of future subject teachers using the associative experiment method

Assessing the risks and opportunities posed by AI-enhanced influence operations on social media

DReAMy: a library for the automatic analysis and annotation of dream reports with multilingual large language models

Initial Exploration of the Transformation of Technology Museums Empowered by Artificial Intelligence

A comparative study of cross-lingual sentiment analysis

On Language Policy in the Education of Young People in Multinational States

CodeFed: Federated Speech Recognition for Low-Resource Code-Switching Detection

Are Emotions Conveyed Across Machine Translations? Establishing an Analytical Process for the Effectiveness of Multilingual Sentiment Analysis with Italian Text

Few-shot learning for automated content analysis: Efficient coding of arguments and claims in the debate on arms deliveries to Ukraine

AGI-P: A Gender Identification Framework for Authorship Analysis Using Customized Fine-Tuning of Multilingual Language Model

Terminology in the Age of AI: The Transformation of Terminology Theory and Practice
