Exploring the effectiveness of instruction tuning in biomedical language processing

Omid Rohanian,Mohammadmahdi Nouriborji,Samaneh Kouchaki,Farhad Nooralahzadeh,Lei Clifton,David A Clifton

doi:10.1016/j.artmed.2024.103007

Abstract

Large Language Models (LLMs), particularly those similar to ChatGPT, have significantly influenced the field of Natural Language Processing (NLP). While these models excel in general language tasks, their performance in domain-specific downstream tasks such as biomedical and clinical Named Entity Recognition (NER), Relation Extraction (RE), and Medical Natural Language Inference (NLI) is still evolving. In this context, our study investigates the potential of instruction tuning for biomedical language processing, applying this technique to two general LLMs of substantial scale. We present a comprehensive, instruction-based model trained on a dataset that consists of approximately 200,000 instruction-focused samples. This dataset represents a carefully curated compilation of existing data, meticulously adapted and reformatted to align with the specific requirements of our instruction-based tasks. This initiative represents an important step in utilising such models to achieve results on par with specialised encoder-only models like BioBERT and BioClinicalBERT for various classical biomedical NLP tasks. Our work includes an analysis of the dataset’s composition and its impact on model performance, providing insights into the intricacies of instruction tuning. By sharing our codes, models, and the distinctively assembled instruction-based dataset, we seek to encourage ongoing research and development in this area.22Our code repository is available at https://github.com/nlpie-research/BioInstTune-LLM.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Exploring the effectiveness of instruction tuning in biomedical language processing

Abstract

Talk to us

Similar Papers

More From: Artificial Intelligence In Medicine

Lead the way for us

Similar Papers

Developments in The Field of Natural Language Processing

International Journal of Advanced Research in Computer Science | VOL. 8

30 Apr 2017
International Journal of Advanced Research in Computer Science | VOL. 8

Clinical named entity recognition and relation extraction using natural language processing of medical free text: A systematic review
David Fraile Navarro ... Shlomo Berkovsky
International Journal of Medical Informatics | VOL. 177
David Fraile Navarro, et. al.David Fraile Navarro ... Shlomo Berkovsky
05 Jun 2023
International Journal of Medical Informatics | VOL. 177

Advancing entity recognition in biomedicine via instruction tuning of large language models.
Vipina K Keloth ... Qiao Jin
Bioinformatics (Oxford, England) | VOL. 40
Vipina K Keloth, et. al.Vipina K Keloth ... Qiao Jin
21 Mar 2024
Bioinformatics (Oxford, England) | VOL. 40

Medical Knowledge Attention Enhanced Neural Model for Named Entity Recognition in Chinese EMR
Zhichang Zhang ... Yu Zhang
-
Zhichang Zhang, et. al.Zhichang Zhang ... Yu Zhang
01 Jan 2018
01 Jan 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Exploring the effectiveness of instruction tuning in biomedical language processing

Abstract

Talk to us

Similar Papers

More From: Artificial Intelligence In Medicine