Abstract

In the ongoing wave of impact driven by large language models (LLMs) such as ChatGPT, adapting LLMs to the medical domain has emerged as a crucial research frontier. Because mainstream LLMs tend to be designed for general-purpose applications, constructing a medical LLM through domain adaptation remains a significant challenge. While instruction-tuning, particularly based on low-rank adaptation (LoRA), has recently become a frequently employed strategy for fine-tuning LLMs, its precise role in domain adaptation remains unclear. Here, we investigated how LoRA-based instruction-tuning improves performance on Japanese medical question-answering tasks by employing a multifaceted evaluation of multiple-choice questions, including scoring based on "Exact match" and "Gestalt distance" in addition to conventional accuracy. Our findings suggest that LoRA-based instruction-tuning can partially incorporate domain-specific knowledge into LLMs, with larger models demonstrating more pronounced effects. Furthermore, our results underscore the potential of adapting English-centric models for Japanese applications in domain adaptation, while also highlighting the persisting limitations of Japanese-centric models. This initiative represents a pioneering effort in enabling medical institutions to fine-tune and operate models without relying on external services.
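To illustrate the two non-standard scoring schemes named above, the sketch below assumes "Exact match" means string equality between the model's answer and the reference, and that "Gestalt distance" is based on Ratcliff/Obershelp gestalt pattern matching (the algorithm behind Python's `difflib.SequenceMatcher`); the function names and the normalization are illustrative, not the paper's actual implementation.

```python
from difflib import SequenceMatcher

def exact_match(pred: str, ref: str) -> bool:
    """Strict scoring: the answer counts only if it matches the reference exactly."""
    return pred.strip() == ref.strip()

def gestalt_similarity(pred: str, ref: str) -> float:
    """Softer scoring: Ratcliff/Obershelp ("gestalt pattern matching")
    similarity in [0, 1], as computed by difflib.SequenceMatcher."""
    return SequenceMatcher(None, pred.strip(), ref.strip()).ratio()

# A slightly garbled answer fails exact match but keeps partial credit
# under the gestalt-based score:
print(exact_match("b. Asprin", "b. Aspirin"))        # False
print(gestalt_similarity("b. Asprin", "b. Aspirin"))  # close to 1.0
```

Unlike plain accuracy, the gestalt-based score rewards answers that are nearly correct at the character level, which is useful when generative models produce minor spelling or formatting deviations.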

