In-context learning enables multimodal large language models to classify cancer pathology images

Dyke Ferber,Georg Wölflein,Isabella C Wiest,Marta Ligero,Srividhya Sainath,Narmin Ghaffari Laleh,Omar S M El Nahhas,Gustav Müller-Franzes,Dirk Jäger,Daniel Truhn,Jakob Nikolas Kather

doi:10.1038/s41467-024-51465-9

Abstract

Medical image classification requires labeled, task-specific datasets which are used to train deep learning networks de novo, or to fine-tune foundation models. However, this process is computationally and technically demanding. In language processing, in-context learning provides an alternative, where models learn from within prompts, bypassing the need for parameter updates. Yet, in-context learning remains underexplored in medical image analysis. Here, we systematically evaluate the model Generative Pretrained Transformer 4 with Vision capabilities (GPT-4V) on cancer image processing with in-context learning on three cancer histopathology tasks of high importance: Classification of tissue subtypes in colorectal cancer, colon polyp subtyping and breast tumor detection in lymph node sections. Our results show that in-context learning is sufficient to match or even outperform specialized neural networks trained for particular tasks, while only requiring a minimal number of samples. In summary, this study demonstrates that large vision language models trained on non-domain specific data can be applied out-of-the box to solve medical image-processing tasks in histopathology. This democratizes access of generalist AI models to medical experts without technical background especially for areas where annotated data is scarce.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

In-context learning enables multimodal large language models to classify cancer pathology images

Abstract

Talk to us

Similar Papers

More From: Nature Communications

Lead the way for us

Journal: Nature Communications	Publication Date: Nov 21, 2024
License type: cc-by

Similar Papers

CancerGPT for few shot drug pair synergy prediction using large pretrained language models
Tianhao Li ... Yejin Kim
npj Digital Medicine | VOL. 7
Tianhao Li, et. al.Tianhao Li ... Yejin Kim
19 Feb 2024
npj Digital Medicine | VOL. 7

A Large and Diverse Arabic Corpus for Language Modeling
Abbas Raza Ali ... Hasan Raza Ali
Procedia Computer Science | VOL. 225
Abbas Raza Ali, et. al.Abbas Raza Ali ... Hasan Raza Ali
01 Jan 2023
Procedia Computer Science | VOL. 225

Large Language Models: A Historical and Sociocultural Perspective.
Eugene Yu Ji
Cognitive science | VOL. 48
Eugene Yu JiEugene Yu Ji
01 Mar 2024
Cognitive science | VOL. 48

How Can IJDS Authors, Reviewers, and Editors Use (and Misuse) Generative AI?
Galit Shmueli ... W Nick Street
INFORMS Journal on Data Science | VOL. 2
Galit Shmueli, et. al.Galit Shmueli ... W Nick Street
01 Apr 2023
INFORMS Journal on Data Science | VOL. 2

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

In-context learning enables multimodal large language models to classify cancer pathology images

Abstract

Talk to us

Similar Papers

More From: Nature Communications