A visual-language foundation model for pathology image analysis using medical Twitter.

Zhi Huang,Thomas J Montine,Federico Bianchi,Mert Yuksekgonul,James Zou

doi:10.1038/s41591-023-02504-3

Abstract

The lack of annotated publicly available medical images is a major barrier for computational research and education innovations. At the same time, many de-identified images and much knowledge are shared by clinicians on public forums such as medical Twitter. Here we harness these crowd platforms to curate OpenPath, a large dataset of 208,414 pathology images paired with natural language descriptions. We demonstrate the value of this resource by developing pathology language-image pretraining (PLIP), a multimodal artificial intelligence with both image and text understanding, which is trained on OpenPath. PLIP achieves state-of-the-art performances for classifying new pathology images across four external datasets: for zero-shot classification, PLIP achieves F1 scores of 0.565-0.832 compared to F1 scores of 0.030-0.481 for previous contrastive language-image pretrained model. Training a simple supervised classifier on top of PLIP embeddings also achieves 2.5% improvement in F1 scores compared to using other supervised model embeddings. Moreover, PLIP enables users to retrieve similar cases by either image or natural language search, greatly facilitating knowledge sharing. Our approach demonstrates that publicly shared medical information is a tremendous resource that can be harnessed to develop medical artificial intelligence for enhancing diagnosis, knowledge sharing and education.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A visual-language foundation model for pathology image analysis using medical Twitter.

Abstract

Talk to us

Similar Papers

More From: Nature Medicine

Lead the way for us

Journal: Nature Medicine	Publication Date: Aug 17, 2023
Citations: 103

Similar Papers

Abstract 4927: Combining single-cell ATAC and RNA sequencing for supervised cell annotation
Jaidip Gill ... Natasha Markuzon
Cancer Research | VOL. 84
Jaidip Gill, et. al.Jaidip Gill ... Natasha Markuzon
22 Mar 2024
Abstract 4927: Combining single-cell ATAC and RNA sequencing for supervised cell annotation
Jaidip Gill ... Natasha Markuzon

Do natural language search engines really understand what users want?
Nadjla Hariri
Online Information Review | VOL. 37
Nadjla HaririNadjla Hariri
12 Apr 2013
Online Information Review | VOL. 37

Focus of Attention in Decision Support Systems.
John O Gurney ... Kenneth
-
John O Gurney, et. al.John O Gurney ... Kenneth
12 May 1995
12 May 1995

Generating VHDL models from natural language descriptions
...
-
, et. al. ...
23 Sep 1994
23 Sep 1994

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A visual-language foundation model for pathology image analysis using medical Twitter.

Abstract

Talk to us

Similar Papers

More From: Nature Medicine