A multimodal generative AI copilot for human pathology

Ming Y Lu, Bowen Chen, Drew F K Williamson, Richard J Chen, Melissa Zhao, Aaron K Chow, Kenji Ikemura, Ahrong Kim, Dimitra Pouli, Ankush Patel, Amr Soliman, Chengkuan Chen, Tong Ding, Judy J Wang, Georg Gerber, Ivy Liang, Long Phi Le, Anil V Parwani, Luca L Weishaupt, Faisal Mahmood

doi:10.1038/s41586-024-07618-3

Abstract

Computational pathology (refs. 1,2) has witnessed considerable progress in the development of both task-specific predictive models and task-agnostic self-supervised vision encoders (refs. 3,4). However, despite the explosive growth of generative artificial intelligence (AI), there have been few studies on building general-purpose multimodal AI assistants and copilots (ref. 5) tailored to pathology. Here we present PathChat, a vision-language generalist AI assistant for human pathology. We built PathChat by adapting a foundational vision encoder for pathology, combining it with a pretrained large language model and fine-tuning the whole system on over 456,000 diverse visual-language instructions consisting of 999,202 question and answer turns. We compare PathChat with several multimodal vision-language AI assistants and GPT-4V, which powers the commercially available multimodal general-purpose AI assistant ChatGPT-4 (ref. 6). PathChat achieved state-of-the-art performance on multiple-choice diagnostic questions from cases with diverse tissue origins and disease models. Furthermore, using open-ended questions and human expert evaluation, we found that overall PathChat produced more accurate and pathologist-preferable responses to diverse queries related to pathology. As an interactive vision-language AI copilot that can flexibly handle both visual and natural language inputs, PathChat may potentially find impactful applications in pathology education, research and human-in-the-loop clinical decision-making.

Journal: Nature	Publication Date: Jun 12, 2024
Citations: 9	License type: CC BY 4.0
