Abstract
Recently, multimodal representation learning for images combined with other information, such as numerical or language data, has gained much attention. The aim of the current study was to analyze the diagnostic performance of deep multimodal representation model-based integration of tumor images, patient background, and blood biomarkers for the differentiation of liver tumors observed using B-mode ultrasonography (US). First, we applied supervised learning with a convolutional neural network (CNN) to 972 liver nodules in the training and development sets to develop a predictive model using segmented B-mode tumor images. We also applied a deep multimodal representation model to integrate patient background and blood biomarker information with the B-mode images. We then investigated the performance of the models in an independent test set of 108 liver nodules. Using only the segmented B-mode images, the diagnostic accuracy and area under the curve (AUC) were 68.52% and 0.721, respectively. As patient background and blood biomarker information was integrated, diagnostic performance increased in a stepwise manner. The diagnostic accuracy and AUC of the multimodal DL model, which integrated the B-mode tumor image with patient age, sex, aspartate aminotransferase, alanine aminotransferase, platelet count, and albumin data, reached 96.30% and 0.994, respectively. Multimodal representation learning that integrated patient background and blood biomarkers with the US image outperformed the CNN model using US images alone. We expect that the deep multimodal representation model could be a feasible and acceptable tool for the definitive diagnosis of liver tumors using B-mode US.
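The fusion strategy described above — combining a CNN image representation with tabular clinical features — can be sketched in a minimal form. The snippet below is an illustration only, not the authors' implementation: the image embedding is random data standing in for the output of a trained CNN backbone, the six clinical features (age, sex, AST, ALT, platelet count, albumin) are synthetic, and the linear head's weights are random rather than learned.

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-in for a CNN embedding of a segmented B-mode tumor image.
# In the study's setting this would come from the trained CNN backbone;
# here it is random data for illustration only.
n_samples, img_dim = 8, 16
image_embedding = rng.normal(size=(n_samples, img_dim))

# Synthetic clinical features standing in for:
# age, sex, AST, ALT, platelet count, albumin.
clinical = rng.normal(size=(n_samples, 6))

# Standardize each clinical feature so scales are comparable
# before fusing with the image representation.
clinical = (clinical - clinical.mean(axis=0)) / clinical.std(axis=0)

# Late fusion: concatenate image and clinical representations.
fused = np.concatenate([image_embedding, clinical], axis=1)

# A linear classification head over the fused representation
# (weights are random here; in practice they are learned jointly).
weights = rng.normal(size=fused.shape[1])
logits = fused @ weights
probs = 1.0 / (1.0 + np.exp(-logits))  # sigmoid -> per-nodule probability

print(fused.shape)  # fused vector: 16 image dims + 6 clinical dims
print(probs.shape)
```

In practice, both branches and the classification head would be trained jointly end to end, so that the image and clinical representations are optimized together for the diagnostic task.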
Highlights
Ultrasonography (US) is widely used for hepatocellular carcinoma (HCC) surveillance to screen high-risk populations because of its cost-effectiveness and non-invasiveness
Since B-mode US provides structural information that may reflect the histological characteristics of the tumor,[2] precise and objective recognition of B-mode images using a machine learning (ML) approach has the potential to become a powerful tool for the qualitative diagnosis of liver tumors
Summary
Ultrasonography (US) is widely used for hepatocellular carcinoma (HCC) surveillance to screen high-risk populations because of its cost-effectiveness and non-invasiveness. A definitive diagnosis of liver tumors observed using B-mode sonography can be difficult because of the low specificity of this modality.[1] Currently, B-mode sonography is usually used in combination with contrast imaging modalities, such as computed tomography (CT) or magnetic resonance imaging (MRI), to obtain a definitive diagnosis. Since B-mode US provides structural information that may reflect the histological characteristics of the tumor,[2] precise and objective recognition of B-mode images has the potential to become a powerful tool for the qualitative diagnosis of liver tumors. The ImageNet Large Scale Visual Recognition Challenge is an annual computer vision competition; in the competition held in 2017, DL technology with deep convolutional neural network (CNN)