Multimodal Representation Learning via Maximization of Local Mutual Information.

Ruizhi Liao,Seth Berkowitz,William M. Wells,Miriam Cha,Steven Horng,Polina Golland,Daniel Moyer,Keegan Quigley

doi:10.1007/978-3-030-87196-3_26

Abstract

We propose and demonstrate a representation learning approach by maximizing the mutual information between local features of images and text. The goal of this approach is to learn useful image representations by taking advantage of the rich information contained in the free text that describes the findings in the image. Our method trains image and text encoders by encouraging the resulting representations to exhibit high local mutual information. We make use of recent advances in mutual information estimation with neural network discriminators. We argue that the sum of local mutual information is typically a lower bound on the global mutual information. Our experimental results in the downstream image classification tasks demonstrate the advantages of using local features for image-text representation learning. Our code is available at: https://github.com/RayRuizhiLiao/mutual_info_img_txt.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multimodal Representation Learning via Maximization of Local Mutual Information.

Abstract

Talk to us

Similar Papers

More From: Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention

Lead the way for us

Journal: Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention	Publication Date: Jan 1, 2021
Citations: 16

Similar Papers

InfoSeg: Unsupervised Semantic Image Segmentation with Mutual Information Maximization
Robert Harb ... Patrick Knöbelreiter
-
Robert Harb, et. al.Robert Harb ... Patrick Knöbelreiter
01 Jan 2020
01 Jan 2020

A strategy for multimodal deformable image registration to integrate PET/MR into radiotherapy treatment planning
Sara Leibfarth ... Daniela Thorwarth
Acta Oncologica | VOL. 52
Sara Leibfarth, et. al.Sara Leibfarth ... Daniela Thorwarth
23 Jul 2013
Acta Oncologica | VOL. 52

Mutual Information Regularization for Weakly-Supervised RGB-D Salient Object Detection
Aixuan Li ... Jing Zhang
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 34
Aixuan Li, et. al.Aixuan Li ... Jing Zhang
01 Jan 2024
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 34

Multi-sensor image registration by combining local self-similarity matching and mutual information
Xiaoping Liu ... Shuli Chen
Frontiers of Earth Science | VOL. 12
Xiaoping Liu, et. al.Xiaoping Liu ... Shuli Chen
01 Oct 2018
Frontiers of Earth Science | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multimodal Representation Learning via Maximization of Local Mutual Information.

Abstract

Talk to us

Similar Papers

More From: Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention