Cross Attention Transformers for Multi-modal Unsupervised Whole-Body PET Anomaly Detection

Ashay Patel,Walter H.L Pinaya,Vicky Goh,Petru-Danial Tudiosu,M Jorge Cardoso,Gary Cook,Sebastien Ourselin

doi:10.59275/j.melba.2023-18c1

Ashay Patel, Walter H.L Pinaya + Show 5 more

Open Access

https://doi.org/10.59275/j.melba.2023-18c1

Copy DOI

Abstract

Cancer is a highly heterogeneous condition that can occur almost anywhere in the human body. [<sup>18</sup>F]fluorodeoxyglucose Positron Emission Tomography (<sup>18</sup>F-FDG PET) is a imaging modality commonly used to detect cancer due to its high sensitivity and clear visualisation of the pattern of metabolic activity. Nonetheless, as cancer is highly heterogeneous, it is challenging to train general-purpose discriminative cancer detection models, with data availability and disease complexity often cited as a limiting factor. Unsupervised learning methods, more specifically anomaly detection models, have been suggested as a putative solution. These models learn a healthy representation of tissue and detect cancer by predicting deviations from the healthy norm, which requires models capable of accurately learning long-range interactions between organs, their imaging patterns, and other abstract features with high levels of expressivity. Such characteristics are suitably satisfied by transformers, which have been shown to generate state-of-the-art results in unsupervised anomaly detection by training on normal data. This work expands upon such approaches by introducing multi-modal conditioning of the transformer via cross-attention i.e. supplying anatomical reference information from paired CT images to aid the PET anomaly detection task. Furthermore, we show the importance and impact of codebook sizing within a Vector Quantized Variational Autoencoder, on the ability of the transformer network to fulfill the task of anomaly detection. Using 294 whole-body PET/CT samples containing various cancer types, we show that our anomaly detection method is robust and capable of achieving accurate cancer localization results even in cases where normal training data is unavailable. In addition, we show the efficacy of this approach on out-of-sample data showcasing the generalizability of this approach even with limited training data. Lastly, we propose to combine model uncertainty with a new kernel density estimation approach, and show that it provides clinically and statistically significant improvements in accuracy and robustness, when compared to the classic residual-based anomaly maps. Overall, a superior performance is demonstrated against leading state-of-the-art alternatives, drawing attention to the potential of these approaches.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Cross Attention Transformers for Multi-modal Unsupervised Whole-Body PET Anomaly Detection

Abstract

Talk to us

Similar Papers

More From: Machine Learning for Biomedical Imaging

Lead the way for us

Journal: Machine Learning for Biomedical Imaging	Publication Date: Apr 19, 2023
Citations: 1

Similar Papers

Cross Attention Transformers for Multi-modal Unsupervised Whole-Body PET Anomaly Detection.
Ashay Patel ... M Jorge Cardoso
Deep generative models : Second MICCAI Workshop, DGM4MICCAI 2022, held in conjunction with MICCAI 2022, Singapore, September 22, 2022, proceedings. MICCAI Workshop on Deep Generative Models (2nd : 2022 : Singapore) | VOL. 13609
Ashay Patel, et. al.Ashay Patel ... M Jorge Cardoso
01 Jan 2021
01 Jan 2021

Holistic features for real-time crowd behaviour anomaly detection
Mark Marsden ... Kevin Mcguinness
-
Mark Marsden, et. al.Mark Marsden ... Kevin Mcguinness
01 Sep 2016
01 Sep 2016

FDG-PET Staging of Head and Neck Cancer--Can Improved Imaging Lead to Improved Treatment?
D L Schwartz ... R S Weber
JNCI Journal of the National Cancer Institute | VOL. 100
D L Schwartz, et. al.D L Schwartz ... R S Weber
13 May 2008
JNCI Journal of the National Cancer Institute | VOL. 100

Short-term wind speed estimation based on kernel density estimation using GNSS-reflectometry observation data
Kittipong Kasantikul ... Qiang Wang
-
Kittipong Kasantikul, et. al.Kittipong Kasantikul ... Qiang Wang
01 May 2017
01 May 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Cross Attention Transformers for Multi-modal Unsupervised Whole-Body PET Anomaly Detection

Abstract

Talk to us

Similar Papers

More From: Machine Learning for Biomedical Imaging