Abstract
Surgical scene segmentation is essential for enhancing surgical precision, yet it is frequently compromised by the scarcity and imbalance of available data. To address these challenges, semantic image synthesis methods based on generative adversarial networks and diffusion models have been developed. However, these models often yield non-diverse images and fail to capture small but critical tissue classes, limiting their effectiveness. In response, a class-aware semantic diffusion model (CASDM) is proposed: a novel approach that uses segmentation maps as conditions for image synthesis to tackle data scarcity and imbalance. Novel class-aware mean squared error and class-aware self-perceptual loss functions are defined to prioritize critical, less visible classes, thereby enhancing image quality and relevance. Furthermore, to the authors' knowledge, this is the first work to generate multi-class segmentation maps from text prompts that specify their contents. These maps are then used by CASDM to generate surgical scene images, enriching datasets for training and validating segmentation models. The evaluation assesses both image quality and downstream segmentation performance, and demonstrates the strong effectiveness and generalisability of CASDM in producing realistic image-map pairs, significantly advancing surgical scene segmentation across diverse and challenging datasets.
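To make the class-aware loss idea concrete, the sketch below shows one plausible form of a class-aware MSE for diffusion training: the per-pixel noise-prediction error is weighted by the class of each pixel in the conditioning segmentation map, so that rare, critical classes contribute more to the loss. This is a minimal illustration, not the paper's exact formulation; the weighting scheme (inverse pixel frequency here) and all names are assumptions.

```python
# Hypothetical sketch of a class-aware MSE loss; the actual CASDM loss
# may be defined differently. Weights here are an assumed inverse-frequency
# scheme that upweights rare (less visible) classes.
import torch


def class_aware_mse(pred_noise: torch.Tensor,
                    true_noise: torch.Tensor,
                    seg_map: torch.Tensor,
                    class_weights: torch.Tensor) -> torch.Tensor:
    """Per-pixel MSE weighted by the class of each pixel.

    pred_noise, true_noise: (B, C, H, W) predicted / ground-truth noise.
    seg_map: (B, H, W) integer class label per pixel.
    class_weights: (num_classes,) weight per class, larger for rare classes.
    """
    per_pixel = (pred_noise - true_noise).pow(2).mean(dim=1)  # (B, H, W)
    weights = class_weights[seg_map]                          # (B, H, W)
    return (weights * per_pixel).sum() / weights.sum()


# Example weight derivation (an assumption): inverse pixel frequency,
# normalized so the mean weight is 1.
def inverse_frequency_weights(seg_map: torch.Tensor, num_classes: int) -> torch.Tensor:
    counts = torch.bincount(seg_map.flatten(), minlength=num_classes).float()
    weights = counts.sum() / (counts + 1.0)  # +1 avoids division by zero
    return weights / weights.mean()
```

Under this scheme, a tissue class occupying only a small fraction of the image receives a proportionally larger weight, which is one way a loss could "prioritize critical, less visible classes" as the abstract describes.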