Abstract

A neural network is a mathematical model that is able to perform a task automatically or semi-automatically after learning the human knowledge that we provided. Moreover, a Convolutional Neural Network (CNN) is a type of neural network that has shown to efficiently learn tasks related to the area of image analysis, such as image segmentation, whose main purpose is to find regions or separable objects within an image. A more specific type of segmentation, called semantic segmentation, guarantees that each region has a semantic meaning by giving it a label or class. Since CNNs can automate the task of image semantic segmentation, they have been very useful for the medical area, applying them to the segmentation of organs or abnormalities (tumors). This work aims to improve the task of binary semantic segmentation of volumetric medical images acquired by Magnetic Resonance Imaging (MRI) using a pre-existing Three-Dimensional Convolutional Neural Network (3D CNN) architecture. We propose a formulation of a loss function for training this 3D CNN, for improving pixel-wise segmentation results. This loss function is formulated based on the idea of adapting a similarity coefficient, used for measuring the spatial overlap between the prediction and ground truth, and then using it to train the network. As contribution, the developed approach achieved good performance in a context where the pixel classes are imbalanced. We show how the choice of the loss function for training can affect the nal quality of the segmentation. We validate our proposal over two medical image semantic segmentation datasets and show comparisons in performance between the proposed loss function and other pre-existing loss functions used for binary semantic segmentation.

Highlights

  • Semantic segmentation in images is a type of segmentation that aims to find regions within an image that has a semantic meaning

  • 3.2 Matthews Correlation Coefficient (MCC) We propose the use of the MCC metric as a loss function for guiding the binary semantic segmentation learning process of volumetric medical images

  • The MCC, expressed in Equation (10), cannot be directly applied as a loss function for training a neural network, we show an adaptation for the calculation of each of its terms in Equations (11), (12), (13) and (14), in terms of operations between the prediction vectors containing the scores after softmax, and the ground truth vectors

Read more

Summary

Introduction

Semantic segmentation in images is a type of segmentation that aims to find regions within an image that has a semantic meaning. In other words, it ensures that pixels grouped into a particular region belong to the same class [1,2,3,4,5,6]. The task of semantic segmentation involves performing classification over every pixel of the image. A Convolutional Neural Network (CNN), is a type of neural network that can efficiently perform semantic segmentation in images. CNNs are very useful in this scenario, given that a single MRI image can contain tens to hundreds of images, and it is a slow and difficult task for a human to do it manually

Objectives
Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.