Abstract

Convolutional neural network (CNN) models obtain state-of-the-art performance on image classification, localization, and segmentation tasks. Limitations in computer hardware, most notably memory size in deep learning accelerator cards, prevent relatively large images, such as those from medical and satellite imaging, from being processed as a whole in their original resolution. A fully convolutional topology, such as U-Net, is typically trained on down-sampled images and then used for inference on images of their original size and resolution, by dividing the larger image into smaller (typically overlapping) tiles, making predictions on these tiles, and stitching them back together as the prediction for the whole image. In this study, we show that this tiling technique, combined with the translationally invariant nature of CNNs, causes small but relevant differences during inference that can be detrimental to the performance of the model. Here we quantify these variations in both medical (i.e., BraTS) and non-medical (i.e., satellite) images and show that training a 2D U-Net model on the whole image substantially improves the overall model performance. Finally, we compare 2D and 3D semantic segmentation models to show that providing CNN models with a wider context of the image in all three dimensions leads to more accurate and consistent predictions. Our results suggest that tiling the input to CNN models, while perhaps necessary to overcome the memory limitations in computer hardware, may lead to undesirable and unpredictable errors in the model's output that can only be adequately mitigated by increasing the input of the model to the largest possible tile size.

Highlights

  • Since their resurgence in 2012, convolutional neural networks (CNNs) have rapidly proved to be the state-of-the-art method for computer-aided diagnosis in medical imaging, and have led to improved accuracy in classification, localization, and segmentation tasks (Krizhevsky et al., 2012; Chen et al., 2016; Greenspan et al., 2016)

  • Convolutional networks are a natural fit for tiling methods, as they can be trained on images of one size and perform inference on images of a larger size by breaking the large image into smaller, overlapping tiles (Ronneberger et al., 2015; Çiçek et al., 2016; Roth et al., 2018)

  • The medical data used for our evaluations reflect the publicly available training dataset of the International Brain Tumor Segmentation (BraTS) challenge 2019 (Menze et al., 2014; Bakas et al., 2017a,b,c; Bakas et al., 2018)


Introduction

Since their resurgence in 2012, convolutional neural networks (CNNs) have rapidly proved to be the state-of-the-art method for computer-aided diagnosis in medical imaging, and have led to improved accuracy in classification, localization, and segmentation tasks (Krizhevsky et al., 2012; Chen et al., 2016; Greenspan et al., 2016). Tiling is often applied when using large images due to the memory limitations of the hardware (Roth et al., 2018). In CNN models, the activation maps of the intermediate layers use several times the memory footprint of the original input image; these activation maps can increase the allocated memory to hundreds of gigabytes. Convolutional networks are a natural fit for tiling methods, as they can be trained on images of one size and perform inference on images of a larger size by breaking the large image into smaller, overlapping tiles (Ronneberger et al., 2015; Çiçek et al., 2016; Roth et al., 2018). To perform the overlapping tiling at inference time, N × N tiles (or, in the 3D case, N × N × N tiles) are cropped from the whole image at uniformly spaced offsets along the image dimensions.
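
For illustration, the following is a minimal NumPy sketch of this overlapping-tile inference for a single-channel 2D image. The names `predict_fn`, `tile_size`, and `stride` are assumptions standing in for the trained model and its configuration, not the paper's implementation; the image is assumed to be at least as large as one tile, and overlapping predictions are averaged when stitched.

```python
import numpy as np

def predict_tiled(image, predict_fn, tile_size, stride):
    """Overlapping-tile inference: crop N x N tiles at uniformly spaced
    offsets, predict on each tile, and average overlapping outputs.

    image      -- 2D array of shape (H, W), with H, W >= tile_size
    predict_fn -- hypothetical callable mapping a (tile_size, tile_size)
                  tile to a per-pixel prediction of the same shape
    """
    h, w = image.shape
    out = np.zeros((h, w), dtype=np.float32)
    counts = np.zeros((h, w), dtype=np.float32)

    # Uniformly spaced offsets; the final offset is clamped so the last
    # tile ends exactly at the image border.
    ys = list(range(0, h - tile_size + 1, stride))
    xs = list(range(0, w - tile_size + 1, stride))
    if ys[-1] != h - tile_size:
        ys.append(h - tile_size)
    if xs[-1] != w - tile_size:
        xs.append(w - tile_size)

    for y in ys:
        for x in xs:
            tile = image[y:y + tile_size, x:x + tile_size]
            pred = predict_fn(tile)
            out[y:y + tile_size, x:x + tile_size] += pred
            counts[y:y + tile_size, x:x + tile_size] += 1

    # Every pixel is covered by at least one tile, so counts > 0.
    return out / counts
```

With `stride` equal to `tile_size` the tiles abut without overlap; smaller strides increase the overlap, averaging more tile predictions per pixel near the borders where the tile-induced differences studied here arise.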

