Abstract

With the fast improvement of classification networks, many of these networks are being in use as backbones of semantic segmentation networks to improve the accuracy. Using different classification networks as the backbone of the same semantic segmentation network may show different accuracy performance. This paper selected the sandstone dataset and self-driving cars dataset to compare the accuracy performance differences of VGG-16, ResNet-34, and Inceptionv3 as the backbone of UNet, where the original encoder of the UNet is replaced by a backbone. The three backbone networks are imported from Segmentation Models library, and they have weights trained on ImageNet dataset. The best accuracy performance of the semantic segmentation network on the sandstone dataset is when VGG-16 is used as the backbone, it achieved 76.22% MIoU. On the other hand, the highest accuracy performance of the semantic segmentation network on self-driving cars dataset is 75.47% MIoU, achieved when Inceptionv3 is used as the backbone. However, the accuracy is improved when using all the three backbones with both datasets, compared to the accuracy performance of the UNet without using any backbone network.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.