Abstract

Semantic segmentation performs pixel-wise classification for given images, which can be widely used in autonomous driving, robotics, medical diagnostics and etc. The recent advanced approaches have witnessed rapid progress in semantic segmentation. However, these supervised learning based methods rely heavily on large-scale datasets to acquire strong generalizing ability, such that they are coupled with some constraints. Firstly, human annotation of pixel-level segmentation masks is laborious and time-consuming, which causes relatively expensive training data and make it hard to deal with urgent tasks in dynamic environment. Secondly, the outstanding performance of the above data-hungry methods will decrease with few available training examples. In order to overcome the limitations of the supervised learning semantic segmentation methods, this paper proposes a generalized meta-learning framework, named Meta-Seg. It consists of a meta-learner and a base-learner. Specifically, the meta-learner learns a good initialization and a parameter update strategy from a distribution of few-shot semantic segmentation tasks. The base-learner can be any semantic segmentation models theoretically and can implement fast adaptation (that is updating parameters with few iterations) under the guidance of the meta-learner. In this work, the successful semantic segmentation model FCN8s is integrated into Meta-Seg. Experiments on the famous few-shot semantic segmentation dataset PASCAL $5^{i}$ prove Meta-Seg is a promising framework for few-shot semantic segmentation. Besides, this method can provide with reference for the relevant researches of meta-learning semantic segmentation.

Highlights

  • In recent years, deep learning, especially convolutional networks [1], have made significant breakthroughs in many visual understanding tasks including image classification [2]–[5], object detection [6]–[13] and semantic segmentation [14]–[21]

  • The experiment results prove the effectiveness of Meta-Seg and meta-learning is a promising approach for few-shot semantic segmentation

  • We design a generalized few-shot semantic segmentation framework named MetaSeg, which consists of a meta-learner and a base-leaner

Read more

Summary

Introduction

Deep learning, especially convolutional networks [1], have made significant breakthroughs in many visual understanding tasks including image classification [2]–[5], object detection [6]–[13] and semantic segmentation [14]–[21]. It still faces the challenges of overfitting in a few-shot regime. Weakly supervised semantic segmentation methods [23]–[27] are proposed to reduce the burden of data annotation. These methods merely solve the dependencies on annotated data, which still require plenty of training images.

Objectives
Methods
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.