On Pre-trained Image Features and Synthetic Images for Deep Learning

Stefan Hinterstoisser,Kurt Konolige,Paul Wohlhart,Vincent Lepetit

doi:10.1007/978-3-030-11009-3_42

Stefan Hinterstoisser, Kurt Konolige + Show 2 more

Open Access

https://doi.org/10.1007/978-3-030-11009-3_42

Copy DOI

Abstract

Deep Learning methods usually require huge amounts of training data to perform at their full potential, and often require expensive manual labeling. Using synthetic images is therefore very attractive to train object detectors, as the labeling comes for free, and several approaches have been proposed to combine synthetic and real images for training. In this paper, we evaluate if ‘freezing’ the layers responsible for feature extraction to generic layers pre-trained on real images, and training only the remaining layers with plain OpenGL rendering may allow for training with synthetic images only. Our experiments with very recent deep architectures for object recognition (Faster-RCNN, R-FCN, Mask-RCNN) and image feature extractors (InceptionResnet and Resnet) show this simple approach performs surprisingly well.

Full Text