Abstract
We describe a novel semi-supervised learning method that reduces the labelling effort needed to train convolutional neural networks (CNNs) when processing georeferenced imagery. This allows deep learning CNNs to be trained on a per-dataset basis, which is useful in domains where there is limited learning transferability across datasets. The method identifies representative subsets of images from an unlabelled dataset based on the latent representation of a location guided autoencoder. We assess the method's sensitivities to design options using four different ground-truthed datasets of georeferenced environmental monitoring images, where these include various scenes in aerial and seafloor imagery. Efficiency gains are achieved for all the aerial and seafloor image datasets analysed in our experiments, demonstrating the benefit of the method across application domains. Compared to CNNs of the same architecture trained using conventional transfer and active learning, the method achieves equivalent accuracy with an order of magnitude fewer annotations, and 85 % of the accuracy of CNNs trained conventionally with approximately 10,000 human annotations using just 40 prioritised annotations. The biggest gains in efficiency are seen in datasets with unbalanced class distributions and rare classes that have a relatively small number of observations.
Highlights
G EOREFERENCED visual images taken by aircraft, satellites and submersibles are widely used in environmental monitoring
This paper proposes a novel semi-supervised learning pipeline to classify georeferenced imagery using deep learning convolutional neural networks (CNNs)
The main advantage of the proposed location guided autoencoder (LGA)-SS method is that it can interpret images according to class boundaries of interest for environmental monitoring more efficiently than the alternative methods tested in this work, requiring less human effort and achieving better accuracy
Summary
G EOREFERENCED visual images taken by aircraft, satellites and submersibles are widely used in environmental monitoring. Spatial invariant features such as Local Binary Patterns (LBP) [15] and Spatial Pyramid Matching (SPM) [16] have been effectively applied to classification problems for land [17], [18] and seafloor imagery [19], [20] These types of features require manual tuning of parameters, or feature engineering, to efficiently describe each independent dataset. A separate classification process is needed, which typically requires further parameter tuning As such these methods often require expert knowledge of both the data and application domain, and have limited versatility when applied to multiple datasets
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: IEEE Transactions on Pattern Analysis and Machine Intelligence
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.