Abstract
Scientific studies on species composition and abundance distribution of fishes have considerable importance to the fishery industry, biodiversity protection, and marine ecosystem. In these studies, fish images are typically collected with the help of scuba divers or autonomous underwater vehicles. These images are then annotated manually by marine biologists. Such a process is certainly a tremendous waste of manpower and material resources. In recent years, the introduction of deep learning has helped making remarkable progress in this area. However, fish image classification can be considered as fine-grained problem, which is more challenging than common image classification, especially with low-quality and small-scale data. Meanwhile, well-known effective convolutional neural networks (CNNs) consistently require a large quantity of high-quality data. This paper presents a new method by improving transfer learning and squeeze-and-excitation networks for fine-grained fish image classification on low-quality and small-scale datasets. Our method enhances data augmentation through super-resolution reconstruction to enlarge the dataset with high-quality images, pre-pretrains, and pretrains to learn common and domain knowledge simultaneously while fine-tuning with professional skill. In addition, refined squeeze-and-excitation blocks are designed to improve bilinear CNNs for a fine-grained classification. Unlike well-known CNNs for image classification, our method can classify images with insufficient low-quality training data. Moreover, we compare the performance of our method with commonly used CNNs on small-scale fine-grained datasets, namely, Croatian and QUT fish datasets. The experimental results show that our method outperforms popular CNNs with higher fish classification accuracy, which indicates its potential applications in combination with other newly updated CNNs.
Highlights
With the advancement of technology in modern society, people have considerably better exploration and comprehension of our ocean
Feature extraction methods based on image processing technology have been proposed to classify fish images efficiently
In 2017, SENets [38] refreshed the score in ImageNet Large-Scale Visual Recognition Challenge (ILSVRC), wherein the top five errors were reduced to 2.251%
Summary
With the advancement of technology in modern society, people have considerably better exploration and comprehension of our ocean. Despite the aforementioned handcrafted low-level features, as well as conventional machine learning tools, such as SVMs and PCA, convolutional neural networks (CNNs) composed of only several convolutional and nonlinear layers have shown many advantages on visual tracking [27], [28], saliency detection [29], and image processing [30]–[32]. AlexNet [33] with deep CNNs obtained the highest classification result in comparison with conventional methods in the ImageNet Large-Scale Visual Recognition Challenge (ILSVRC) 2012 [34]. Recent works have introduced CNN-based methods to address the fish image classification problem. Small-scale fine-grained image classification remains a challenging task because CNNs cannot develop professional skill from limited images, (e.g., only 10 samples per category). On the basis of a B-CNN framework, our method enhances data augmentation, develops a new network block, and designs a new fine-tuning strategy to classify low-quality small-scale and fine-grained images.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.