Abstract

Automatic biology image classification is essential for biodiversity conservation and ecological study. Recently, due to the record-shattering performance, deep convolutional neural networks (DCNNs) have been used more often in biology image classification. However, training DCNNs requires a large amount of labeled data, which may be difficult to collect for some organisms. This study was carried out to exploit cross-domain transfer learning for DCNNs with limited data. According to the literature, previous studies mainly focus on transferring from ImageNet to a specific domain or transferring between two closely related domains. While this study explores deep transfer learning between species from different domains and analyzes the situation when there is a huge difference between the source domain and the target domain. Inspired by the analysis of previous studies, the effect of biology cross-domain image classification in transfer learning is proposed. In this work, the multiple transfer learning scheme is designed to exploit deep transfer learning on several biology image datasets from different domains. There may be a huge difference between the source domain and the target domain, causing poor performance on transfer learning. To address this problem, multistage transfer learning is proposed by introducing an intermediate domain. The experimental results show the effectiveness of cross-domain transfer learning and the importance of data amount and validate the potential of multistage transfer learning.

Highlights

  • IntroductionPlants are extremely complex and diverse, and there are millions of different plant species [1, 2]

  • Building accurate knowledge of the identity, taxonomy, the geographic distribution, and the evolution of living species are essential for a sustainable development of humanity as well as for biodiversity conservation.In terrestrial ecosystems, plants are extremely complex and diverse, and there are millions of different plant species [1, 2]

  • Rectified Linear Unit (ReLU) activation function is applied to the output of every convolutional layer in all deep convolutional neural networks (DCNNs) used in this paper. e ReLU activation function can be described by the following equation: f(z) max(0, z), (1)

Read more

Summary

Introduction

Plants are extremely complex and diverse, and there are millions of different plant species [1, 2]. Plants must be classified into identifiable groups in order to have a clear, organized way of identifying the diverse array of plants and some specific applications such as weed control [3, 4]. For some species like weed plants and plankton, only experts such as taxonomists and trained technicians can identify taxa accurately. One expert may only identify a limited number of species in a specific domain (such as only species of weeds or phytoplankton) because it requires special skills acquired through extensive experiences [3, 12]. There is an increasing shortage of skilled taxonomists [13]. e declining and partly nonexistent taxonomic knowledge within the general public has been termed “taxonomic crisis” [14], making great challenges to the future of biological study and conservation [11]

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call