Abstract

The combination of unmanned aerial vehicles (UAV) with deep learning models has the capacity to replace manned aircrafts for wildlife surveys. However, the scarcity of animals in the wild often leads to highly unbalanced, large datasets for which even a good detection method can return a large amount of false detections. Our objectives in this paper were to design a training method that would reduce training time, decrease the number of false positives and alleviate the fine-tuning effort of an image classifier in a context of animal surveys. We acquired two highly unbalanced datasets of deer images with a UAV and trained a Resnet-18 classifier using hard-negative mining and a series of recent techniques. Our method achieved sub-decimal false positive rates on two test sets (1 false positive per 19,162 and 213,312 negatives respectively), while training on small but relevant fractions of the data. The resulting training times were therefore significantly shorter than they would have been using the whole datasets. This high level of efficiency was achieved with little tuning effort and using simple techniques. We believe this parsimonious approach to dealing with highly unbalanced, large datasets could be particularly useful to projects with either limited resources or extremely large datasets.

Highlights

  • Accepted: 12 January 2021Accurate animal counts are the cornerstone of robust conservation and management plans [1]

  • The training and testing were carried out using Pytorch 1.1.0 [35] on three graphics processing units (GPU)(NVIDIA GeForce GTX 1080 Ti) with a Resnet-18 [36] pre-trained on Imagnet, from the Pytorch model zoo

  • Our goal in this paper was to reduce training time and the number of FN when training on highly unbalanced data, with minimal fine-tuning of hyperparameters

Read more

Summary

Introduction

Accurate animal counts are the cornerstone of robust conservation and management plans [1]. Positioning systems allow the possibility to reproduce earlier flights, making them wellsuited for regular assessments of the same areas [2] When used correctly, they have proven to be able to produce more accurate counts than direct methods [5]. Compared to previous image classification techniques, they are completely data driven, extracting and refining automatically the relevant information to make their decision [21] Their performance is known to increase with the amount of data provided [22], making them interesting for tasks that repeatedly collect vast amounts of data, like self-driving cars or in our case, animal census. Recent, simple, and available methods without needing extensive fine-tuning

Data Acquisition
Pictures
Image Pre-Processing
Proposed Approach
Metric
Workflow
Hard-Negative
Class Activation Maps
Training Times
Discussion
Training the Models
Hard-Negative Mining
Perspectives on Future Work
Findings
Example

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.