An IoT System Using Deep Learning to Classify Camera Trap Images on the Edge

Imran Zualkernan,Brylle Ryan Gomez,Jacky Judas,Lana Alhaj Hussain,Ali Reza Sajun,Salam Dhou

doi:10.3390/computers11010013

Abstract

Camera traps deployed in remote locations provide an effective method for ecologists to monitor and study wildlife in a non-invasive way. However, current camera traps suffer from two problems. First, the images are manually classified and counted, which is expensive. Second, due to manual coding, the results are often stale by the time they get to the ecologists. Using the Internet of Things (IoT) combined with deep learning represents a good solution for both these problems, as the images can be classified automatically, and the results immediately made available to ecologists. This paper proposes an IoT architecture that uses deep learning on edge devices to convey animal classification results to a mobile app using the LoRaWAN low-power, wide-area network. The primary goal of the proposed approach is to reduce the cost of the wildlife monitoring process for ecologists, and to provide real-time animal sightings data from the camera traps in the field. Camera trap image data consisting of 66,400 images were used to train the InceptionV3, MobileNetV2, ResNet18, EfficientNetB1, DenseNet121, and Xception neural network models. While performance of the trained models was statistically different (Kruskal–Wallis: Accuracy H(5) = 22.34, p < 0.05; F1-score H(5) = 13.82, p = 0.0168), there was only a 3% difference in the F1-score between the worst (MobileNet V2) and the best model (Xception). Moreover, the models made similar errors (Adjusted Rand Index (ARI) > 0.88 and Adjusted Mutual Information (AMU) > 0.82). Subsequently, the best model, Xception (Accuracy = 96.1%; F1-score = 0.87; F1-Score = 0.97 with oversampling), was optimized and deployed on the Raspberry Pi, Google Coral, and Nvidia Jetson edge devices using both TenorFlow Lite and TensorRT frameworks. Optimizing the models to run on edge devices reduced the average macro F1-Score to 0.7, and adversely affected the minority classes, reducing their F1-score to as low as 0.18. Upon stress testing, by processing 1000 images consecutively, Jetson Nano, running a TensorRT model, outperformed others with a latency of 0.276 s/image (s.d. = 0.002) while consuming an average current of 1665.21 mA. Raspberry Pi consumed the least average current (838.99 mA) with a ten times worse latency of 2.83 s/image (s.d. = 0.036). Nano was the only reasonable option as an edge device because it could capture most animals whose maximum speeds were below 80 km/h, including goats, lions, ostriches, etc. While the proposed architecture is viable, unbalanced data remain a challenge and the results can potentially be improved by using object detection to reduce imbalances and by exploring semi-supervised learning.

Highlights

Animals’ behavior, movements, and locations can be captured using various monitoring techniques
Xception was the best model with an average F1-score of 0.878 and an average accuracy of 96.1%
This reduction in size allowed the models to be deployed onto various edge devices in order to benchmark performance

Summary

Introduction

Animals’ behavior, movements, and locations can be captured using various monitoring techniques. Ecologists often use camera traps to capture images of animals in a 4.0/). Camera traps are especially useful because they allow for standardized data collection while causing minimal disturbance to wildlife. Camera traps generate a large amount of data that can be used regionally or globally [1]. Camera traps are typically left in the wild for months, accumulating a large number of images. Data captured using camera traps often does not reach its full potential due to the costly and time-consuming manual procedures needed to label and classify the captured images

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computers	Publication Date: Jan 13, 2022
Citations: 25	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

An IoT System Using Deep Learning to Classify Camera Trap Images on the Edge

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computers

Lead the way for us

Similar Papers

Object classification and visualization with edge artificial intelligence for a customized camera trap platform
Sajid Nazir ... Mohammad Kaleem
Ecological Informatics | VOL. 79
Sajid Nazir, et. al.Sajid Nazir ... Mohammad Kaleem
02 Jan 2024
Ecological Informatics | VOL. 79

The First Study of White Rust Disease Recognition by Using Deep Neural Networks and Raspberry Pi Module Application in Chrysanthemum
Toan Khac Nguyen ... Truong-Dong Do
Inventions | VOL. 8
Toan Khac Nguyen, et. al.Toan Khac Nguyen ... Truong-Dong Do
31 May 2023
Inventions | VOL. 8

An IoT Transfer Learning-Based Service for the Health Status Monitoring of Grapevines
Antonios Morellos ... Dimitrios Kateris
Applied Sciences | VOL. 14
Antonios Morellos, et. al.Antonios Morellos ... Dimitrios Kateris
26 Jan 2024
Applied Sciences | VOL. 14

Anatomy of Deep Learning Image Classification and Object Detection on Commercial Edge Devices: A Case Study on Face Mask Detection
Dimitrios Kolosov ... Pandelis Kourtessis
IEEE Access | VOL. 10
Dimitrios Kolosov, et. al.Dimitrios Kolosov ... Pandelis Kourtessis
01 Jan 2021
IEEE Access | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An IoT System Using Deep Learning to Classify Camera Trap Images on the Edge

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computers