How to treat mixed behavior segments in supervised machine learning of behavioural modes from inertial measurement data

Yehezkel S Resheff,Hanna M Bensch,Hanna M Bensch,Markus Zöttl,Markus Zöttl,Roi Harel,Roi Harel,Roi Harel,Roi Harel,Akiko Matsumoto-Oda,Margaret C Crofoot,Margaret C Crofoot,Margaret C Crofoot,Margaret C Crofoot,Sara Gomez,Luca Börger,Shay Rotics,Shay Rotics

doi:10.1186/s40462-024-00485-7

Abstract

The application of supervised machine learning methods to identify behavioural modes from inertial measurements of bio-loggers has become a standard tool in behavioural ecology. Several design choices can affect the accuracy of identifying the behavioural modes. One such choice is the inclusion or exclusion of segments consisting of more than a single behaviour (mixed segments) in the machine learning model training data. Currently, the common practice is to ignore such segments during model training. In this paper we tested the hypothesis that including mixed segments in model training will improve accuracy, as the model would perform better in identifying them in the test data. We test this hypothesis using a series of data simulations on four datasets of accelerometer data coupled with behaviour observations, obtained from four study species (Damaraland mole-rats, meerkats, olive baboons, polar bears). Results show that when a substantial proportion of the test data are mixed behaviour segments (above ~ 10%), including mixed segments in machine learning model training improves the accuracy of classification. These results were consistent across the four study species, and robust to changes in segment length, sample size, and degree of mixture within the mixed segments. However, we also find that in some cases (particularly in baboons) models trained with mixed segments show reduced accuracy in classifying test data containing only single behaviour (pure) segments, compared to models trained without mixed segments. Based on these results, we recommend that when the classification model is expected to deal with a substantial proportion of mixed behaviour segments (> 10%), it is beneficial to include them in model training, otherwise, it is unnecessary but also not harmful. The exception is when there is a basis to assume that the training data contains a higher rate of mixed segments than the actual (unobserved) data to be classified—such a situation may occur particularly when training data are collected in captivity and used to classify data from the wild. In this case, excess inclusion of mixed segments in training data should probably be avoided.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

How to treat mixed behavior segments in supervised machine learning of behavioural modes from inertial measurement data

Abstract

Talk to us

Similar Papers

More From: Movement Ecology

Lead the way for us

Journal: Movement Ecology	Publication Date: Jun 10, 2024
License type: CC BY 4.0

Similar Papers

Pushing the limits of solubility prediction via quality-oriented data selection.
Murat Cihan Sorkun ... Süleyman Er
iScience | VOL. 24
Murat Cihan Sorkun, et. al.Murat Cihan Sorkun ... Süleyman Er
17 Dec 2020
iScience | VOL. 24

Incorporating Training Data Uncertainty in Machine Learning Models for Satellite Imagery
Hamed Alemohammad
-
Hamed AlemohammadHamed Alemohammad
15 May 2023
15 May 2023

Disclosure control of machine learning models from trusted research environments (TRE): New challenges and opportunities
Esma Mansouri-Benssassi ... Emily Jefferson
Heliyon | VOL. 9
Esma Mansouri-Benssassi, et. al.Esma Mansouri-Benssassi ... Emily Jefferson
01 Apr 2023
Heliyon | VOL. 9

Modelling treatment benefit for bexmarilimab (an anti-Clever-1 antibody and a novel macrophage reprogrammer) using phase I/II first-in-man trial data.
Petri Bono ... Juho Jalkanen
Journal of Clinical Oncology | VOL. 40
Petri Bono, et. al.Petri Bono ... Juho Jalkanen
01 Jun 2022
Journal of Clinical Oncology | VOL. 40

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

How to treat mixed behavior segments in supervised machine learning of behavioural modes from inertial measurement data

Abstract

Talk to us

Similar Papers

More From: Movement Ecology