Detecting Screams From Home Audio Recordings to Identify Tantrums: Exploratory Study Using Transfer Machine Learning.

Rebecca O'Donovan, Eric Butter, Simon Lin, Emre Sezgin, Sven Bambach

doi:10.2196/18279

Open Access

Journal: JMIR formative research
Publication Date: Jun 16, 2020
Citations: 4
License type: cc-by

Affiliation: Nationwide Children's Hospital

Abstract

Background: Qualitative self- or parent-reports used in assessing children’s behavioral disorders are often inconvenient to collect and can be misleading due to missing information, rater biases, and limited validity. A data-driven approach to quantify behavioral disorders could alleviate these concerns. This study proposes a machine learning approach to identify screams in voice recordings that avoids the need to gather large amounts of clinical data for model training.

Objective: The goal of this study is to evaluate if a machine learning model trained only on publicly available audio data sets could be used to detect screaming sounds in audio streams captured in an at-home setting.

Methods: Two sets of audio samples were prepared to evaluate the model: a subset of the publicly available AudioSet data set and a set of audio data extracted from the TV show Supernanny, which was chosen for its similarity to clinical data. Scream events were manually annotated for the Supernanny data, and existing annotations were refined for the AudioSet data. Audio feature extraction was performed with a convolutional neural network pretrained on AudioSet. A gradient-boosted tree model was trained and cross-validated for scream classification on the AudioSet data and then validated independently on the Supernanny audio.

Results: On the held-out AudioSet clips, the model achieved a receiver operating characteristic (ROC)–area under the curve (AUC) of 0.86. The same model applied to three full episodes of Supernanny audio achieved an ROC-AUC of 0.95 and an average precision (positive predictive value) of 42% despite screams only making up 1.3% (n=92/7166 seconds) of the total run time.

Conclusions: These results suggest that a scream-detection model trained with publicly available data could be valuable for monitoring clinical recordings and identifying tantrums as opposed to depending on collecting costly privacy-protected clinical data for model training.
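The two metrics reported in the Results can be read against the class imbalance: with screams in only 92 of 7166 seconds, a detector that ranked seconds at random would be expected to reach an average precision close to that prevalence, so 42% is far above chance. The following is a minimal sketch of how ROC-AUC and average precision can be computed from per-second labels and scores. It is not the authors' evaluation code; the function names and the toy data are illustrative assumptions, and the average-precision function uses the simple ranking definition without special handling of tied scores.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """ROC-AUC as the probability that a random positive outscores a random negative.

    Ties between a positive and a negative count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def average_precision(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Mean of the precision measured at the rank of each positive example."""
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    hits = 0
    precision_sum = 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            precision_sum += hits / rank
    if hits == 0:
        raise ValueError("need at least one positive example")
    return precision_sum / hits


# Prevalence reported in the abstract: 92 scream seconds out of 7166.
prevalence = 92 / 7166

# Toy data (hypothetical, not from the study): 2 scream seconds out of 8.
toy_labels = [1, 0, 1, 0, 0, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2]

toy_auc = roc_auc(toy_labels, toy_scores)           # 11/12, about 0.917
toy_ap = average_precision(toy_labels, toy_scores)  # 5/6, about 0.833

print(f"prevalence in Supernanny audio: {prevalence:.4f}")
print(f"toy ROC-AUC: {toy_auc:.3f}, toy average precision: {toy_ap:.3f}")
```

In the toy example one negative second outranks one of the two screams, which lowers average precision more than it lowers ROC-AUC; this is one reason the two numbers can differ widely on rare-event data.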

Highlights

One of the challenges in studying and diagnosing children with behavioral disorders is the relative unreliability of the information available.
The same model applied to three full episodes of Supernanny audio achieved a receiver operating characteristic (ROC)-area under the curve (AUC) of 0.95 and an average precision of 42% despite screams only making up 1.3% (n=92/7166 seconds) of the total run time.
We present an approach for detecting screams in home audio recordings with the aim of segmenting those recordings to review clinically relevant portions.

Summary

Introduction

One of the challenges in studying and diagnosing children with (potential) behavioral disorders is the relative unreliability of the information available. A quantitative, data-driven approach for evaluating behavioral problems would alleviate the need to rely on potentially inaccurate self- or parent-reports. With this goal in mind, our study focuses on detecting human screaming within continuous audio recordings from inside a family’s home. Segmenting home audio to capture screams, with some postprocessing, could eventually allow researchers to use the scream as a proxy for temper tantrums or negative interactions between family members [8]. Analyzing these interactions would allow clinicians to assess family relationships in a more objective manner than direct self-report [9], but identifying the segments manually would be tedious and time intensive. This study proposes a machine learning approach to identify screams in voice recordings that avoids the need to gather large amounts of clinical data for model training.
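The segmentation step mentioned above can be illustrated with a small sketch. It assumes the detector outputs one scream probability per second and turns those scores into contiguous time ranges for review. The function name, the 0.5 threshold, and the 2-second gap tolerance are all hypothetical choices for illustration; the source does not specify the postprocessing that was used.

```python
from typing import List, Sequence, Tuple


def scream_segments(
    probs: Sequence[float],
    threshold: float = 0.5,  # assumed cutoff, not taken from the study
    max_gap: int = 2,        # assumed: bridge up to this many quiet seconds
) -> List[Tuple[int, int]]:
    """Group flagged seconds into (start, end) ranges, with end exclusive.

    A second is flagged when its probability is at least `threshold`.
    Flagged seconds separated by at most `max_gap` unflagged seconds are
    merged into one segment.
    """
    segments: List[List[int]] = []
    for second, p in enumerate(probs):
        if p < threshold:
            continue
        if segments and second - segments[-1][1] <= max_gap:
            segments[-1][1] = second + 1  # extend the current segment
        else:
            segments.append([second, second + 1])  # start a new segment
    return [(start, end) for start, end in segments]


# Hypothetical per-second scores for a 12-second clip.
example_probs = [0.1, 0.2, 0.9, 0.8, 0.1, 0.7, 0.1, 0.1, 0.1, 0.1, 0.95, 0.2]
example_segments = scream_segments(example_probs)
print(example_segments)  # [(2, 6), (10, 11)]
```

Merging nearby detections reflects the idea that a tantrum is likely to contain several screams with short pauses, so a reviewer would probably prefer one longer segment over many one-second clips.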
