Abstract

Purpose: Recording young children's vocalizations through wearables is a promising method to assess language development. However, accurately and rapidly annotating these recordings remains challenging. Online crowdsourcing with citizen scientists could be a feasible solution. In this article, we assess the extent to which citizen scientists' annotations align with those gathered in the lab for recordings collected from young children.

Method: Segments identified by Language ENvironment Analysis (LENA) as produced by the key child were extracted from one daylong recording for each of 20 participants: 10 low-risk control children and 10 children diagnosed with Angelman syndrome, a neurogenetic syndrome characterized by severe language impairments. Speech samples were annotated by trained annotators in the laboratory as well as by citizen scientists on Zooniverse. All annotators assigned one of five labels to each sample: Canonical, Noncanonical, Crying, Laughing, and Junk. This allowed the derivation of two child-level vocalization metrics: the Linguistic Proportion and the Canonical Proportion.

Results: At the segment level, Zooniverse classifications had moderate precision and recall. More importantly, the Linguistic Proportion and the Canonical Proportion derived from Zooniverse annotations were highly correlated with those derived from laboratory annotations.

Conclusions: Annotations obtained through a citizen science platform can help us overcome challenges posed by the process of annotating daylong speech recordings. Particularly when used in composites or derived metrics, such annotations can be used to investigate early markers of language delays.
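The abstract does not spell out how the two child-level metrics are computed from the segment labels. Below is a minimal sketch, assuming the Linguistic Proportion is the share of speech-like segments (Canonical + Noncanonical) among all non-Junk child segments, and the Canonical Proportion is the share of Canonical segments among the speech-like ones; the function name and label strings are illustrative, not taken from the article.

```python
from collections import Counter

def vocalization_metrics(labels):
    """Hypothetical sketch: derive child-level proportions from per-segment labels.

    `labels` is one annotated label per child segment in the daylong recording.
    The formulas are assumptions, not the article's exact definitions.
    """
    counts = Counter(labels)
    canonical = counts["Canonical"]
    noncanonical = counts["Noncanonical"]
    crying = counts["Crying"]
    laughing = counts["Laughing"]

    linguistic = canonical + noncanonical            # speech-like segments
    vocalizations = linguistic + crying + laughing   # all non-Junk segments

    linguistic_proportion = linguistic / vocalizations if vocalizations else None
    canonical_proportion = canonical / linguistic if linguistic else None
    return linguistic_proportion, canonical_proportion

# Example: aggregated labels for one child.
labels = ["Canonical", "Noncanonical", "Noncanonical", "Crying", "Junk", "Laughing"]
print(vocalization_metrics(labels))  # (0.6, 0.333...)
```

Because Junk segments are excluded from both denominators under this assumption, the metrics summarize the quality of vocalizations rather than the amount of noise captured by the recorder.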
