ALICE: An open-source tool for automatic measurement of phoneme, syllable, and word counts from child-centered daylong recordings

Okko Räsänen,Shreyas Seshadri,Marisa Casillas,Alejandrina Cristia,Marvin Lavechin

doi:10.3758/s13428-020-01460-x

Abstract

Recordings captured by wearable microphones are a standard method for investigating young children’s language environments. A key measure to quantify from such data is the amount of speech present in children’s home environments. To this end, the LENA recorder and software—a popular system for measuring linguistic input—estimates the number of adult words that children may hear over the course of a recording. However, word count estimation is challenging to do in a language- independent manner; the relationship between observable acoustic patterns and language-specific lexical entities is far from uniform across human languages. In this paper, we ask whether some alternative linguistic units, namely phone(me)s or syllables, could be measured instead of, or in parallel with, words in order to achieve improved cross-linguistic applicability and comparability of an automated system for measuring child language input. We discuss the advantages and disadvantages of measuring different units from theoretical and technical points of view. We also investigate the practical applicability of measuring such units using a novel system called Automatic LInguistic unit Count Estimator (ALICE) together with audio from seven child-centered daylong audio corpora from diverse cultural and linguistic environments. We show that language-independent measurement of phoneme counts is somewhat more accurate than syllables or words, but all three are highly correlated with human annotations on the same data. We share an open-source implementation of ALICE for use by the language research community, enabling automatic phoneme, syllable, and word count estimation from child-centered audio recordings.

Highlights

The use oflong child-centered audio recordings from children’s natural environments is becoming one of the standard methods for studying child language acquisition of spoken languages
This work started by asking what type of linguistic units are the most meaningful measures of child language input, especially when developmental, linguistic, and technical considerations are taken into account
We presented a new open-source algorithm for linguistic unit count estimation called Automatic LInguistic unit Count Estimator (ALICE), and used it to test automatic estimation of all three candidate units with data from seven different corpora of child-centered daylong audio recordings

Summary

Introduction

The use of (day)long child-centered audio recordings from children’s natural environments is becoming one of the standard methods for studying child language acquisition of spoken languages. The existing standard solution to collecting and analyzing child-centered recordings is the widely adopted LENA system (Xu et al, 2008; Gilkerson & Richards, 2009). Behav Res (2021) 53:818–835 recording device and associated software for automatically analyzing a number of variables from the data, including estimation of the number of words spoken by adults in the vicinity of the child, in addition to detecting child vocalizations and conversational turns. Instead of using a single static model for word count estimation, WCE-R can be adapted to any target language of interest by using a few hours of orthographically transcribed child-centered audio. While the WCE-R adaptation procedure improves the cross-linguistic performance of the system, the requirement of transcribed domain-specific data greatly limits the practical usability of WCE-R as a standardized tool for developmental research, not least since its performance depends on the quality and representativeness of the adaptation data used for the domain of interest

Objectives

Methods

Results

Conclusion