2020
DOI: 10.31234/osf.io/p95dz
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

ALICE: An open-source tool for automatic measurement of phoneme, syllable, and word counts from child-centered daylong recordings

Abstract: Recordings captured by wearable microphones are a standard method for investigating young children’s language environments. A key measure to quantify from such data is the amount of speech present in children’s home environments. To this end, the LENA recorder and software—a popular system for measuring linguistic input—estimates the number of adult words that children may hear over the course of a recording. However, word count estimation is challenging to do in a language-independent manner; the relationship… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3
1

Relationship

3
1

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 41 publications
0
3
0
Order By: Relevance
“…Additionally, while our classifier is open-source, LENA software is not; thus, the ability to use this classifier on researchers' own data requires a substantial cost to purchase the LENA recorders and software. Future work should compare whether our classifiers can be used with open-source speech algorithms (e.g., ALICE; Räsänen et al, 2021) to achieve similar performance in sleep and tCDS/ODS classifiers.…”
Section: Limitationsmentioning
confidence: 99%
“…Additionally, while our classifier is open-source, LENA software is not; thus, the ability to use this classifier on researchers' own data requires a substantial cost to purchase the LENA recorders and software. Future work should compare whether our classifiers can be used with open-source speech algorithms (e.g., ALICE; Räsänen et al, 2021) to achieve similar performance in sleep and tCDS/ODS classifiers.…”
Section: Limitationsmentioning
confidence: 99%
“…The system interfaces well with extant annotation standards. Currently, ChildProject supports: LENA annotations in .its (Xu et al, 2008); ELAN annotations following the ACLEW DAS template (Casillas et al 2017, imported using Pympi: Lubbers and Torreira 2013; as well as rttm files outputted by ACLEW tools, namely the Voice Type Classifier (VTC) by , the Linguistic Unit Count Estimator (ALICE) by Räsänen et al (2020), and the VoCalisation Maturity Network (VCMNet) by Futaisi et al (2019). Users can also adapt routines for file types or conventions that vary.…”
Section: Converting and Indexing Annotationsmentioning
confidence: 99%
“…datasets 2020), may provide the ideal starting point for this, since it is cross-cultural and contains images that may allow the identification of nonverbal elements of the social interactions. ACLEW project members have illustrated the importance of coordinated data annotation for developing initial annotations , as well as the usefulness of collaborating with experts of speech technology and machine learning to develop tools that speed up annotation and generalize analyses from the hand-annotated fraction to the day-long scale (Al Futaisi et al, 2019;Le Franc et al, 2018;Räsänen et al, 2020).…”
Section: Roadmapmentioning
confidence: 99%