Interspeech 2021
DOI: 10.21437/interspeech.2021-2000

AusKidTalk: An Auditory-Visual Corpus of 3- to 12-Year-Old Australian Children’s Speech

Abstract: Here we present AusKidTalk [1], an audio-visual (AV) corpus of Australian children's speech collected to facilitate the development of speech based technological solutions for children. It builds upon the technology and expertise developed through the collection of an earlier corpus of Australian adult speech, AusTalk [2,3]. This multi-site initiative was established to remedy the dire shortage of children's speech corpora in Australia and around the world that are sufficiently sized to train accurate automate…

Cited by 6 publications (4 citation statements)
References 7 publications

“…A brief description of the available datasets of children's audio-visual emotion speech is presented in Table 1 and in more detail below. AusKidTalk (Australian children's speech corpus) [33]-audio and video recordings of game exercises for 750 children aged three to twelve who speak Australian English. The study participants were 700 children with typical development and 50 children with speech disorders-25 children aged 6-12 years have a diagnosis of autism spectrum disorder.…”
Section: Children's Audio-visual Speech Emotion Corpora (mentioning)
confidence: 99%
“…The age range of the children in those databases is restricted; several are non-English speaking. The diversity of English dialects throughout the world also makes it difficult to combine numerous younger children's speech corpora from various nations or backgrounds (34). Furthermore, all of them, including these three, utilized problem-specific protocols with restricted tasks, and none of them is properly annotated.…”
Section: ECS Transactions 107 (1) 9053-9064 (2022) (mentioning)
confidence: 99%
“…The goal is to bring together all of the media information associated with the recording and processing of spoken voice, making the task of speech recognition researchers easier (22). The paucity of studies on automated speech processing tools for children may be due to the difficulty of collecting and analyzing kid speech, particularly that of younger children (34). Data can be collected in two ways; first, researchers can collect by itself; second, it can also hire some speech data collection agencies.…”
Section: ECS Transactions 107 (1) 9053-9064 (2022) (mentioning)
confidence: 99%
“…For these reasons, it is important to gather and prepare good quality children's speech data to successfully train child-friendly speech-related AI models. However, there are additional challenges in the process of collecting child speech data [43], explaining the limited number of child-speech datasets available for research purposes.…”
mentioning
confidence: 99%