2nd Annual Meeting of the ELRA/ISCA SIG on Under-Resourced Languages (SIGUL 2023) 2023
DOI: 10.21437/sigul.2023-7
|View full text |Cite
|
Sign up to set email alerts
|

From ‘Snippet-lects’ to Doculects and Dialects: Leveraging Neural Representations of Speech for Placing Audio Signals in a Language Landscape

Séverine Guillaume,
Guillaume Wisniewski,
Alexis Michaud

Abstract: XLSR-53, a multilingual model of speech, builds a vector representation from audio, which allows for a range of computational treatments. The experiments reported here use this neural representation to estimate the degree of closeness between audio files, ultimately aiming to extract relevant linguistic properties. We use max-pooling to aggregate the neural representations from a 'snippet-lect' (the speech in a 5-second audio snippet) to a 'doculect' (the speech in a given resource), then to dialects and langu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 20 publications
0
1
0
Order By: Relevance
“…Examining the speech representations learned by complex neural models, which underlie massive recent advances in speech technology, is a promising direction for phonetics (e.g. Millet et al, 2022;Shen et al, 2024;Guillaume et al, 2023).…”
Section: Human Versus Machine Comparisonmentioning
confidence: 99%
“…Examining the speech representations learned by complex neural models, which underlie massive recent advances in speech technology, is a promising direction for phonetics (e.g. Millet et al, 2022;Shen et al, 2024;Guillaume et al, 2023).…”
Section: Human Versus Machine Comparisonmentioning
confidence: 99%