The Speaker and Language Recognition Workshop (Odyssey 2018) 2018
DOI: 10.21437/odyssey.2018-14
|View full text |Cite
|
Sign up to set email alerts
|

Convolutional Neural Network and Language Embeddings for End-to-End Dialect Recognition

Abstract: Dialect identification (DID) is a special case of general language identification (LID), but a more challenging problem due to the linguistic similarity between dialects. In this paper, we propose an end-to-end DID system and a Siamese neural network to extract language embeddings. We use both acoustic and linguistic features for the DID task on the Arabic dialectal speech dataset: Multi-Genre Broadcast 3 (MGB-3). The endto-end DID system was trained using three kinds of acoustic features: Mel-Frequency Cepstr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
62
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 54 publications
(62 citation statements)
references
References 25 publications
0
62
0
Order By: Relevance
“…Many works are oriented on Arabic identification (Ali et al, 2016;El Haj et al, 2017;Shon et al, 2018;Tachicart et al, 2017). These works were respectively presented by Ali et., El Haj et al, Shon et al and Tachicart et al They dealt with the Multi dialectal identification as well as the MSA one where the last one proposes an identification system distinguishing between the Moroccan dialect and MSA.…”
Section: Identification and Recognitionmentioning
confidence: 99%
See 3 more Smart Citations
“…Many works are oriented on Arabic identification (Ali et al, 2016;El Haj et al, 2017;Shon et al, 2018;Tachicart et al, 2017). These works were respectively presented by Ali et., El Haj et al, Shon et al and Tachicart et al They dealt with the Multi dialectal identification as well as the MSA one where the last one proposes an identification system distinguishing between the Moroccan dialect and MSA.…”
Section: Identification and Recognitionmentioning
confidence: 99%
“…More recently Ali (2018) proposed a character-level convolution neural network model for distinguishing between MSA and multi dialects. The authors proposed a CNN model including five layers Shon et al (2018) proposed an end-to-end Dialect identification system and a Siamese neural network to extract language embeddings. The authors used acoustic and linguistic features on the Arabic dialectal speech dataset.…”
Section: Building Resourcesmentioning
confidence: 99%
See 2 more Smart Citations
“…In this work, we adopt the end-to-end dialect identification system proposed in [17]. This system has a stack of convolutional neural network (CNN) layers, followed by a global pooling layer that aggregates frame level representations to produce utterance level representations.…”
Section: End-to-end Dialect Identification Systemmentioning
confidence: 99%