2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2018
DOI: 10.1109/icassp.2018.8461486
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting Convolutional Neural Networks for Phonotactic Based Dialect Identification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
19
0
1

Year Published

2018
2018
2022
2022

Publication Types

Select...
3
3
3

Relationship

2
7

Authors

Journals

citations
Cited by 30 publications
(20 citation statements)
references
References 23 publications
0
19
0
1
Order By: Relevance
“…Khurana et al [5] 67 73 Shon et al [9] 69.97 75.00 Najafian et al [7] 59.72 73.27 Bulut et al [10] -79.76 Our approach 73.39 81.36 Table 9: Performance comparison to previous works on same MGB-3 dataset pared to previous studies. Table 9 shows summary performance of previous studies on same MGB-3 Arabic dialect dataset.…”
Section: Systems Accuracy(%) Single System Fusion Systemmentioning
confidence: 89%
See 1 more Smart Citation
“…Khurana et al [5] 67 73 Shon et al [9] 69.97 75.00 Najafian et al [7] 59.72 73.27 Bulut et al [10] -79.76 Our approach 73.39 81.36 Table 9: Performance comparison to previous works on same MGB-3 dataset pared to previous studies. Table 9 shows summary performance of previous studies on same MGB-3 Arabic dialect dataset.…”
Section: Systems Accuracy(%) Single System Fusion Systemmentioning
confidence: 89%
“…A DID benchmark of the task was attained using the i-vector framework using bottleneck features [4]. Using linguistic features such as words and characters results in similar performance to that obtained via acoustic features with a Convolutional Neural Network (CNN)based backend [5,6,7]. Since the linguistic feature space is different from the acoustic feature space, a fusion of the results from both feature representations has been shown to be beneficial [6,5,8].…”
Section: Introductionmentioning
confidence: 99%
“…DNN consisting of multiple interconnected layers between the input and output layers were used in speech recognition tasks (Najafian et al 2018). DNNs had not been used just as classifiers but they were also used as feature extractors (Ali et al 2015).…”
Section: Bottleneck Featuresmentioning
confidence: 99%
“…The fine-grained Arabic Dialect Identification (ADI) task involved dialect identification of speech from YouTube across 17 dialects. The previous MGB-3 challenge resulted in studies covering diverse dialect identification topics such as domain adaptation [4,5], semi-supervised learning [6,7,8,9], and linguistic feature extraction [10,11]. However MGB-3 was limited to 5 dialects.…”
Section: Introductionmentioning
confidence: 99%