2021
DOI: 10.1007/s42979-021-00750-1
|View full text |Cite
|
Sign up to set email alerts
|

Improving Indian Spoken-Language Identification by Feature Selection in Duration Mismatch Framework

Abstract: Paper presents novel duration normalized feature selection technique and two-step modified hierarchical classifier to improve the accuracy of spoken language identification (SLID) using Indian languages for duration mismatched condition. Feature selection averages random forest-based importance vectors of open SMILE features of different duration utterances. Although it improves the SLID system's accuracy for mismatched training and testing durations, the performance is significantly reduced for short-duration… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 30 publications
0
4
0
Order By: Relevance
“…Aarti et al in [2] provided an organized review about the diferent linguistic cues according to the major language families in India. Further, in the literature, hierarchical LID systems with a front-end language family recognizer are proposed for improved LID performance [11,76,160]. Bengali, Hindi, Punjabi, Tamil, Urdu…”
Section: Overview Of Indian Language Recognition and Challenges 31 Br...mentioning
confidence: 99%
See 3 more Smart Citations
“…Aarti et al in [2] provided an organized review about the diferent linguistic cues according to the major language families in India. Further, in the literature, hierarchical LID systems with a front-end language family recognizer are proposed for improved LID performance [11,76,160]. Bengali, Hindi, Punjabi, Tamil, Urdu…”
Section: Overview Of Indian Language Recognition and Challenges 31 Br...mentioning
confidence: 99%
“…This corpus contains speech data of some rarely studied Indian languages, such as Angami, Bodo, Khasi, Hrangkhawl, and Sumi. Some of the other sources for availing standard Indian speech data are Speehocean, 11 Indic-TTS, 12 13 There are also developments in open-source corpora, such as Mozilla Common Voice, 14 OpenSLR, 15 with speech data for the Indian languages. In Table 3, we have summarized the key information about the major speech corpora developed for Indian spoken language recognition research.…”
Section: Other Developmentsmentioning
confidence: 99%
See 2 more Smart Citations