2021
DOI: 10.1007/978-981-15-9492-2_12
|View full text |Cite
|
Sign up to set email alerts
|

Spoken Language Identification of Indian Languages Using MFCC Features

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
2
2

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 12 publications
0
4
0
Order By: Relevance
“…Then we gradually decrease the nodes to 64 and 32 to make it suitable for the output layer. The model is optimised by RMSprop (Root Mean Squared Propagation) using sparse categorical cross-entropy as a loss function, given by Equations ( 6) and (7). One of the reasons for using RMSprop as an optimizer is that it converges much faster to the local minima.…”
Section: Experiments and Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Then we gradually decrease the nodes to 64 and 32 to make it suitable for the output layer. The model is optimised by RMSprop (Root Mean Squared Propagation) using sparse categorical cross-entropy as a loss function, given by Equations ( 6) and (7). One of the reasons for using RMSprop as an optimizer is that it converges much faster to the local minima.…”
Section: Experiments and Resultsmentioning
confidence: 99%
“…Value of J ranges from 0 to 5 to determine number of categories in which an audio sample is to be classified. CCE = βˆ’ βˆ‘ 𝑑 𝑗 log(S(π‘₯) 𝑗 ) 5 𝑗=0 (7) In the above equation S(x)j defines Softmax probability for the j-th class and tj represents the truth label.…”
Section: S(π‘₯) 𝑖 =mentioning
confidence: 99%
See 1 more Smart Citation
“…MFCCs [2], SDC [3], i-Vector [4], Linear Discriminant Analysis [LDA] [5], and x-vector [6] are the primary conventional methods used for spoken language identification. A mel-spectrogram combined with a Convolutional Recurrent Neural Network [CRNN], X-vector combined with DNN and Wave2Vec speech representations was tested in [7] and it showed the highest F1-score of 91%.…”
Section: Literature Reviewmentioning
confidence: 99%