2019 International Conference on Information Technology (ICIT) 2019
DOI: 10.1109/icit48102.2019.00013
|View full text |Cite
|
Sign up to set email alerts
|

Spoken Language Recognition Using CNN

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 11 publications
0
3
0
Order By: Relevance
“…They obtained an accuracy of 76 per cent and 73 per cent using SVM and decision tree classifiers, respectively, on 5 hours of training data. Mukherjee et al (2019) used CNNs for LID in German, Spanish, and English. They used filter banks to extract features from frequency domain representations of the signal.…”
Section: Related Workmentioning
confidence: 99%
“…They obtained an accuracy of 76 per cent and 73 per cent using SVM and decision tree classifiers, respectively, on 5 hours of training data. Mukherjee et al (2019) used CNNs for LID in German, Spanish, and English. They used filter banks to extract features from frequency domain representations of the signal.…”
Section: Related Workmentioning
confidence: 99%
“…Many studies on language identification have been conducted, with various feature extraction and classification techniques being used. Several techniques are used to extract features from the audio data, including phone recognition followed by language modeling (PRLM) [5] and parallel phone recognition followed by language modeling (PPRLM) [5] for phonetic approach or perceptual linear prediction (PLP) [5], mel-frequency cepstral coefficient (MFCC) [6]- [8], i-vector [8], [9] and x-vector [10] for the acoustic approx neural networks [11], convolutional neural networks (CNN) [12], [13], logistic regression (LR) [8], PLDA [14], Gaussian mixture model (GMM) [15], [16], support vector machine [17], [18] are among techniques used to classify the language spoken.…”
Section: Introductionmentioning
confidence: 99%
“…There are several steps to identify language, starting from cleaning the data from noise to help the system get better accuracy, extracting the feature from speech data, and classifying the language. There are several techniques to classify the language spoken, including neural networks [2], convolutional neural networks [3]- [6], logistic regression [7], PLDA [8], gaussian mixture model [9], [10], support vector machine [11], [12], and several techniques to extract the features from the recording, such as MFCC [13], [14], ivector [15]- [17], and x-vector [18].…”
Section: Introductionmentioning
confidence: 99%