2015
DOI: 10.1007/s10772-014-9267-z
|View full text |Cite
|
Sign up to set email alerts
|

Database development and automatic speech recognition of isolated Pashto spoken digits using MFCC and K-NN

Abstract: Automatic recognition of isolated spoken digits is one of the most challenging tasks in the area of Automatic Speech Recognition. In this paper, Database Development and Automatic Speech Recognition of Isolated Pashto Spoken Digits from Sefer (0) to Naha (9) has been presented. A number of 50 individual Pashto native speakers (25 male and 25 female) of different ages, ranging from 18 to 60 years, were involved to utter from Sefer (0) to Naha (9) digits separately. Sony PCM-M 10 linear recorder is used for reco… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
26
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 26 publications
(28 citation statements)
references
References 5 publications
(7 reference statements)
2
26
0
Order By: Relevance
“…K-nearest neighbor (KNN) (or Lazy learning) algorithm is a popular algorithm used for both classification and regression purposes (Hu et al, 2016 ). However, it is mostly used for classification problems (Ali et al, 2015 ; Zuo et al, 2015 ; Khan et al, 2017 ). KNN is an instance-based and non-parametric learning algorithm and can be considered a simple machine learning algorithm (Donaldson, 1967 ; Qin et al, 2013 ).…”
Section: Methodsmentioning
confidence: 99%
“…K-nearest neighbor (KNN) (or Lazy learning) algorithm is a popular algorithm used for both classification and regression purposes (Hu et al, 2016 ). However, it is mostly used for classification problems (Ali et al, 2015 ; Zuo et al, 2015 ; Khan et al, 2017 ). KNN is an instance-based and non-parametric learning algorithm and can be considered a simple machine learning algorithm (Donaldson, 1967 ; Qin et al, 2013 ).…”
Section: Methodsmentioning
confidence: 99%
“…K-nearest neighbor (KNN) is the most popular classification algorithm also called Lazy learning algorithm [62,63]. This algorithm is mostly applied in data mining, image processing, speech recognition, statistics, machine learning and bioinformatics [62,64,65].…”
Section: K-nearest Neighbor (Knn)mentioning
confidence: 99%
“…This algorithm is mostly applied in data mining, image processing, speech recognition, statistics, machine learning and bioinformatics [62,64,65]. KNN has achieved substantial performance compared to many other learning algorithms [42].…”
Section: K-nearest Neighbor (Knn)mentioning
confidence: 99%
“…Working on ASR has started about half century ago for English language but development of ASR for local languages is a new trend, emerged in the last few decades which opens the door for layman to interact with computer in friendly manner. Thus, native speakers [4], can communicate with computer through speech in their own regional languages. The development of an ASR for local languages is quite a challenging task due to the lack of resources such as corpus with enough vocabulary, dialectical variation and so on.…”
Section: Introductionmentioning
confidence: 99%
“…Many works have been done for local languages such as in Punjabi (spoken in Pakistan and India) [5,6,7], Gujrati (local language of India) [8], Urdu (national language of Pakistan and forth most widely spoken language in the world) [9,10,11,12,13], Hindi (an official language of India) [1,14,15,16], Marathi (spoken in India) [17], Arabic (an official language of Arab and fifth widely used language in the world) [18,19], Bengali (spoken language of Bangladesh) [20]. The current work also includes Pashto language but very little works has been done in the development of Pashto speech recognition system [4]. Fortunately, Pashto share some characteristics with other languages like Arabic, Urdu and Persian languages.…”
Section: Introductionmentioning
confidence: 99%