2016 International Conference on Computational Intelligence and Cybernetics 2016
DOI: 10.1109/cyberneticscom.2016.7892562
|View full text |Cite
|
Sign up to set email alerts
|

Speech to text for Indonesian homophone phrase with Mel Frequency Cepstral Coefficient

Abstract: Abstract-In this study, speech to text system for homophone phrases in Indonesian was designed using an extraction method which featured Mel Frequency Cepstral Coefficient (MFCC). Feature extraction results were classified by comparing the two classifiers of Backpropagation Neural Network (BPNN) and KNearest Neigbour (KNN). The input data used were the recordings of each of 3 male and female respondents. The recording process was conducted for 5 seconds at a sampling frequency of 16 kHz and at channel mono. Cl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 2 publications
0
3
0
Order By: Relevance
“…• Contextual difference: homophone words such as "Son" and "Sun", "write" and "Right" are almost the same in English (Indian) pronunciation but are different from their meaning [17,58]. • Style variability: fluency of speaking style affects the information available in both time domain and frequency domain of speech signal [57,67].…”
Section: Challenges For the Development Of Speech Recognition Modelmentioning
confidence: 99%
“…• Contextual difference: homophone words such as "Son" and "Sun", "write" and "Right" are almost the same in English (Indian) pronunciation but are different from their meaning [17,58]. • Style variability: fluency of speaking style affects the information available in both time domain and frequency domain of speech signal [57,67].…”
Section: Challenges For the Development Of Speech Recognition Modelmentioning
confidence: 99%
“…Homonymous and homophone-ambiguous sentences are not a big deal when processed and used to communicate with other human beings, but sometimes it takes a little longer to understand homonymous and Homophone ambiguous sentences (Dalrymple-Alford, 1984;Eviatar, et al, 2023;Bustamin, et al, 2016). When a computer processes ambiguous sentences, there will inevitably be errors in meaning because the computer cannot understand the words that form an ambiguous sentence, Homonym and Homophone.…”
Section: Introductionmentioning
confidence: 99%
“…Based on that research, there is an increasing of elderly voice recognition process for up to 12% (Kwon et al, 2015). Speech recognition research for Indonesian language has been performed by Bustamin et al (2016;Areni et al, 2017b) using the Mel Frequency Cepstral Coefficient (MFCC) method for feature extraction on the word homophone. Cavus (2016) has done a research regarding to intelligent mobile application for learning English (pronunciation) by using voice recognition.…”
Section: Introductionmentioning
confidence: 99%