2020
DOI: 10.35746/jtim.v2i2.99
|View full text |Cite
|
Sign up to set email alerts
|

Metode Wavelet-MFCC dan Korelasi dalam Pengenalan Suara Digit

Abstract: Voice is the sound emitted from living things. With the development of Automatic Speech Recognition (ASR) technology, voice can be used to make it easier for humans to do something. In the ASR extraction process the features have an important role in the recognition process. The feature extraction methods that are commonly applied to ASR are MFCC and Wavelet. Each of them has advantages and disadvantages. Therefore, this study will combine the wavelet feature extraction method and MFCC to maximize the existing… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 6 publications
0
2
0
Order By: Relevance
“…Automatic speech recognition (ASR) is a technology that enables interaction between humans and computers through voice (Dyarbirru and Hidayat, 2020). Google voice search applies an example of ASR technology that converts voice into text to perform everyday commands on mobile devices.…”
Section: Automatic Speech Recognitionmentioning
confidence: 99%
See 1 more Smart Citation
“…Automatic speech recognition (ASR) is a technology that enables interaction between humans and computers through voice (Dyarbirru and Hidayat, 2020). Google voice search applies an example of ASR technology that converts voice into text to perform everyday commands on mobile devices.…”
Section: Automatic Speech Recognitionmentioning
confidence: 99%
“…Common Voice is a multilingual dataset of transcribed, community-based, Creative Commons Zero (CC0) licensed audio talks built by Mozilla (Handoko and Suyanto, 2019). The Common Voice Indonesian dataset consists of 54 unique voices with a total of 5 hours of speech and 4 hours of validation (Dyarbirru and Hidayat, 2020). The data obtained is the result of crowdsourcing, for several languages the Mozilla Deep Speech and Common Voice models produce an average CER improvement of 5.99 ± 5.48 (Tachbelie et al, 2022).…”
Section: Audio and Language Models In Mozilla Deep Speechmentioning
confidence: 99%