2021
DOI: 10.1088/1742-6596/1722/1/012014

Implementation of audio recognition using mel frequency cepstrum coefficient and dynamic time warping in wirama praharsini

Abstract: Sekar Agung, or wirama, is a classic Balinese work that contains moral values and is usually sung during traditional or religious ceremonies. Because of its classic nature, this art has been abandoned by the younger generation, who are less interested in learning or preserving it. To address this problem, this study aimed to perform sound matching as a medium for learning wirama praharsini based on the rule of guru and laghu. Wirama Praharsini was chosen because of the simple way of singing and pronunciation when it…

Cited by 9 publications
(3 citation statements)
References 2 publications
“…The main extraction algorithms are fast Fourier transform (FFT), Mel filter, logarithmic operation, and discrete cosine transform (DCT). MFCC feature parameters will be used as input to the speech recognition model [24,25]. The speech signal preprocessing is implemented by a first-order FIR high-pass digital filter in the MATLAB system digital filter toolbox.…”
Section: Algorithm Design
Citation type: mentioning
confidence: 99%
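The extraction chain named in the statement above (first-order FIR high-pass pre-emphasis, FFT, Mel filter, logarithm, DCT) can be illustrated with a minimal sketch. It is not the cited authors' MATLAB implementation. The pre-emphasis coefficient 0.97, the Hamming window, the frame and hop sizes, the filter count, and every function name are assumptions chosen for illustration, and a direct DFT stands in for the FFT to keep the example dependency-free.

```python
import cmath
import math


def pre_emphasis(signal, alpha=0.97):
    """First-order FIR high-pass filter: y[n] = x[n] - alpha * x[n-1] (alpha is assumed)."""
    return [signal[0]] + [signal[n] - alpha * signal[n - 1] for n in range(1, len(signal))]


def power_spectrum(frame):
    """Power spectrum of one Hamming-windowed frame via a direct DFT (O(N^2))."""
    n = len(frame)
    windowed = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    spectrum = []
    for k in range(n // 2 + 1):
        s = sum(x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(windowed))
        spectrum.append(abs(s) ** 2 / n)
    return spectrum


def hz_to_mel(hz):
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_filterbank(num_filters, frame_len, sample_rate):
    """Triangular filters whose edges are evenly spaced on the mel scale."""
    num_bins = frame_len // 2 + 1
    top = hz_to_mel(sample_rate / 2)
    edges_hz = [mel_to_hz(top * i / (num_filters + 1)) for i in range(num_filters + 2)]
    edges = [int(math.floor((frame_len + 1) * f / sample_rate)) for f in edges_hz]
    bank = []
    for m in range(1, num_filters + 1):
        lo, mid, hi = edges[m - 1], edges[m], edges[m + 1]
        row = [0.0] * num_bins
        for k in range(lo, mid):      # rising slope
            row[k] = (k - lo) / (mid - lo)
        for k in range(mid, hi):      # falling slope
            row[k] = (hi - k) / (hi - mid)
        bank.append(row)
    return bank


def dct2(values, num_coeffs):
    """Unnormalized DCT-II, keeping the first num_coeffs coefficients."""
    n = len(values)
    return [sum(v * math.cos(math.pi * k * (i + 0.5) / n) for i, v in enumerate(values))
            for k in range(num_coeffs)]


def mfcc(signal, sample_rate, frame_len=64, hop=32, num_filters=12, num_coeffs=6):
    """Return one list of cepstral coefficients per frame."""
    emphasized = pre_emphasis(signal)
    bank = mel_filterbank(num_filters, frame_len, sample_rate)
    features = []
    for start in range(0, len(emphasized) - frame_len + 1, hop):
        power = power_spectrum(emphasized[start:start + frame_len])
        energies = [sum(w * p for w, p in zip(row, power)) for row in bank]
        # Small offset guards against log(0) when a filter captures no energy.
        log_energies = [math.log(e + 1e-10) for e in energies]
        features.append(dct2(log_energies, num_coeffs))
    return features


# Usage: a short 440 Hz tone sampled at 8 kHz (synthetic data, for illustration only).
sample_rate = 8000
tone = [math.sin(2 * math.pi * 440 * t / sample_rate) for t in range(256)]
coeffs = mfcc(tone, sample_rate)
```

With these assumed sizes the 256-sample tone yields seven overlapping frames of six coefficients each. Real systems typically use longer frames (for example 20 to 30 ms) and an FFT routine instead of the direct DFT.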
“…where x(m) denotes the target audio signal, w(n) represents the window function, and sgn is the sign function defined by Equation (11). When x(m) has the same sign as x(m-1), sgn ensures that their difference is zero.…”
Section: Zero Crossing Rate
Citation type: mentioning
confidence: 99%
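The zero crossing rate described in the statement above can be sketched as follows. Equation (11) is not reproduced in the excerpt, so the sign convention here (+1 for non-negative samples, -1 otherwise), the rectangular window, and the normalization by the number of adjacent sample pairs are all assumptions, and the function names are hypothetical.

```python
def sgn(x):
    """Assumed sign convention: +1 for x >= 0, -1 otherwise."""
    return 1 if x >= 0 else -1


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ (rectangular window assumed)."""
    if len(frame) < 2:
        return 0.0
    # |sgn(x[m]) - sgn(x[m-1])| is 2 at a sign change and 0 otherwise, hence the division by 2.
    crossings = sum(abs(sgn(frame[m]) - sgn(frame[m - 1])) for m in range(1, len(frame))) / 2
    return crossings / (len(frame) - 1)


# Usage on small hand-made frames (synthetic data, for illustration only).
alternating = zero_crossing_rate([1, -1, 1, -1])
constant_sign = zero_crossing_rate([1, 2, 3])
```

When two neighbouring samples share a sign, the difference of their sgn values is zero and the pair contributes nothing to the count, which matches the behaviour the statement describes.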
“…The feature vectors were extracted from the digital signals of the input speech in the format of MFCCs. (4,5) MFCCs were chosen because they are based on the perceptual characteristics of the human auditory system. (6,7) A block diagram of the MFCC feature extraction process is shown in Fig.…”
Section: Feature Extraction
Citation type: mentioning
confidence: 99%