novel voiceprint using ensembled Mel-Chromagram for speaker recognition

Banuroopa, K.; Priyaa, D. Shanmuga

doi:10.53730/ijhs.v6ns4.10404

Cited by 2 publications

(2 citation statements)

References 15 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The Hamming window method [36], [37] was used in this study with overlapping frames [38], [39]. The window size was set to 0.050 msec, and the window step value was set to 0.025 msec.…”

Section: Feature Name Dimensionsmentioning

confidence: 99%

Method to Profiling the Characteristics of Indonesian Dangdut Songs, Using K-Means Clustering and Features Fusion

Mahardhika,

Warnars,

Nugroho

et al. 2023

IJCDS

View full text Add to dashboard Cite

There have been numerous studies that discuss profiling for various subjects, including criminal profiling, consumer profiling, and employee profiling, among others. However, song profiling is a relatively rare and underexplored area. In fact, profiling songs can provide us with new insights. Dangdut, one of the most popular musical genres in Indonesia, is a unique blend of musical rhythms from Arabic, Malay, Indian, and local music, and has the ability to captivate listeners and get them dancing and swaying along. In this study, we utilized feature selection techniques and feature fusion in conjunction with the K-Means clustering method to profile 281 Dangdut songs into two groups of clusters, with the best Silhouette score of 0.646. Additionally, we compared our method with non-Dangdut song data and obtained a Silhouette score of 0.549.

show abstract

“…The Hamming window method [36], [37] was used in this study with overlapping frames [38], [39]. The window size was set to 0.050 msec, and the window step value was set to 0.025 msec.…”

Section: Feature Name Dimensionsmentioning

confidence: 99%

Method to Profiling the Characteristics of Indonesian Dangdut Songs, Using K-Means Clustering and Features Fusion

Mahardhika,

Warnars,

Nugroho

et al. 2023

IJCDS

View full text Add to dashboard Cite

show abstract

“…A large-scale audio classification method based on CNN is proposed in [6]. A novel methodology for audio classification is proposed in [7], which utilizes an audio finger approach. The methodology involves creating fingerprints by extracting the Mel-frequency cepstral coefficients (MFCC) spectrum and taking the average value of the spectrum.…”

Section: Introductionmentioning

confidence: 99%

Audio classification based on audio WSOLA and CNN algorithm

Li,

Song,

2023

Second International Conference on Electronic Information Technology (EIT 2023)

View full text Add to dashboard Cite

This paper presents an audio classification algorithm based on the WSOLA and CNN techniques to address the problem of data imbalance in audio classification. Audio classification involves categorizing audio signals into different labels or categories, and is crucial in speech recognition, music classification, sound time detection, and sound quality evaluation. In this study, we propose a method that utilizes the WSOLA algorithm to enhance the audio data with fewer categories in the dataset, which can help to improve the accuracy and stability of classification when the dataset is unbalanced. This approach can also prevent the model from focusing too much on categories with large data volumes while neglecting other categories.By mitigating the issue of audio data imbalance, the model can better learn the characteristics of all categories, thereby enhancing the model's performance. We conducted experiments on the UrbanSound8K dataset, where we enhanced the audio data using the WSOLA method and then classified the audio using the CNN classifier. Our results indicate that the overall classification accuracy and stability were significantly improved, demonstrating that the proposed approach can reasonably classify the audio dataset using the WSOLA and CNN techniques.

show abstract

novel voiceprint using ensembled Mel-Chromagram for speaker recognition

Cited by 2 publications

References 15 publications

Method to Profiling the Characteristics of Indonesian Dangdut Songs, Using K-Means Clustering and Features Fusion

Method to Profiling the Characteristics of Indonesian Dangdut Songs, Using K-Means Clustering and Features Fusion

Audio classification based on audio WSOLA and CNN algorithm

Contact Info

Product

Resources

About