2022
DOI: 10.53730/ijhs.v6ns4.10404
|View full text |Cite
|
Sign up to set email alerts
|

novel voiceprint using ensembled Mel-Chromagram for speaker recognition

Abstract: This research paper proposes a novel voiceprint generation methodology for recognizing the speakers registered in a system. The proposed methodology is a keyword-dependent closed set speaker classification task. The features used are Mel-Spectrogram, Chromagram, MFCC and a new ensembled feature called Mel-Chroma. Mel-Chroma is generated with the combination of Mel-spectrogram and Chromagram. The Mel-Chroma spectrogram generated is converted into a binary image by using the average as the threshold. The recurre… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 15 publications
0
2
0
Order By: Relevance
“…The Hamming window method [36], [37] was used in this study with overlapping frames [38], [39]. The window size was set to 0.050 msec, and the window step value was set to 0.025 msec.…”
Section: Feature Name Dimensionsmentioning
confidence: 99%
“…The Hamming window method [36], [37] was used in this study with overlapping frames [38], [39]. The window size was set to 0.050 msec, and the window step value was set to 0.025 msec.…”
Section: Feature Name Dimensionsmentioning
confidence: 99%
“…A large-scale audio classification method based on CNN is proposed in [6]. A novel methodology for audio classification is proposed in [7], which utilizes an audio finger approach. The methodology involves creating fingerprints by extracting the Mel-frequency cepstral coefficients (MFCC) spectrum and taking the average value of the spectrum.…”
Section: Introductionmentioning
confidence: 99%