2010
DOI: 10.1007/978-3-642-17641-8_18
|View full text |Cite
|
Sign up to set email alerts
|

A Survey on Automatic Speaker Recognition Systems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 31 publications
(11 citation statements)
references
References 17 publications
0
11
0
Order By: Relevance
“…Audio analysis has been traditionally focused on the recognition of speech [1,3], speaker identification [6,10,26,27] and scene categorization [4,22]. Recently, research in the area of intelligent surveillance systems shifted its attention to the automatic detection of abnormal or dangerous events through the analysis of audio streams acquired by microphones.…”
Section: Introductionmentioning
confidence: 99%
“…Audio analysis has been traditionally focused on the recognition of speech [1,3], speaker identification [6,10,26,27] and scene categorization [4,22]. Recently, research in the area of intelligent surveillance systems shifted its attention to the automatic detection of abnormal or dangerous events through the analysis of audio streams acquired by microphones.…”
Section: Introductionmentioning
confidence: 99%
“…where x n is the original speech signal, w n is the Hamming window function, and k corresponds to the frequency. A power spectrogram can be converted into a log-power representation using (2).…”
Section: B Experimental Setup 1) Selected Features (Spectrograms)mentioning
confidence: 99%
“…Identifying a person by their voice is an important human trait that is typically taken for granted in natural human-tohuman interactions/communications. Human speech, which is a performance biometric, is different from other kinds of biometrics (such as hand geometry or fingerprints) [1] in that voice biometrics are the only commercial biometric product that process acoustic information [2]. A speaker's identity is represented in the way they speak, but not necessarily in the words being spoken [1].…”
Section: Introductionmentioning
confidence: 99%
“…The specific voice features refer to various analysis such as amplitude spectrum, localization of spectral peaks related to the vocal tract shape or pitch striations related to the user's glottal source. Similar to keystroke analysis, the speaker recognition can be performed either in static (text-dependent) or dynamic mode (text-independent) mode [29]. In the text-dependent mode the use is asked by the biometric system to pronounce a particular phrase, while, in the case of the text-independent mode the user is free to speak any phrase.…”
Section: Speaker Recognitionmentioning
confidence: 99%