2019
DOI: 10.3390/computers8010005
|View full text |Cite
|
Sign up to set email alerts
|

Robust Cochlear-Model-Based Speech Recognition

Abstract: Accurate speech recognition can provide a natural interface for human–computer interaction. Recognition rates of the modern speech recognition systems are highly dependent on background noise levels and a choice of acoustic feature extraction method can have a significant impact on system performance. This paper presents a robust speech recognition system based on a front-end motivated by human cochlear processing of audio signals. In the proposed front-end, cochlear behavior is first emulated by the filtering… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

2
6
0
9

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 16 publications
(17 citation statements)
references
References 39 publications
2
6
0
9
Order By: Relevance
“…Some models are biologically detailed and accurately capture particular response properties of the auditory nerve (6,(11)(12)(13)(14)(15)(16), while others are abstracted approximations of the signal transformation in the auditory periphery (17)(18)(19). Some have been used to provide inputs for models of auditory neurons (17-21), to generate perceptual models (22), and in machine processing of sounds (2,23). However, few attempts (24) have been made to determine which cochlear models best describe the computational impact of the auditory periphery on neural responses in mammalian auditory cortex, although more progress has been made in the avian auditory system (25).…”
mentioning
confidence: 99%
“…Some models are biologically detailed and accurately capture particular response properties of the auditory nerve (6,(11)(12)(13)(14)(15)(16), while others are abstracted approximations of the signal transformation in the auditory periphery (17)(18)(19). Some have been used to provide inputs for models of auditory neurons (17-21), to generate perceptual models (22), and in machine processing of sounds (2,23). However, few attempts (24) have been made to determine which cochlear models best describe the computational impact of the auditory periphery on neural responses in mammalian auditory cortex, although more progress has been made in the avian auditory system (25).…”
mentioning
confidence: 99%
“…Tapis gammatone dilaporkan memberi perkiraan yang baik dari tapis auditori manusia untuk urutan tapis 3, 4, dan 5 [28]. Berbeda dengan spectrogram, tapis gammatone memiliki bandwidth tidak konstan di seluruh kanal frekuensi, yaitu Equivalent Rectangular Bandwidth (ERB) [29], [30]. Secara umum, ERB dari tapis gammatone direpresentasikan dengan (2).…”
Section: A Fitur Cochleagramunclassified
“…Dalam eksperimen ini, digunakan implementasi tapis gammatone dari Slaney [30], [31], dengan jumlah 64 standar tapis berjarak 50 Hz hingga 8 kHz (sinyal wicara pada 16 kHz).…”
Section: A Fitur Cochleagramunclassified
“…Similarly, the digital filterbank resembles the processing of basilar membrane in auditory modeling. Each bandpass filter is simulated the frequency characteristics of the basilar membrane [14]. The human hearing is the most sensitive to frequencies between 2000-5000Hz and less sensitive in high frequency region.…”
Section: B Gammatone Filterbank Analysismentioning
confidence: 99%