2015
DOI: 10.1109/taslp.2015.2481179
|View full text |Cite
|
Sign up to set email alerts
|

A Framework for Speech Activity Detection Using Adaptive Auditory Receptive Fields

Abstract: One of the hallmarks of sound processing in the brain is the ability of the nervous system to adapt to changing behavioral demands and surrounding soundscapes. It can dynamically shift sensory and cognitive resources to focus on relevant sounds. Neurophysiological studies indicate that this ability is supported by adaptively retuning the shapes of cortical spectro-temporal receptive fields (STRFs) to enhance features of target sounds while suppressing those of task-irrelevant distractors. Because an important … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 97 publications
0
2
0
Order By: Relevance
“…Top-down task–based adaptations have been incorporated in attention systems by modelling the attentional gain as weights in the classifier to optimize performance based on specific task goals [77,78], or as a separate cognitive model deciding which speaker to attend to among competing sources [79]. A more holistic attention mechanism has instead used the goal-directed adaptation framework of physiological STRFs as a pre-processing stage to speech recognition, by enabling the separation of the target speech stream from the distractor soundscape it is embedded in [80]. The attentional filter provides significant gain to the target speech while being robust to previously unseen noise types.…”
Section: Applications Of Auditory Attention Modelsmentioning
confidence: 99%
“…Top-down task–based adaptations have been incorporated in attention systems by modelling the attentional gain as weights in the classifier to optimize performance based on specific task goals [77,78], or as a separate cognitive model deciding which speaker to attend to among competing sources [79]. A more holistic attention mechanism has instead used the goal-directed adaptation framework of physiological STRFs as a pre-processing stage to speech recognition, by enabling the separation of the target speech stream from the distractor soundscape it is embedded in [80]. The attentional filter provides significant gain to the target speech while being robust to previously unseen noise types.…”
Section: Applications Of Auditory Attention Modelsmentioning
confidence: 99%
“…Several neurophysiological studies have focused on understanding the ability of humans and animals to tune their cortical Spectro-Temporal Receptive Fields (STRFs) in order to selectively focus on target sounds, while minimizing the irrelevant acoustics and noise background [25]- [28]. Building on such studies, Carlin and Elhilali [25] trained a Gaussian Mixture Model on features obtained from both the initial and adapted STRFs. They showed that an ensemble of adapted STRFs achieves better performance in detecting speech, in the presence of noise.…”
Section: Related Workmentioning
confidence: 99%