2012
DOI: 10.1007/978-3-642-31980-8_12

Auditory Time-Frequency Masking: Psychoacoustical Data and Application to Audio Representations

Abstract: In this paper, the results of psychoacoustical experiments on auditory time-frequency (TF) masking using stimuli (masker and target) with maximal concentration in the TF plane are presented. The target was shifted either along the time axis, the frequency axis, or both relative to the masker. The results show that a simple superposition of spectral and temporal masking functions does not provide an accurate representation of the measured TF masking function. This confirms the inaccuracy of simple models of TF …

Citations: cited by 6 publications (7 citation statements)
References: 26 publications
“…Such modification is ubiquitous in signal processing, e.g. in denoising [54], signal detection [55], time-stretching and pitch-shifting [56,57,58,59], modification of the spectrogram [60,61], irrelevance filtering [62,63], speech recognition [64], to name a few. Furthermore, jointly optimizing support and smoothness of the synthesis window can provide block-processing algorithms with reduced delay and blocking artifacts, possibly even at lower redundancy than usual.…”
Section: Related Work (mentioning)
confidence: 99%
“…For more results and discussion on the origins of masking the interested reader is referred to e.g. [32,62,64].…”
Section: 1 (mentioning)
confidence: 99%
“…To best predict masking in the time-frequency decompositions of sounds, it seems intuitive to have data on the time-frequency spread of masking for such elementary atoms, as this will provide a good match between the masking model and the sound decomposition. This has been investigated in [64]. Precisely, spectral, forward, and time-frequency masking have been measured using Gabor atoms of the form $s_i(t) = \sin(2\pi\xi_i t + \pi/4)\,e^{-\pi(\Gamma t)^2}$ with $\Gamma = 600\ \mathrm{s}^{-1}$ as masker and target.…”
Section: Temporal Masking (mentioning)
confidence: 99%
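
For readers who want to see the stimulus described in the statement above, the following is a minimal sketch that samples one Gabor atom of the quoted form, a sine carrier with phase π/4 under a Gaussian envelope with Γ = 600 s⁻¹. The function name `gabor_atom`, the 4000 Hz carrier, the 48 kHz sampling rate, and the 10 ms half-width are illustrative assumptions and are not taken from the cited work.

```python
import numpy as np

def gabor_atom(xi_hz, gamma=600.0, fs=48000, half_width_s=0.01):
    """Sample s(t) = sin(2*pi*xi*t + pi/4) * exp(-pi*(gamma*t)**2).

    xi_hz        carrier frequency in Hz (assumed value supplied by the caller)
    gamma        envelope parameter in 1/s (600 in the quoted statement)
    fs           sampling rate in Hz (assumption)
    half_width_s half-length of the time grid in seconds (assumption)
    """
    n = int(round(half_width_s * fs))
    t = np.arange(-n, n + 1) / fs               # symmetric grid centred at t = 0
    envelope = np.exp(-np.pi * (gamma * t) ** 2)  # Gaussian envelope
    return t, np.sin(2 * np.pi * xi_hz * t + np.pi / 4) * envelope

# Hypothetical 4 kHz atom; the carrier frequency is chosen only for illustration.
t, s = gabor_atom(4000.0)
```

With these settings the envelope has decayed to a negligible value at the edges of the grid, so the sampled atom is effectively complete. A masker and a target in this sketch would differ only in `xi_hz` and in a time offset applied to `t`.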
“…To further reduce the redundancy of the AUDlet representation and improve its perceptual relevance, future work includes introducing perceptual sparsity in the transform domain. Specifically, based on the perceptual irrelevance filter proposed in [5] and recent data on auditory TF masking [34], a binary mask will be computed and applied to the sub-channel coefficients in order to re-synthesize only the audible TF components. Furthermore, future work will focus on how to combine the AUDlet FB and knowledge of TF masking to possibly improve audio codecs.…”
Section: Summary and Concluding Remarks (mentioning)
confidence: 99%