2016
DOI: 10.1109/taslp.2016.2585864
|View full text |Cite
|
Sign up to set email alerts
|

Near and Far Field Speech-in-Noise Intelligibility Improvements Based on a Time–Frequency Energy Reallocation Approach

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
4
2
1

Relationship

2
5

Authors

Journals

citations
Cited by 13 publications
(11 citation statements)
references
References 55 publications
0
11
0
Order By: Relevance
“…H1 sharpened the formants, H2 flattened the spectral tilt, and H3 preemphasized the mid frequency range. A similar voicing index as in [11] was used to adapt the first two operations. The last two stages of MBSS have performed SNR-adaptive gain thresholding and fixed time smoothing.…”
Section: Multiband Spectral Shapingmentioning
confidence: 99%
See 2 more Smart Citations
“…H1 sharpened the formants, H2 flattened the spectral tilt, and H3 preemphasized the mid frequency range. A similar voicing index as in [11] was used to adapt the first two operations. The last two stages of MBSS have performed SNR-adaptive gain thresholding and fixed time smoothing.…”
Section: Multiband Spectral Shapingmentioning
confidence: 99%
“…The first stage had four steps. One was reconstructing the waveform from the spectral coefficients; another one was applying full wave rectification on the previous waveform, extracting 30-ms frames at every 30-ms, picking the maximum value in each frame, and re-sampling the sequence of maxima at the original frame rate of MBSSDRC; at a third stage, the level of the envelope e(m) computed previously was statically compressed using the strategy described in [11], yielding the gains g(m); finally, the dynamics of g(m) were enhanced to have a faster response to on-sets and a slower one to off-sets,…”
Section: Multiband Dynamic Range Compressionmentioning
confidence: 99%
See 1 more Smart Citation
“…The first algorithm (SSDRC) was based on the work of Zorilȃ et al [4,7] and used a two-stage energy reallocation strategy. During the first stage (spectral shaping -SS), the speechto-noise ratio (SNR) at medium and high frequencies was increased by transferring energy from below 500 Hz to higher frequencies.…”
Section: Intelligibility Enhancement Algorithms For Evaluationmentioning
confidence: 99%
“…This problem is known as near-end listening enhancement or speech intelligibility enhancement (SINE) and the constraint is called the equal-RMS (EQR) constraint. Typical solutions use modifications that are either based on previous intelligibility studies (e.g., simulating the differences between Lombard speech and conversational speech or emphasis of the information-bearing segments of speech) [3][4][5][6][7] or they optimize an objective measure that correlates well with measured intelligibility [8][9][10][11]. 'Inclusive' algorithms that would work for both normal-hearing and hearing-impaired listeners or in reverberant conditions are becoming increasingly popular [12,13].…”
Section: Introductionmentioning
confidence: 99%