2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2017
DOI: 10.1109/icassp.2017.7953170
|View full text |Cite
|
Sign up to set email alerts
|

Statistical normalisation of phase-based feature representation for robust speech recognition

Abstract: In earlier work we have proposed a source-filter decomposition of speech through phase-based processing. The decomposition leads to novel speech features that are extracted from the filter component of the phase spectrum. This paper analyses this spectrum and the proposed representation by evaluating statistical properties at various points along the parametrisation pipeline. We show that speech phase spectrum has a bell-shaped distribution which is in contrast to the uniform assumption that is usually made. I… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
6
2

Relationship

3
5

Authors

Journals

citations
Cited by 11 publications
(6 citation statements)
references
References 20 publications
0
6
0
Order By: Relevance
“…That is, the quality/intelligibility of the magnitude-only reconstructed signal is significantly less than the original one. This point has been verified by both subjective [38], [39] and objective [28], [40]- [43] tests. Such detrimental information loss which is important from both theoretical and perceptual perspectives, potentially harms the performance of the raw magnitude-based systems.…”
Section: Information Content Of the Ft's Componentsmentioning
confidence: 59%
“…That is, the quality/intelligibility of the magnitude-only reconstructed signal is significantly less than the original one. This point has been verified by both subjective [38], [39] and objective [28], [40]- [43] tests. Such detrimental information loss which is important from both theoretical and perceptual perspectives, potentially harms the performance of the raw magnitude-based systems.…”
Section: Information Content Of the Ft's Componentsmentioning
confidence: 59%
“…GFNet extracts frequency domain features by multiplying the real and imaginary parts with same parameters, which maintains all phase spectrum. However, the quality of the signal reconstructed using only the amplitude is noticeably lower than that of the original signal (Loweimi et al 2017). The phase spectrum is solely related to the real part and the imaginary part of each frequency.…”
Section: Fda Blockmentioning
confidence: 95%
“…In [19], the statistical properties of the phase spectrum and phase-based representations were scrutinised. Moreover, usefulness of various types of statistical normalisation methods (histogram equalisation, Gaussianisation and mean-variance normalisation) of the phase-based features at different stages along the pipeline were investigated on the A2 task.…”
Section: Phase Spectrum Applications In Asrmentioning
confidence: 99%