2018
DOI: 10.12913/22998624/87028
|View full text |Cite
|
Sign up to set email alerts
|

Advanced Time-Frequency Representation in Voice Signal Analysis

Abstract: The most commonly used time-frequency representation of the analysis in voice signal is spectrogram. This representation belongs in general to Cohen's class, the class of time-frequency energy distributions. From the standpoint of properties of the resolution, spectrogram representation is not optimal. In Cohen class representations are known which have a better resolution properties. All of them are created by smoothing the Wigner-Ville'a distribution characterized by the best resolution, however, the biggest… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 15 publications
(9 citation statements)
references
References 35 publications
0
9
0
Order By: Relevance
“…where STFT mix is the m × n complex matrix of t-f containing in m-rows instantaneous signal spectra (m is the number of STFT time frames). The input data for ICA is a spectrogram (autospectrum) of the signal TFD mix = STFT mix 2 [43,44]. The rows of the TFD mix matrix are treated as individual channels in a multichannel signal.…”
Section: Model Definition and Proceduresmentioning
confidence: 99%
“…where STFT mix is the m × n complex matrix of t-f containing in m-rows instantaneous signal spectra (m is the number of STFT time frames). The input data for ICA is a spectrogram (autospectrum) of the signal TFD mix = STFT mix 2 [43,44]. The rows of the TFD mix matrix are treated as individual channels in a multichannel signal.…”
Section: Model Definition and Proceduresmentioning
confidence: 99%
“…In most systems, overlapping of frames is used to smooth transition from frame to frame. Each time frame is then windowed with a Hamming window to eliminate discontinuities at the edges [17]. The filter coefficients w(n) of a Hamming window of length n are computed according to:…”
Section: Mel Frequency Cepstral Coefficients (Mfcc)mentioning
confidence: 99%
“…As technology progresses, speech recognition will be embedded in more devices used in everyday activities, where environmental variables perform a major part, such as mobile phone voice recognition applications [10], cars [11], integrated access control and information systems [12], emotion identification systems [13], application monitoring [14], disabled assistance [15], and intelligent technology. In addition to voice, many acoustic applications are also essential in diverse engineering issues [16][17][18][19][20][21][22]. A noise decrease method could be deployed to enhance efficiency in real-world noisy settings [23][24][25][26].…”
Section: Introductionmentioning
confidence: 99%
“…Time-frequency representation (TFR), broadly applied to detecting several conditions [ 14 , 15 , 16 , 17 , 18 ], has been recently used to detect pathological changes in voice signals [ 19 ]. TFR enables the evolution of the periodicity and frequency components to be observed over time, allowing the analysis of non-stationary signals, such as voice signals [ 20 ].…”
Section: Introductionmentioning
confidence: 99%