2013 IEEE International Conference on Acoustics, Speech and Signal Processing 2013
DOI: 10.1109/icassp.2013.6639262
|View full text |Cite
|
Sign up to set email alerts
|

Time-scale modifications based on a full-band adaptive harmonic model

Abstract: In this paper, a simple method for time-scale modifications of speech based on a recently suggested model for AM-FM decomposition of speech signals, is presented. This model is referred to as the adaptive Harmonic Model (aHM). A full-band speech analysis/synthesis system based on the aHM representation is built, without the necessity of separating a deterministic and/or a stochastic component from the speech signal. The aHM models speech as a sum of harmonically related sinusoids that can adapt to the local ch… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2014
2014
2022
2022

Publication Types

Select...
5
3

Relationship

2
6

Authors

Journals

citations
Cited by 9 publications
(6 citation statements)
references
References 15 publications
0
6
0
Order By: Relevance
“…Thus, any processing ofφ i,h is better conditioned than a processing of the raw instantaneous phase value φ i,h . This property explains the success of the simple processing techniques presented in [17,18]. In [16], the relative phases were then interpolated on a continuous time axis using splines, i.e.,φ i,h ⇒ φ h (t).…”
Section: X[n] ⇔ X(t))mentioning
confidence: 99%
See 1 more Smart Citation
“…Thus, any processing ofφ i,h is better conditioned than a processing of the raw instantaneous phase value φ i,h . This property explains the success of the simple processing techniques presented in [17,18]. In [16], the relative phases were then interpolated on a continuous time axis using splines, i.e.,φ i,h ⇒ φ h (t).…”
Section: X[n] ⇔ X(t))mentioning
confidence: 99%
“…These models have been widely used for speech analysis, resynthesis, and modification [12][13][14]. Sinusoidal models have evolved over the years [3,15], and recently, the so-called adaptive Harmonic Model (aHM) [16] has also been shown to yield practically transparent analysis/resynthesis and excellent modification performance [17,18]. Despite the inherent assumption that speech can be represented only by http://asmp.eurasipjournals.com/content/2014/1/38 harmonic sinusoidal components, even in unvoiced segments, aHM succeeds at capturing the relevant spectral information and noisy nature of a speech signal and thus, representing the speech signal in a uniform way, without using any voicing decision.…”
Section: Introductionmentioning
confidence: 99%
“…These models can be used for speech modeling [1], speech coding and synthesis [2], [3], voice transformation [4], speech enhancement [5] for hearing aids [6]. The parameters computed can be used to build higher-level representations [7] (e.g.…”
Section: Introductionmentioning
confidence: 99%
“…For pitch scaling, the estimation of a spectral magnitude envelope is necessary, but no phase envelope estimation is followed, thus significantly simplifying modification. Part of this work has been published in ICASSP 2013 [KDRS13] and in ICASSP 2014 [KDRS14].…”
Section: Audio Modeling Speech Transformationsmentioning
confidence: 99%
“…However, for the phase, the process is not straightforward because of its rotation due to the time advance across time instants. Therefore, it is proposed to remove this effect using the integral of kf 0 from the start of the signal, and obtain the relative phase -RP [DS13,KDRS13]. Thus, by assuming that the shape of the signal is changing smoothly, the phase values change also smoothly from one analysis time instant to the other.…”
Section: Time Scalingmentioning
confidence: 99%