Text, Speech and Dialogue
DOI: 10.1007/978-3-540-74628-7_46
|View full text |Cite
|
Sign up to set email alerts
|

Non-uniform Speech/Audio Coding Exploiting Predictability of Temporal Evolution of Spectral Envelopes

Abstract: Abstract. We describe novel speech/audio coding technique designed to operate at medium bit-rates. Unlike classical state-of-the-art coders that are based on short-term spectra, our approach uses relatively long temporal segments of audio signal in critical-band-sized sub-bands. We apply auto-regressive model to approximate Hilbert envelopes in frequency sub-bands. Residual signals (Hilbert carriers) are demodulated and thresholding functions are applied in spectral domain. The Hilbert envelopes and carriers a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Publication Types

Select...
3
1

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 6 publications
0
4
0
Order By: Relevance
“…In the decoder, the sub-band residuals were reconstructed and modulated with corresponding FDLP envelope. Individual DCT contributions from each critical sub-band were summed and inverse DCT was applied to reconstruct output signal [25].…”
Section: Fdlp For Narrow-band Speech Codingmentioning
confidence: 99%
See 1 more Smart Citation
“…In the decoder, the sub-band residuals were reconstructed and modulated with corresponding FDLP envelope. Individual DCT contributions from each critical sub-band were summed and inverse DCT was applied to reconstruct output signal [25].…”
Section: Fdlp For Narrow-band Speech Codingmentioning
confidence: 99%
“…In DPQ, graphically shown in Figure 5, phase spectral components corresponding to relatively low-magnitude spectral components are transmitted with lower resolution, that is, the codebook vector selected from the magnitude codebook is processed by "adaptive thresholding" in the encoder as well as in the decoder [25]. The threshold determines the resolution of quantization levels in uniform SQ.…”
Section: Dynamic Phase Quantization (Dpq)mentioning
confidence: 99%
“…A new audio coding technique based on modeling the spectral dynamics has been proposed in [1], [2]. The input audio signal is first decomposed into frequency sub-bands using a Quadrature Mirror Filter (QMF) bank.…”
Section: Introductionmentioning
confidence: 99%
“…A new speech/audio coding technique based on modeling the temporal evolution of the spectral dynamics was proposed in [1,2]. The approach is based on representing Amplitude Modulating (AM) signal using Hilbert envelope estimate and Frequency Modulating (FM) signal using Hilbert carrier.…”
Section: Introductionmentioning
confidence: 99%