Modern networks are highly variable and, as a result, source coders are commonly used under conditions that they were not designed for. We address this problem with a source-coding philosophy that aims at the instantaneous re-optimization of a source coder to match a wide range of constraints on rate or quality and a wide range of packet-loss rates. We present a number of technologies that can be reconfigured by solving analytic relations that use the current conditions and a statistical description of the source as input. The technologies include distribution-preserving quantizers, flexible multiple-description quantizers, and a rate distribution scheme. Based on the generic technologies, we created a complete audio coder. Formal listening tests show that the resulting audio coding scheme with full flexibility provides a quality that is on-par with the best standardized codecs for any particular rate.Index Terms-audio coding, source modeling, quantization
INTRODUCTIONThe telecommunication infrastructure is increasingly heterogeneous and time-variant. This paper describes a coding philosophy that aims to facilitate coder redesign in real-time. This means that coders that are based on this philosophy are always matched to the network and to quality constraints. The resulting coders are always optimal for the rate constraint and packet-loss conditions at hand. They can replace a range of existing standardized coders that have been designed and optimized for particular operating conditions. Typically, conceptually different approaches are used at different ranges of bit-rates. The best coders aiming at very low bit rates are parametric. Audio coders that are based on the CELP architecture (AMR-WB, G.729.1) provide the state-of-the-art performance at low to intermediate bit rates. State-of-the-art coders that operate at higher bit rates typically are transform-based (G.722.1, AAC) or use a hybrid CELP-transform architecture (AMR-WB+).In contrast to the state-of-the-art solutions, our coding approach is consistent. We propose a coding philosophy that relies on signal modeling and a well-defined distortion measure. This leads to technologies that can be optimized in real-time, and together these technologies yield an adaptive audio coder. The optimal configuration of the coder is found by solving sets of closed-form analytical expressions. In contrast to existing audio coders, our approach is optimal over a continuum of bit-rates.The paper is organized as follows. We first explain the philosophy of scalable coding in Section 2. We present a set of technologies in Section 3. In Section 4, we describe an exemplary implementation of the proposed coding scheme and Section 5 provides both objective performance results and the outcome of the formal listening test of a resulting coder. Conclusions are provided in Section 6.