Recent trends in speech and audio codec standardization include scalability and extending the signal bandwidth beyond wideband (WB) to superwideband (SWB). In this paper we introduce a SWB extension for the ITU-T G.718 WB codec. In the SWB extension the high frequency content is generated utilizing the quantized MDCT domain coefficients of the WB core, which enables low additional delay. The proposed implementation is scalable with 4 kbps layers. In the first layer two different coding modes are used depending on the input signal type. The proposed SWB extension is evaluated with listening tests and complexity analysis.
The paper introduces a new coding methodology of the spectral modified discrete cosine transform (MDCT) coefficients of an audio signal. A lattice quantizer is used for each spectral sub-band, having the dimension equal to the size of the respective sub-band. The information that needs to be encoded consists of lattice codevector indexes, side information relative to the number the bits on which the indexes are represented and the integer exponents of the sub-band scaling factors. The nature of the side information, together with the parameterization of the quantization resolution allows the use of the method for a large range of bitrates e.g. for 44.1kHz sampled mono files, from 128 kbits/s down to 16 kbits/s. Subjective listening tests show similar performance of the proposed method to the advanced audio coding (AAC) codec for high bitrates (128 kbits down to 64 kbits/s) and clearly better performance for lower bitrates.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.