M. Farrugia scite author profile

The integration of fixed and wireless networks have expanded the range in which speech and audio coders were designed to operate. As a result, current research is focusing on a number of developments including algorithms which are able to adapt to different transmission environments and to operate under multiple constraints of bit rate, complexity, delay, robustness to bit errors and diversity of input signals. In this paper, we propose a single coding algorithm for compressing wideband speech and audio signals (0-8kHz) operating with scalable bit rates and with low delay. The algorithm is based on the backward-adaptive linear predictive coding (BA LPC) technique in conjunction with an efficient closedloop optimised excitation structure consisting of sparse pulses of ternary values. The output bit rates range from 17 to 68kb/s. The scalability feature is achieved by means of discrete quantisation layers representing various levels of enhancements of the base-line coder and also flexibility in terms of complexity and bit allocation requirements depending on the particular application and on the network resources. An evaluation of the performance of the coder operating at 17kb/s is carried out using the G.722 standard as a reference.

show abstract

Combined speech and audio coding with bit rate and bandwidth scalability

Farrugia¹

2001

View full text Add to dashboard Cite

The past two decades have witnessed a rapid expansion within the telecommunications industry. This growth has been primarily motivated by the proliferation of digital communication systems and services which have become easily available through wired and wireless systems. Current research trends involve the integration of speech, audio, video and d ata channels into true multimedia communications over fixed and mobile networks. However, while the available bandw idth in wired terrestrial networks is rela tively cheap and expandable, it becomes a limited resource in satellite and cellular-radio systems. In order to accommodate an ever growing number of users while maintaining high quality and low operational costs, it is necessary to maximise spectral efficiency. This has given rise to the development of high rate compression techniques w ith the ability to adapt to a broad class of input signals and to varying network resources.The research carried out in this thesis has mainly focused on the design of a single algorithm for compressing speech and audio signals sampled at different rates. The algorithms are based on the analysis-by-synthesis linear prediction coding (AbS-LPC) scheme, which has been widely employed in various speech coding standards. However, this bit rate reduction technique is based on the speech production mechanism and as such provides a rigid structure which presents a major lim itation for audio coding. In order to improve the audio quality at low rates and to compensate for the errors incurred by the linear prediction during segments of high transitions, the algorithms employ an efficient pulse excitation structure which represents the short innovation sequences with sparse unit magnitude pulses. The scheme proposed for the compression of telephone bandw idth speech and audio signals at 12kb/s achieves similar quality to the G.728 coder at 16kb/s and higher audio quality than the GSM-EFR standard at 12.2kb/s. W ideband speech and audio coding schemes have been designed using both the fullband approach at bit rates of 17 and 19kb/s and also the split band technique at a bit rate of 20kb/s. The perceptual quality is comparable to the G.722 coder operating at 48kb/s.The subband decomposition technique is also adapted to code speech and audio signals sampled at 32kHz. The quality of the coder at 28kb/s is similar to the quality achieved by the MP3 coder at 32kb/s. The algorithm also provides bandw idth and bit rate scalability ranging from 12 to 64kb/s, making it ideal for deployment in rate-adaptive communication systems. A ck n ow led gem en ts I would like to take this opportunity to express my gratitude to Prof. Ahmet Kondoz, my PhD project supervisor, whose guidance and suggestions throughout my research were most helpful and much appreciated. I am also indebted to all my colleagues in the M ultimedia Communications group, especially Stéphane Villette and Yong Duk Cho, whose suggestions have been invaluable. I would like thank my family, especially my grandmother, for the constant support,...

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

M. Farrugia

Pulse-excited LPC for wideband speech and audio coding

Provision of scalable streaming audio services over EGPRS

Lowdelay combined wideband speech and audio coding with scalable bit rates

Combined speech and audio coding with bit rate and bandwidth scalability

Contact Info

Product

Resources

About