Towards efficient and scalable speech compression schemes for robust speech recognition applications

Srinivasamurthy, Naveen; Ortega, Antonio; Zhu, Qifeng; Alwan, Abeer

doi:10.1109/icme.2000.869589

Cited by 7 publications

(4 citation statements)

References 5 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…6 and 7 is that the bitrate required to ensure that speech recognition performance is not degraded due to compression is about 4600 b/s for the spoken names task and 5700 b/s for the CSR task. Additionally, it has been shown that the approximate minimum bitrate for transparent operation for an isolated digits task was 1100 b/s (Srinivasamurthy et al, 2001a) and for a connected digits task was 2000 b/s (Srinivasamurthy et al, 2001b). This illustrates that the minimum bitrate for transparent speech recognition is strongly task dependent.…”

Section: Continuous Speech Recognitionmentioning

confidence: 98%

“…Due to this overlap and the underlying correlation in the speech (because of the slow movement of articulators), it is reasonable to expect that MFCC vectors from adjacent frames will exhibit high correlation. To achieve good compression efficiency this correlation has been exploited using linear prediction (Ramaswamy and Gopalakrishnan, 1998;Srinivasamurthy et al, 2000), where a given MFCC in a frame was predicted from the corresponding MFCC in the previous frame. 2 The prediction error e i ¼ u i À aû iÀ1 was quantized using uniform scalar quantization (USQ), where u i is the current sample andû iÀ1 is the reconstruction of the previous sample generated by the coarse prediction loop.…”

Section: Scalable Encodingmentioning

confidence: 99%

“…The effect of various speech coding techniques on speech recognition, including GSM (Digalakis et al, 1999;Srinivasamurthy et al, 2000;Kiss, 2000;Lilly and Paliwal, 1996;Srinivasamurthy et al, 2001b), G.723.1, G.727, G.728, G.729 (Turunen and Vlaj, 2001) (Lilly and Paliwal, 1996) and MELP (Srinivasamurthy et al, 2000;Srinivasamurthy et al, 2001b), has been previously evaluated by a number of researchers. In all cases, it was shown that speech coding significantly degrades speech recognition performance.…”

Section: Introductionmentioning

confidence: 96%

See 2 more Smart Citations

Efficient scalable encoding for distributed speech recognition

Srinivasamurthy

Ortega

Narayanan

2006

Speech Communication

View full text Add to dashboard Cite

Section: Continuous Speech Recognitionmentioning

confidence: 98%

Section: Scalable Encodingmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 96%

See 1 more Smart Citation

Efficient scalable encoding for distributed speech recognition

Srinivasamurthy

Ortega

Narayanan

2006

Speech Communication

View full text Add to dashboard Cite

“…Another approach consists in providing each processing task with the most relevant information in order to maximize its classification accuracy. One potential solution to this problem has been addressed in [26] in the context of scalable speech recognition. The authors considered two sequential speech recognition systems with very different resource requirements.…”

Section: Discussionmentioning

confidence: 99%

Signal Processing Challenges in Distributed Stream Processing Systems

Frossard

Verscheure

Venkatramani

2006 IEEE International Conference on Acoustics Speed and Signal Processing Proceedings

View full text Add to dashboard Cite

Distributed stream processing represents a novel computing paradigm where data, sensed externally and possibly preprocessed, is pushed asynchronously to various connected computing devices with heterogeneous capabilities for processing. It enables novel applications typically characterized by the need to process high-volume data streams in a timely and responsive fashion. Some example applications include sensor networks, location-tracking services, distributed speech recognition, and network management. Recent work in large-scale distributed stream processing tackle various research challenges in both the application domain as well as in the underlying system. The main focus of this paper is to highlight some of the signal processing challenges such a novel computing framework brings. We first briefly introduce the main concepts behind distributed stream processing. Then we define the notion of relevant information from two related information-theoretic approaches. Finally, we browse existing techniques for sensing and quantizing the information given the set of classification, detection and estimation tasks, which we refer to as task-driven signal processing. We also address some of the related unexplored research challenges.

show abstract

An efficient and scalable 2D DCT-based feature coding scheme for remote speech recognition

Zhu¹,

Alwan

2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221)

View full text Add to dashboard Cite

Towards efficient and scalable speech compression schemes for robust speech recognition applications

Cited by 7 publications

References 5 publications

Efficient scalable encoding for distributed speech recognition

Efficient scalable encoding for distributed speech recognition

Signal Processing Challenges in Distributed Stream Processing Systems

An efficient and scalable 2D DCT-based feature coding scheme for remote speech recognition

Contact Info

Product

Resources

About