2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2017
DOI: 10.1109/icassp.2017.7953233
|View full text |Cite
|
Sign up to set email alerts
|

On the information rate of speech communication

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
16
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
3
3

Relationship

2
4

Authors

Journals

citations
Cited by 12 publications
(16 citation statements)
references
References 19 publications
0
16
0
Order By: Relevance
“…Numerous applications of IB exist in domains such as clustering [ 7 , 8 ], coding theory and quantization [ 9 , 10 , 11 , 12 ], speech and image recognition [ 13 , 14 , 15 , 16 , 17 ], and cognitive science [ 18 ]. Several recent papers have also drawn connections between IB and supervised learning, in particular, classification using neural networks [ 19 , 20 ].…”
Section: Introductionmentioning
confidence: 99%
“…Numerous applications of IB exist in domains such as clustering [ 7 , 8 ], coding theory and quantization [ 9 , 10 , 11 , 12 ], speech and image recognition [ 13 , 14 , 15 , 16 , 17 ], and cognitive science [ 18 ]. Several recent papers have also drawn connections between IB and supervised learning, in particular, classification using neural networks [ 19 , 20 ].…”
Section: Introductionmentioning
confidence: 99%
“…The redundancy in rate of existing speech coders can be determined from estimates of the information rate in speech. A recent rate estimate [5] based on comparing signals with the same message is consistent with lexical information rates computed from phoneme statistics [6]. They suggest that the true information rate is less than 100 b/s.…”
Section: Introductionmentioning
confidence: 57%
“…The representation of speech used in this paper is based on a crude model of the human auditory system and was motivated using information theoretic arguments in [21] and [27]. Let {x i } be a real-valued random process that represents the samples of an acoustic speech signal where i is the sample index and let {x t } be the short-time Fourier transform (STFT) of {x i } where t is the frame index.…”
Section: A the Communication Channelmentioning
confidence: 99%
“…(4) To estimate (3), realisations of M t and Y t are needed. Estimating a realisation of M t requires a chorus of speech signals (see [27]). In typical applications of intelligibility prediction, such a chorus is not available, so instead we use an upper bound on (3).…”
Section: B Information Rate Of the Communication Channelmentioning
confidence: 99%
See 1 more Smart Citation