1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings
DOI: 10.1109/icassp.1996.548008
|View full text |Cite
|
Sign up to set email alerts
|

Application of loudness/pitch/timbre decomposition operators to auditory scene analysis

Abstract: We proposed[1] nonlinear operators which decompose a changing energy of sound in wavelet domain into three orthogonal components: i.e., loudness and pitch as coherent changes, and timbre as incoherent change. We showed that they could detect the discontinuity of a single sound stream with excellent temporal resolution and sensitivity. In this paper, we extend the coherency principle so that it can describe and pursue the individual coherency of non-overlapping sound streams in wavelet domain. It is realized by… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(8 citation statements)
references
References 7 publications
0
8
0
Order By: Relevance
“…Figure 9 shows the continuous traces of the stream parameters based on Eq. (26). The initial value is the maximum point which occurred first significantly.…”
Section: Synthesized Sound With Two Streamsmentioning
confidence: 98%
“…Figure 9 shows the continuous traces of the stream parameters based on Eq. (26). The initial value is the maximum point which occurred first significantly.…”
Section: Synthesized Sound With Two Streamsmentioning
confidence: 98%
“…6. Graphical representation of the Gaussian pdf of a scalar (continuous) random variable (left) and a two-dimensional Gaussian random variable (right).…”
Section: =1mentioning
confidence: 99%
“…The mathematical object p(x|^) admits two interpretations: First, when read as a pdf of x for a given 6, p(x|0) is called the conditional pdf of x^ conditioned on 6. Second, when seen as a function of 6 for given x, p(x|0) is called the parameter likelihood function and it is defined over the space of all possible values of 9 denoted O.…”
Section: Likelihood Functionsmentioning
confidence: 99%
See 2 more Smart Citations