2008
DOI: 10.1109/icassp.2008.4517543
|View full text |Cite
|
Sign up to set email alerts
|

Audio retrieval by latent perceptual indexing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
29
0

Year Published

2011
2011
2020
2020

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 26 publications
(29 citation statements)
references
References 4 publications
0
29
0
Order By: Relevance
“…For this reason, Section 2 addresses the uncovering of an ontology from the tags [14] in an unsupervised form, to investigate whether such an ontology is not an imposed construction. Because a latent structure has been assumed, we use a technique called vector-based semantic analysis, which is a generalization of Latent Semantic Analysis [15] and similar to the methods used in latent semantic mapping [16] and latent perceptual indexing [17]. Thus, although some of the terminology is borrowed from these areas, our method is also different in several crucial respects.…”
Section: Social Taggingmentioning
confidence: 99%
“…For this reason, Section 2 addresses the uncovering of an ontology from the tags [14] in an unsupervised form, to investigate whether such an ontology is not an imposed construction. Because a latent structure has been assumed, we use a technique called vector-based semantic analysis, which is a generalization of Latent Semantic Analysis [15] and similar to the methods used in latent semantic mapping [16] and latent perceptual indexing [17]. Thus, although some of the terminology is borrowed from these areas, our method is also different in several crucial respects.…”
Section: Social Taggingmentioning
confidence: 99%
“…Sparse codes are also known to be analogous to the coding mechanism in neural sensory system (see [97] and references therein). Interestingly, mathematical analogy between sparse representation of data and dimension reduction using matrix factorization (used in the bag-of-units representation by [93]) has also been observed in the context of speech recognition (the reader is referred to [102] for an overview). Starting from perceptual features, retrieval techniques using these representations can therefore be used to further render higher level sensory processes in the auditory system.…”
Section: A Semantic Audio Retrievalmentioning
confidence: 90%
“…By modeling audio as a collection of units, the approach is able to scale to (arbitrary) collection of audio clips. Notable examples include latent perceptual indexing by Sundaram et al [93], [94], the related anchor-space model by Lu et al [95] and Lee et al [96] and the bag-of-patterns representation used by Lyon et al [97]. These techniques formalize the method discussed in Slaney et al [98] where a form of unit-document co-occurrence is implicitly used for semantic information extraction from audio.…”
Section: A Semantic Audio Retrievalmentioning
confidence: 99%
See 1 more Smart Citation
“…Although audio analysis has been widely studied in scene classification [8,9,10], audio segmentation [11,12,13], and audio retrieval [14,15,16], to our knowledge, automatic audio tagging has not been much explored. Bertin-Mahieux et al [17] treated audio tag prediction as a set of binary classification problems and applied the Adaboost algorithm to the task.…”
Section: Introductionmentioning
confidence: 99%