Proceedings of the 18th ACM International Conference on Multimedia 2010
DOI: 10.1145/1873951.1874004
|View full text |Cite
|
Sign up to set email alerts
|

Combining multi-probe histogram and order-statistics based LSH for scalable audio content retrieval

Abstract: In order to improve the reliability and the scalability of content-based retrieval of variant audio tracks from large music databases, we suggest a new multi-stage LSH scheme that consists in (i) extracting compact but accurate representations from audio tracks by exploiting the LSH idea to summarize audio tracks, and (ii) adequately organizing the resulting representations in LSH tables, retaining almost the same accuracy as an exact kNN retrieval. In the first stage, we use major bins of successive chroma fe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2012
2012
2023
2023

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 22 publications
(23 citation statements)
references
References 16 publications
0
23
0
Order By: Relevance
“…The MPH is a fixed-size representation of a chroma sequence which is determined by the dominant pitch classes transition between all adjacent frames of the sequence. A more detailed description can be found in [12]. Such histograms reflect the tonal structure of local portions of the audio and allow to model the evolution of the tonal context through time and capture slow varying harmonic patterns that relate to musical patterns.…”
Section: System Overviewmentioning
confidence: 99%
See 1 more Smart Citation
“…The MPH is a fixed-size representation of a chroma sequence which is determined by the dominant pitch classes transition between all adjacent frames of the sequence. A more detailed description can be found in [12]. Such histograms reflect the tonal structure of local portions of the audio and allow to model the evolution of the tonal context through time and capture slow varying harmonic patterns that relate to musical patterns.…”
Section: System Overviewmentioning
confidence: 99%
“…In [8] a contextual measure of similarity that considers sequences of feature frames instead of single frames in the similarity matrix computation allows to strengthen the visualization of repetitive patterns in the audio features. In [6], the evolution of the tonal context in the audio signal is described by concatenating mid-term chroma sequences in Multi-Probe Histograms [12].…”
Section: Introductionmentioning
confidence: 99%
“…(iv) Global summarization retaining harmonic progressions. A multi-probe histogram [5] is calculated from the chroma feature sequences by heuristically probing the transition between major bins of adjacent chroma features, which is able to retain local spectral and temporal information to some degree. It is more concise compared with local summarization, and is more accurate than conventional global summarization by retaining the temporal information of music signals.…”
Section: Related Workmentioning
confidence: 99%
“…for t = 2, 3, · · · , l do Forward iteration 5: for t = l − 1, l − 2, · · · , 1 do Reverse iteration 11: S t ← j∈y t+1 p t+1,j 12:…”
Section: B N-best Chord Progression Recognitionmentioning
confidence: 99%
See 1 more Smart Citation