“…Regarding the first issue, the aim is to improve the accuracy of multi-variant audio track detection, and the existing proposals rely on pitch [9,10], Mel-Frequency Cepstral Coefficients (MFCC) [11,12,13] or Chroma [2,14]. Regarding the second issue, the goal is to accelerate similarity-based retrieval, and the existing proposals include tree structures [7,15,16], other hierarchical structures [17], LSH [4,11,18], Exact Euclidean LSH (E2LSH) [3,4] and other variants of LSH [6,11]. It is clear, however, that the two research issues are not independent: more accurate detection requires more elaborate representations of audio content, which in turn reduces scalability.…”