First International Symposium on Control, Communications and Signal Processing, 2004. 2004
DOI: 10.1109/isccsp.2004.1296517
|View full text |Cite
|
Sign up to set email alerts
|

AMADEUS: a scalable HMM-based audio information retrieval system

Abstract: The new transmission and storage technologies now available have put together a vast amount of digital audio. All this audio is ready and easy to transfer but it might be useless with a clear knowledge of its content as metadata attached to it. This knowledge can be manually added but this is not feasible for millions of on-line files. In this paper we present a method to automatically derive acoustic information about audio files and a technology to classify and retrieve audio examples.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2006
2006
2010
2010

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(5 citation statements)
references
References 8 publications
(2 reference statements)
0
5
0
Order By: Relevance
“…energy-based features, or auto correlation values). In [1,2], for example, MFCCs are used for HMMbased song identification whereas in [9] statistical features describing pitch, rhythm etc. are applied for mixture-density based genre classification.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…energy-based features, or auto correlation values). In [1,2], for example, MFCCs are used for HMMbased song identification whereas in [9] statistical features describing pitch, rhythm etc. are applied for mixture-density based genre classification.…”
Section: Related Workmentioning
confidence: 99%
“…In [1,2] a system for song identification in radio broadcasts has been presented. The authors use an MFCC-based feature representation of music for un-supervised training of HMMs for modeling generic acoustic generators (GAGs).…”
Section: Related Workmentioning
confidence: 99%
“…Many content-based audio information retrieval (AIR) systems have been proposed to date [3][4][5][6][7]. They typically extract a set of acoustic features such as pitch, duration and rhythm from audio and transcribe them into some symbolic representation for efficient audio retrieval.…”
Section: Introductionmentioning
confidence: 99%
“…All the probabilities derived can be seen as a distance between two songs and, therefore, a metric (with some restrictions) can be defined with acoustical similarities as explained in [2]. …”
Section: Rationalementioning
confidence: 99%
“…All the training process is based on a modification of the Baum-Welch algorithm (also based o n the EM algorithm [4]) and is explained in further detail in [2].…”
Section: Training Of the Systemmentioning
confidence: 99%