Semantic Multimedia and Ontologies
DOI: 10.1007/978-1-84800-076-6_5

Audio Content Analysis

Cited by 8 publications (2 citation statements)
References 38 publications
“…Such metrics have often been augmented with temporal information, which was found to improve the robustness of content identification [17,18]. Common modeling of temporal dynamics also ranged from simple summary statistics such as onsets, attack time, velocity, acceleration and higher-order moments to more sophisticated statistical temporal modeling using Hidden Markov Models, Artificial Neural Networks, Adaptive Resonance Theory models, Liquid State Machine systems and Self-Organizing Maps [19,20]. Overall, the choice of features was very dependent on the task at hand, the complexity of the dataset, and the desired performance level and robustness of the system.…”
Section: Introduction (mentioning)
confidence: 99%
“…In the literature on knowledge-based methods, the dominant approach to multimedia analysis is through segmentation, which can be spatial (spatial segmentation) [58], spatio-temporal (spatio-temporal segmentation) [56], or by speaker in audio (speaker segmentation) [98]…”
Section: Positioning and contribution of the work (unclassified)