2010
DOI: 10.1007/s00530-010-0212-y
|View full text |Cite
|
Sign up to set email alerts
|

M-MUSICS: an intelligent mobile music retrieval system

Abstract: Accurate voice humming transcription and efficient indexing and retrieval schemes are essential to a large-scale humming-based audio retrieval system. Although much research has been done to develop such schemes, their performance in terms of precision, recall, and F-measure, among all similarity metrics, are still unsatisfactory. In this paper, we propose a new voice query transcription scheme. It considers the following features: note onset detection using dynamic threshold methods, fundamental frequency (F0… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2013
2013
2017
2017

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 21 publications
0
2
0
Order By: Relevance
“…Another possible future direction is to combine data hiding [watermarking (Liu et al 2015 ; Liu et al 2016 ), image and video steganography (Mstafa and Elleithy 2015 ; Muhammad et al 2015 ; Lin et al 2015 )] with the video summarization frameworks by embedding the patient and gynecologists data in DH videos/keyframes, resulting in secure and privacy-preserving VS framework as presented in ( Muhammad et al 2015 ) for secure visual contents retrieval from personalized repositories and other mobile healthcare applications (Lv et al 2016 ). Furthermore, we are also planning to explore deep learning and incorporate GPUs based processing (Mei and Tian 2016 ; Mei 2014 ) for efficient keyframes extraction, their indexing and retrieval (Rho et al 2008 ; Rho et al 2011 ; Rho and Hwang 2006 ).…”
Section: Discussionmentioning
confidence: 99%
“…Another possible future direction is to combine data hiding [watermarking (Liu et al 2015 ; Liu et al 2016 ), image and video steganography (Mstafa and Elleithy 2015 ; Muhammad et al 2015 ; Lin et al 2015 )] with the video summarization frameworks by embedding the patient and gynecologists data in DH videos/keyframes, resulting in secure and privacy-preserving VS framework as presented in ( Muhammad et al 2015 ) for secure visual contents retrieval from personalized repositories and other mobile healthcare applications (Lv et al 2016 ). Furthermore, we are also planning to explore deep learning and incorporate GPUs based processing (Mei and Tian 2016 ; Mei 2014 ) for efficient keyframes extraction, their indexing and retrieval (Rho et al 2008 ; Rho et al 2011 ; Rho and Hwang 2006 ).…”
Section: Discussionmentioning
confidence: 99%
“…Finally, a different kind of IR, related to query by humming, is reported by Rho et al [154]. The target of the paper is music retrieval, where human voice is used to produce a short clip of singing, whistling or humming to give a rough approximation of the music requested.…”
Section: Spoken Inputmentioning
confidence: 99%