2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2013
DOI: 10.1109/apsipa.2013.6694252
|View full text |Cite
|
Sign up to set email alerts
|

Towards a more efficient sparse coding based audio-word feature extraction system

Abstract: This paper is concerned with the efficiency of sparse coding based audio-word feature extraction system. In particular, we have defined and added the concept of early and late temporal pooling to the classic sparse coding based audio-word feature extraction pipeline, and we have tested them on the genre tags subset of the CAL10k data set. We define temporal pooling as any functions that are able to transforms the input time series representation into a more temporally compact representation. Under this definit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2014
2014
2015
2015

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 27 publications
0
3
0
Order By: Relevance
“…The values for these metrics all fall within [0,1], and larger values indicate better performance. For each tag, we rank the test clips in descending order of the decision values computed by SVM and calculate the above measures according to the ranking [16]. We select only one exemplar for each frame in (2) with , and use the voting-based method in (3), such that .…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The values for these metrics all fall within [0,1], and larger values indicate better performance. For each tag, we rank the test clips in descending order of the decision values computed by SVM and calculate the above measures according to the ranking [16]. We select only one exemplar for each frame in (2) with , and use the voting-based method in (3), such that .…”
Section: Methodsmentioning
confidence: 99%
“…to solve for ) [23], [46], and temporal pooling methods (i.e. to get ) [16], [47] have been proposed and compared. The focus of this letter, however, is to investigate efficient ways of exploiting unlabeled exemplars themselves as the dictionary atoms, a topic that is seldom addressed before.…”
Section: A Clip-level Lasso Screeningmentioning
confidence: 99%
See 1 more Smart Citation