2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2014
DOI: 10.1109/icassp.2014.6855122

High-performance Query-by-Example Spoken Term Detection on the SWS 2013 evaluation

Abstract: In recent years, the task of Query-by-Example Spoken Term Detection (QbE-STD), which aims to find occurrences of a spoken query in a set of audio documents, has gained the interest of the research community for its versatility in settings where untranscribed, multilingual and acoustically unconstrained spoken resources, or spoken resources in low-resource languages, must be searched. This paper describes and reports experimental results for a QbE-STD system that achieved the best performance in the recent Sp…

Citations: cited by 67 publications (80 citation statements)
References: 15 publications
“…For example, dimensionality reduction can be applied to reduce the stacked feature vector. We will also compare our method with the latest DTW QbyE systems, as described in [25,27].…”
Section: Discussion (mentioning)
confidence: 99%
“…In our experiments we ignore this effect and simply choose the first template randomly. Another option is to choose the longest template as the first one, as proposed in [25]. Table 1 lists keywords used in our experiments.…”
Section: Template Averaging (mentioning)
confidence: 99%
“…Nevertheless, exemplar-based speech processing faces two fundamental problems: (1) The growing size of the databases prohibits efficient search, and (2) The duration variation in speech pronunciation is effectively handled via dynamic time warping that is computationally expensive and sub-optimal due to dependency on the local reference exemplar. This paper addresses these limitations to foster exemplar based solutions for real time applications.…”
Section: State-of-the-art Solutions and Challenges (mentioning)
confidence: 99%
“…QbE-STD received serious consideration in the context of MediaEval spoken query search benchmarking campaign [1,2,3]. Recent exemplar based speech processing offers high flexibility in speech applications, partly attributed to the lack of complex statistical assumptions that facilitate exploiting "data deluge" with no prejudice on expected answers.…”
Section: State-of-the-art Solutions and Challenges (mentioning)
confidence: 99%
“…The DTW algorithm is a dynamic programming technique to compute the distance between two sequences of spectral vectors of arbitrary length, and is commonly applied in query-by-example spoken term detection and other data mining tasks (Rodriguez-Fuentes et al., 2014; Keogh and Ratanamahatana, 2005). Being a non-parametric approach, it is well-suited for limited- or zero-resource tasks (Versteegh et al., 2015).…”
Section: DTW System (mentioning)
confidence: 99%
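
The dynamic-programming idea behind DTW, as described in the citation statement above, can be sketched roughly as follows. This is a minimal illustration of plain global DTW with a Euclidean local distance, not the system from the cited paper: the function name `dtw_distance`, the choice of local distance, and the absence of path-length normalization or subsequence search are all assumptions made for the example.

```python
import numpy as np


def dtw_distance(x, y):
    """Accumulated cost of the best warping path between two sequences.

    x has shape (n, d) and y has shape (m, d): n and m may differ, but both
    sequences must share the feature dimension d. (Hypothetical helper,
    written for illustration only.)
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n, m = len(x), len(y)

    # Local cost: Euclidean distance between every pair of frames.
    cost = np.linalg.norm(x[:, None, :] - y[None, :, :], axis=2)

    # acc[i, j] = cheapest cost of aligning x[:i] with y[:j].
    acc = np.full((n + 1, m + 1), np.inf)
    acc[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Extend the cheapest of the three allowed predecessor steps.
            best_prev = min(acc[i - 1, j], acc[i, j - 1], acc[i - 1, j - 1])
            acc[i, j] = cost[i - 1, j - 1] + best_prev
    return float(acc[n, m])


# Toy one-dimensional "feature" sequences of different lengths.
a = [[0.0], [1.0], [2.0]]
b = [[0.0], [1.0], [1.0], [2.0]]  # same shape as a, with one frame repeated
c = [[0.0], [2.0]]

print(dtw_distance(a, b))
print(dtw_distance(a, c))
```

The double loop costs O(n·m) time and memory per pair of sequences, which is the computational burden that one of the statements above raises for large audio collections.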