2020
DOI: 10.1007/s10772-020-09732-9
|View full text |Cite
|
Sign up to set email alerts
|

Multilingual spoken term detection: a review

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 41 publications
0
4
0
Order By: Relevance
“…𝑃 = (𝑁 𝑑𝑝 (𝑁 𝑑𝑝 + 𝑁 𝑓𝑝 ) ⁄ ) Γ— 100 (6) In the equations, Ntp, Nfp, Nfn, and Ntotal referred to the number of true positives, false positives, false negatives, and total samples in all the segments respectively. Furthermore, the performance was evaluated using area under curve (AUC) and detection error trade-off (DET) curve.…”
Section: B Evaluation Metricsmentioning
confidence: 99%
See 2 more Smart Citations
“…𝑃 = (𝑁 𝑑𝑝 (𝑁 𝑑𝑝 + 𝑁 𝑓𝑝 ) ⁄ ) Γ— 100 (6) In the equations, Ntp, Nfp, Nfn, and Ntotal referred to the number of true positives, false positives, false negatives, and total samples in all the segments respectively. Furthermore, the performance was evaluated using area under curve (AUC) and detection error trade-off (DET) curve.…”
Section: B Evaluation Metricsmentioning
confidence: 99%
“…One of the popular techniques was Spoken Term searching technique, which could be further divided into Spoken Term Detection (STD) and Keyword Spotting (KWS). Spoken Term Detection (STD), or spoken term discovery, is the identification of recurrent speech fragments from raw speech without prior knowledge of the language, i.e., automatic retrieval of speech from a database through specific audio keywords or queries [4][5][6].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…In particular, when applied to large collections with multilingual content, where the recordings cover a wide time span, such as found in OH collections, ASR proves to be a difficult task. A paradigm called transfer learning is a potential solution to training complex ASR models for under-resourced languages [25]. In this approach, a model trained on a well-resourced language serves as a basis.…”
Section: Automatic Speech Recognitionmentioning
confidence: 99%