2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2012
DOI: 10.1109/icassp.2012.6288898
|View full text |Cite
|
Sign up to set email alerts
|

Improving acoustic based keyword spotting using LVCSR lattices

Abstract: This paper investigates detection of English keywords in a conversational scenario using a combination of acoustic and LVCSR based keyword spotting systems. Acoustic KWS systems search predefined words in parameterized spoken data. Corresponding confidences are represented by likelihood ratios given the keyword models and a background model. First, due to the especially high number of false-alarms, the acoustic KWS system is augmented with confidence measures estimated from corresponding LVCSR lattices. Then, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
15
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
4
3
3

Relationship

1
9

Authors

Journals

citations
Cited by 28 publications
(15 citation statements)
references
References 8 publications
0
15
0
Order By: Relevance
“…We used the conventional ASR decoder parameters to obtain word recognition lattices [14] (beam width of 13). The same type of lattices has been used previously for various tasks [22,23,24]. From these lattices, we obtain the senone posteriors, by fixing the acoustic scale parameter to 0.01, in order to obtain i-vectors that follow a Gaussian distribution.…”
Section: Asr Systemmentioning
confidence: 99%
“…We used the conventional ASR decoder parameters to obtain word recognition lattices [14] (beam width of 13). The same type of lattices has been used previously for various tasks [22,23,24]. From these lattices, we obtain the senone posteriors, by fixing the acoustic scale parameter to 0.01, in order to obtain i-vectors that follow a Gaussian distribution.…”
Section: Asr Systemmentioning
confidence: 99%
“…Pham et al [3] proposed the system and keyword dependent fusion method SKDWCombMNZ in 2014, which ourperformed other arithmetic-based methods. Discriminative system fusion methods employing classifiers have been explored in [3,4,5]. With large number of features from lattices and detection lists, discriminative fusion can often achieve inspiring performance.…”
Section: Introductionmentioning
confidence: 99%
“…In [2], confusion garbage model was developed to absorb similar pronunciation words confused with the specific keywords of a task. The combination of acoustic and LVCSR based keyword system was proposed as a method that the lattice generated from LVCSR was used to improve performance [3]. Even there are so many methods, the most widely used is the calculation and optimization of confidence measure.…”
Section: Introductionmentioning
confidence: 99%