2020
DOI: 10.48550/arxiv.2008.02323
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 0 publications
0
1
0
Order By: Relevance
“…Conventional KWS system has been developed maturely, including large vocabulary continuous speech recognition (LVCSR) based lattice search [1,2,3], hidden Markov model (HMM) based keyword-filler method [4,5,6], discriminative models based on large-margin formulation or recurrent networks and query-by-example (QbyE) based template matching approaches [7,8,9,10,11]. Recently, with the development of deep learning and its successful applications, deep KWS frameworks have been introduced [12,13,14,15,16,17,18]. In the deep KWS family, an acoustic model is trained to predict the sub-word of keyword and a posterior handling method is followed to generate a confidence score of the whole keyword.…”
Section: Introductionmentioning
confidence: 99%
“…Conventional KWS system has been developed maturely, including large vocabulary continuous speech recognition (LVCSR) based lattice search [1,2,3], hidden Markov model (HMM) based keyword-filler method [4,5,6], discriminative models based on large-margin formulation or recurrent networks and query-by-example (QbyE) based template matching approaches [7,8,9,10,11]. Recently, with the development of deep learning and its successful applications, deep KWS frameworks have been introduced [12,13,14,15,16,17,18]. In the deep KWS family, an acoustic model is trained to predict the sub-word of keyword and a posterior handling method is followed to generate a confidence score of the whole keyword.…”
Section: Introductionmentioning
confidence: 99%