“…Conventional KWS system has been developed maturely, including large vocabulary continuous speech recognition (LVCSR) based lattice search [1,2,3], hidden Markov model (HMM) based keyword-filler method [4,5,6], discriminative models based on large-margin formulation or recurrent networks and query-by-example (QbyE) based template matching approaches [7,8,9,10,11]. Recently, with the development of deep learning and its successful applications, deep KWS frameworks have been introduced [12,13,14,15,16,17,18]. In the deep KWS family, an acoustic model is trained to predict the sub-word of keyword and a posterior handling method is followed to generate a confidence score of the whole keyword.…”