2020
DOI: 10.1109/taslp.2020.2984089
|View full text |Cite
|
Sign up to set email alerts
|

Improved External Speaker-Robust Keyword Spotting for Hearing Assistive Devices

Abstract: For certain applications, keyword spotting (KWS) requires some degree of personalization. This is the case for KWS for hearing assistive devices, e.g., hearing aids, where only the device user should be allowed to trigger the KWS system. In this paper, we first develop a new realistic hearing aid experimental framework. Next, using this framework we show that the performance of a state-of-the-art multi-task deep learning architecture exploiting cepstral features for joint KWS and users' own-voice/external spea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
9
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 17 publications
(9 citation statements)
references
References 38 publications
0
9
0
Order By: Relevance
“…Personalized devices, such as hearing assistive devices, require the ability to detect external speakers and prevent them from triggering the device. [22] developed a multi-tasking keyword spotting model with the ability to detect non-users.…”
Section: Multi-tasking Models Attained By Joint Trainingmentioning
confidence: 99%
See 2 more Smart Citations
“…Personalized devices, such as hearing assistive devices, require the ability to detect external speakers and prevent them from triggering the device. [22] developed a multi-tasking keyword spotting model with the ability to detect non-users.…”
Section: Multi-tasking Models Attained By Joint Trainingmentioning
confidence: 99%
“…Joint training of speech and speaker recognition as a unified multi-tasking model usually has the following one or both benefits over training two independent models. First, the two tasks can share the data processing or feature learning pipeline to some extent [26,22,28,18,17]. Second, each can benefit from the improved performance of the other [28,18].…”
Section: Multi-tasking Models Attained By Joint Trainingmentioning
confidence: 99%
See 1 more Smart Citation
“…In this mode, the input segment x is often assigned to the class with highest posterior probability. The non-streaming deep KWS systems generated sharply peaked posterior distributions in [53][54]. One possible explanation for this phenomenon is that the non-streaming systems handle isolated, welldefined class realizations, and not the inter-class transition information, as opposed to the streaming system case.…”
Section: Introductionmentioning
confidence: 97%
“…Wake word spotting (WWS) can be considered as a specific case of keyword spotting (KWS), concerning the identification of predefined wake word(s) in utterances, often used for the wake-up of speech-enabled devices, such as "Hey Siri" in iPhone, "Alexa" in Amazon Echo, and "Ok Google" in Google Home [1,2,3,4], etc. In order to activate the interactions between devices and users, a standby wake word detection module is particularly important [5].…”
Section: Introductionmentioning
confidence: 99%