“…The publicly available Google Speech Commands Dataset [153], [154] has become the de facto open benchmark for … (see [30] for further details). Multiple recent deep KWS works have employed either the first version [16], [30], [32], [43], [48]-[52], [57], [58], [67], [69], [70], [86], [90], [100], [125] or the second version [32], [47], [48], [53], [70], [82], [89], [90], [99], [100], [109], [128]-[130], [159], [175] of the Google Speech Commands Dataset. Despite how valuable this open reference is for KWS research and development, we can raise two relevant points of criticism:…”