2019
DOI: 10.48550/arxiv.1911.02086
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Small-Footprint Keyword Spotting on Raw Audio Data with Sinc-Convolutions

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
19
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(19 citation statements)
references
References 0 publications
0
19
0
Order By: Relevance
“…Since pre-processed data like MFCC features won't be always available, few CNN architectures have been developed to work on raw audio data as input. One of the notable ones is the SCN architecture proposed by Mittermaier et al [11], which uses SincNet [14] and DS convolutions [5] to achieve comparable accuracy to the state-of-the-art TC-ResNet models.…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…Since pre-processed data like MFCC features won't be always available, few CNN architectures have been developed to work on raw audio data as input. One of the notable ones is the SCN architecture proposed by Mittermaier et al [11], which uses SincNet [14] and DS convolutions [5] to achieve comparable accuracy to the state-of-the-art TC-ResNet models.…”
Section: Related Workmentioning
confidence: 99%
“…2 shows the respective architectures. Architectures of TC-ResNet8 and SCN adopted from [4] and [11] respectively.…”
Section: Model Architecturesmentioning
confidence: 99%
See 3 more Smart Citations