2020
DOI: 10.1109/lsp.2019.2959917
|View full text |Cite
|
Sign up to set email alerts
|

Spectro-Temporal Attention-Based Voice Activity Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
15
0
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 29 publications
(17 citation statements)
references
References 18 publications
1
15
0
1
Order By: Relevance
“…To further improve the performance at low signal-to-noise ratios (SNRs), several studies have proposed the use of context information. For instance, Zhang et al [2] proposed aggregating predictions from different temporal contexts, and other approaches employed diverse forms of attention mechanisms [3], [4], [5].…”
Section: Background and Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…To further improve the performance at low signal-to-noise ratios (SNRs), several studies have proposed the use of context information. For instance, Zhang et al [2] proposed aggregating predictions from different temporal contexts, and other approaches employed diverse forms of attention mechanisms [3], [4], [5].…”
Section: Background and Related Workmentioning
confidence: 99%
“…As opposed to the previous cell-based NAS methods that use eight cells [21], [19], we use only four cells because general VAD models [32], [3], [4], [5] have a shallower depth for lower latency. In our four-cell structure, the first two cells are convolution cells, and the next two cells are attention cells.…”
Section: B Macro Architecturementioning
confidence: 99%
See 3 more Smart Citations