2021
DOI: 10.48550/arxiv.2109.08839
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SpeechNAS: Towards Better Trade-off between Latency and Accuracy for Large-Scale Speaker Verification

Abstract: Recently, has been a successful and popular approach for speaker verification, which employs a time delay neural network (TDNN) and statistics pooling to extract speaker characterizing embedding from variable-length utterances. Improvement upon the x-vector has been an active research area, and enormous neural networks have been elaborately designed based on the x-vector, e.g., extended TDNN (E-TDNN) [2], factorized TDNN (F-TDNN) [3], and densely connected TDNN (D-TDNN) [4]. In this work, we try to identify th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 48 publications
0
1
0
Order By: Relevance
“…Auto-Vector [11] utilizes an evolutionary algorithm enhanced NAS method to discover a promising x-vector network. SpeechNAS [28] applies Bayesian optimization to conduct branch-wise and channel-wise selection in the search space of Densely connected TDNN. These works may suffer a weak correlation between the performance of the searched architectures and the ones trained from scratch [29].…”
Section: B Neural Architecture Searchmentioning
confidence: 99%
“…Auto-Vector [11] utilizes an evolutionary algorithm enhanced NAS method to discover a promising x-vector network. SpeechNAS [28] applies Bayesian optimization to conduct branch-wise and channel-wise selection in the search space of Densely connected TDNN. These works may suffer a weak correlation between the performance of the searched architectures and the ones trained from scratch [29].…”
Section: B Neural Architecture Searchmentioning
confidence: 99%