2021
DOI: 10.48550/arxiv.2107.04677
Preprint

Noisy Training Improves E2E ASR for the Edge

Abstract: Automatic speech recognition (ASR) has become increasingly ubiquitous on modern edge devices. Past work developed streaming End-to-End (E2E) all-neural speech recognizers that can run compactly on edge devices. However, E2E ASR models are prone to overfitting and have difficulties in generalizing to unseen testing data. Various techniques have been proposed to regularize the training of ASR models, including layer normalization, dropout, spectrum data augmentation and speed distortions in the inputs. In this w…

Citations: cited by 1 publication (1 citation statement)
References: 28 publications (30 reference statements)
“…Using machine learning to enable multiple applications on edge devices requires multiple task-specific persistent models (Yang et al, 2020). These models are used for tasks ranging from computer vision (Howard et al, 2019) to automatic speech recognition (Wang et al, 2021). The trend towards multiple applications and multiple models is constrained by the fact that off-chip memory reads incur high latency and power costs (Sze et al, 2017).…”
Section: Introduction
Citation type: mentioning
confidence: 99%