2021
DOI: 10.48550/arxiv.2109.14420
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition

Abstract: formance than the cascaded re-scoring and correction pipeline and can serve as a unified postprocessing module for ASR.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 23 publications
0
2
0
Order By: Relevance
“…Team affecting the effect of simultaneous translation in real scenes. Some work has attempted to improve the pipeline systems by introducing an ASR error correction model (Leng et al, 2021;Zhang et al, 2021a), others proposed pre-training approaches to alleviate the data scarcity problem of speech translation corpora in end-to-end systems Pino et al, 2020;Zheng et al, 2021;Li et al, 2020b;. We hope to see more participants in future workshops investigating how to close the performance gap between the two tracks.…”
Section: Rankmentioning
confidence: 97%
“…Team affecting the effect of simultaneous translation in real scenes. Some work has attempted to improve the pipeline systems by introducing an ASR error correction model (Leng et al, 2021;Zhang et al, 2021a), others proposed pre-training approaches to alleviate the data scarcity problem of speech translation corpora in end-to-end systems Pino et al, 2020;Zheng et al, 2021;Li et al, 2020b;. We hope to see more participants in future workshops investigating how to close the performance gap between the two tracks.…”
Section: Rankmentioning
confidence: 97%
“…To this end, researchers suggested text-to-text error correction methods specifically designed to correct transcribed errors (Mani et al, 2020;Leng et al, 2021a;Hrinchuk et al, 2020). These transformer-based methods typically requires a massive and diverse training data (i.e., transcribed audio segments and their corresponding humanauthored transcriptions) which is rarely at reach.…”
Section: Introductionmentioning
confidence: 99%