2020
DOI: 10.48550/arxiv.2012.10018
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

NeurST: Neural Speech Translation Toolkit

Abstract: NeurST is an open-source toolkit for neural speech translation developed by Bytedance AI Lab. The toolkit mainly focuses on end-toend speech translation, which is easy to use, modify, and extend to advanced speech translation research and products. NeurST aims at facilitating the speech translation research for NLP researchers and provides a complete setup for speech translation benchmarks, including feature extraction, data preprocessing, distributed training, and evaluation. Moreover, The toolkit implements … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 30 publications
0
2
0
Order By: Relevance
“…NeurST (Zhao et al, 2020) 22.9 33.3 Fairseq S2T (Wang et al, 2020a) 22.7 32.9 ESPnet-ST (Inaguma et al, 2020) 22.9 32.7 Dual-Decoder (Le et al, 2020) 23.6 33.5…”
Section: Extending To Must-c Language Pairsmentioning
confidence: 99%
“…NeurST (Zhao et al, 2020) 22.9 33.3 Fairseq S2T (Wang et al, 2020a) 22.7 32.9 ESPnet-ST (Inaguma et al, 2020) 22.9 32.7 Dual-Decoder (Le et al, 2020) 23.6 33.5…”
Section: Extending To Must-c Language Pairsmentioning
confidence: 99%
“…These models are simpler in structure and they are more suitable for operating in streaming fashion. Most End-to-End speech translation systems are variants of encoderdecoder architecture with attention models (Bahdanau et al, 2015;Di Gangi et al, 2019;Zhao et al, 2020). This category includes the popular Transformer models, which have been adapted for training End-to-End ST in (Di Gangi et al, 2019).…”
Section: Introductionmentioning
confidence: 99%