2021
DOI: 10.48550/arxiv.2107.00764
Preprint

Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition

Abstract: Commonly used automatic speech recognition (ASR) systems can be classified into frame-synchronous and label-synchronous categories, based on whether the speech is decoded on a per-frame or per-label basis. Frame-synchronous systems, such as traditional hidden Markov model systems, can easily incorporate existing knowledge and can support streaming ASR applications. Label-synchronous systems, based on attention-based encoder-decoder models, can jointly learn the acoustic and language information with a single mod…

Citations: cited by 1 publication (1 citation statement)
References: 34 publications
“…Given that both hybrid and E2E modeling have their own advantages, a natural question to ask is whether we can combine them to have the best of both worlds. Several publications explored this direction [17,18,19]. In these works, hybrid and E2E models are trained independently, and hypothesis level combination between the two systems is carried out by either rescoring the N-best and lattice from hybrid system with an E2E model [17,18], or Minimum Bayes' Risk (MBR) combination of the N-best lists of the two systems [19].…”
Section: Introduction
Citation type: mentioning
confidence: 99%
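
The citation statement above describes hypothesis-level combination in which an N-best list from one system is rescored with a second model. A minimal sketch of that idea follows, assuming a simple log-linear interpolation of the two scores. The function name `rescore_nbest`, the `weight` parameter, and the toy scores are illustrative assumptions, not taken from the cited papers, which may combine scores differently.

```python
from typing import Callable, List, Tuple


def rescore_nbest(
    nbest: List[Tuple[str, float]],
    second_pass_score: Callable[[str], float],
    weight: float = 0.5,
) -> List[Tuple[str, float]]:
    """Re-rank an N-best list by interpolating two log-domain scores.

    nbest: (hypothesis, first_pass_log_score) pairs, e.g. from a hybrid system.
    second_pass_score: hypothetical callable returning a log score for a
        hypothesis, standing in for an E2E model.
    weight: assumed interpolation weight given to the second-pass score.
    """
    rescored = [
        (hyp, (1.0 - weight) * first + weight * second_pass_score(hyp))
        for hyp, first in nbest
    ]
    # Highest combined log score first.
    return sorted(rescored, key=lambda pair: pair[1], reverse=True)


# Toy example with made-up scores: the second pass prefers "a c".
toy_nbest = [("a b", -1.0), ("a c", -1.2)]
toy_second_pass = {"a b": -3.0, "a c": -0.5}
ranked = rescore_nbest(toy_nbest, toy_second_pass.__getitem__, weight=0.5)
best_hypothesis = ranked[0][0]
```

With `weight=0.0` the first-pass ranking is returned unchanged, so the weight controls how much the second system can reorder the list. In practice it would be tuned on held-out data.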