ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9053213
|View full text |Cite
|
Sign up to set email alerts
|

Joint Contextual Modeling for ASR Correction and Language Understanding

Abstract: The quality of automatic speech recognition (ASR) is critical to Dialogue Systems as ASR errors propagate to and directly impact downstream tasks such as language understanding (LU). In this paper, we propose multi-task neural approaches to perform contextual language correction on ASR outputs jointly with LU to improve the performance of both tasks simultaneously. To measure the effectiveness of this approach we used a public benchmark, the 2nd Dialogue State Tracking (DSTC2) corpus. As a baseline approach, w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
31
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 42 publications
(31 citation statements)
references
References 21 publications
0
31
0
Order By: Relevance
“…On the other hand, there has been work on re-scoring ASR nbest by exploring the morphological, lexical, and syntactic features [21,22]. [23] shows a joint model for ASR error correction and language understanding tasks such as dialog act prediction and slot filling. Our work differs in that we are not attempting to correct the ASR error by re-aligning the hypotheses but considering the n-best directly for downstream NLU tasks jointly..…”
Section: Effect Of Multiple Cnnsmentioning
confidence: 99%
“…On the other hand, there has been work on re-scoring ASR nbest by exploring the morphological, lexical, and syntactic features [21,22]. [23] shows a joint model for ASR error correction and language understanding tasks such as dialog act prediction and slot filling. Our work differs in that we are not attempting to correct the ASR error by re-aligning the hypotheses but considering the n-best directly for downstream NLU tasks jointly..…”
Section: Effect Of Multiple Cnnsmentioning
confidence: 99%
“…The inverse transformation of correcting ASR errors has also been explored, for example in Weng et al (2020).…”
Section: Related Workmentioning
confidence: 99%
“…In AED, DB can also be applied by combining a biasing vector with the decoder output before passing them to the Softmax output layer, as shown in Eqn. (12).…”
Section: Joint Networkmentioning
confidence: 99%
“…Contextual biasing, which integrates contextual knowledge into an automatic speech recognition (ASR) system, has become increasingly important to many applications [1][2][3][4][5][6][7][8][9][10][11][12][13][14][15][16][17][18]. Contextual knowledge is often represented by a list (referred to as a biasing list) of words or phrases (referred to as biasing words) that are likely to appear in an utterance in a given context.…”
Section: Introductionmentioning
confidence: 99%