2010 IEEE International Conference on Acoustics, Speech and Signal Processing 2010
DOI: 10.1109/icassp.2010.5494997
|View full text |Cite
|
Sign up to set email alerts
|

Improved statistical models for SMT-based speaking style transformation

Abstract: Automatic speech recognition (ASR) results contain not only ASR errors, but also disfluencies and colloquial expressions that must be corrected to create readable transcripts. We take the approach of statistical machine translation (SMT) to "translate" from ASR results into transcript-style text. We introduce two novel modeling techniques in this framework: a context-dependent translation model, which allows for usage of context to accurately model translation probabilities, and log-linear interpolation of con… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2010
2010
2018
2018

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(9 citation statements)
references
References 11 publications
0
9
0
Order By: Relevance
“…In the future, we will address taking a more sophisticated transformation model to deal with the correction of colloquial expressions like [10], which are difficult to consider in an unsupervised manner. In addition, if the construction of a confusion network is problematic, for example, in real-time applications, it is better to use lattices instead of confusion networks to represent multiple hypotheses.…”
Section: Discussionmentioning
confidence: 99%
See 4 more Smart Citations
“…In the future, we will address taking a more sophisticated transformation model to deal with the correction of colloquial expressions like [10], which are difficult to consider in an unsupervised manner. In addition, if the construction of a confusion network is problematic, for example, in real-time applications, it is better to use lattices instead of confusion networks to represent multiple hypotheses.…”
Section: Discussionmentioning
confidence: 99%
“…(6) plays an important role in improving the readability of ASR results, because our method assumes that there are no parallel data to obtain transformation rules between raw and cleaned transcripts, unlike [10], and only uses a few heuristic rules for the transformation.…”
Section: Language Model For Clean Transcriptsmentioning
confidence: 99%
See 3 more Smart Citations