2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2012
DOI: 10.1109/icassp.2012.6289034
|View full text |Cite
|
Sign up to set email alerts
|

Phrase-level transduction model with reordering for spoken to written language transformation

Abstract: This paper proposes a first-ever phrase-level transduction model with reordering to transform colloquial speech directly to written-style transcription. This model is capable of performing n-m transductions. Our transduction model is trained from a parallel corpus of verbatim transcription and written-style transcription. Deletions, substitutions, insertions are well represented using this model. Inversion transduction cases can also be identified and represented. We implement our transduction model using weig… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2013
2013
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 10 publications
0
1
0
Order By: Relevance
“…1. When appropriate modeling techniques (e.g., [88]) are used, this additional stage would also fit well in the optimization approach presented in this paper. Without including the prosodic processing/modeling stage, we simply treat the difference between what goes (as the ''noisy'' text input) into the downstream processing components in the SCIP system and the ''clean'' text input to the traditional MT, IR, or NLP systems as a combination of ASR errors and normal speech disfluency.…”
Section: A Architectural Overview Of Scip Systemsmentioning
confidence: 91%
“…1. When appropriate modeling techniques (e.g., [88]) are used, this additional stage would also fit well in the optimization approach presented in this paper. Without including the prosodic processing/modeling stage, we simply treat the difference between what goes (as the ''noisy'' text input) into the downstream processing components in the SCIP system and the ''clean'' text input to the traditional MT, IR, or NLP systems as a combination of ASR errors and normal speech disfluency.…”
Section: A Architectural Overview Of Scip Systemsmentioning
confidence: 91%