2013
DOI: 10.1007/978-3-642-41190-8_26

Dealing with Bilingualism in Automatic Transcription of Historical Archive of Czech Radio

Abstract: One of the biggest challenges in the automatic transcription of the historical audio archive of Czech and Czechoslovak radio is bilingualism. Two closely related languages, Czech and Slovak, are mixed in many archive documents. Both were the official languages in former Czechoslovakia (1918-1992) and both were used in media. The two languages are considered similar, although they differ in more than 75 % of their lexical inventories, which complicates automatic speech-to-text conversion. In this paper, we pres…

Cited by 3 publications (1 citation statement)
References 12 publications
“…3. The utterances that were classified as Slovak by the language identification module [11] were removed. 4.…”
Section: Unsupervised LM Adaptation
Citation type: mentioning
confidence: 99%