8th European Conference on Speech Communication and Technology (Eurospeech 2003) 2003
DOI: 10.21437/eurospeech.2003-458
|View full text |Cite
|
Sign up to set email alerts
|

Large vocabulary continuous speech recognition in greek: corpus and an automatic dictation system

Abstract: In this work, we present the creation of the first Greek Speech Corpus and the implementation of a Dictation System for workflow improvement in the field of journalism. The current work was implemented under the project called Logotypografia (Logos = logos, speech and Typografia = typography) sponsored by the General Secretariat of Research and Development of Greece. This paper presents the process of data collection (texts and recordings), waveform processing (transcriptions), creation of the acoustic and lan… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2007
2007
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(2 citation statements)
references
References 6 publications
0
2
0
Order By: Relevance
“…Initially the raw recordings are segmented into 30 second segments and the transcriptions are split into smaller segments of approximately 1000 words called documents. Each segment is decoded using a seed acoustic model trained on the Logotypografia corpus [66] and a 4gram biased LM trained on the corresponding transcription of each recording. The best path transcript of each segment is obtained and paired with the best matching document via TF-IDF similarity.…”
Section: A Collection and Curation Of Hparlmentioning
confidence: 99%
See 1 more Smart Citation
“…Initially the raw recordings are segmented into 30 second segments and the transcriptions are split into smaller segments of approximately 1000 words called documents. Each segment is decoded using a seed acoustic model trained on the Logotypografia corpus [66] and a 4gram biased LM trained on the corresponding transcription of each recording. The best path transcript of each segment is obtained and paired with the best matching document via TF-IDF similarity.…”
Section: A Collection and Curation Of Hparlmentioning
confidence: 99%
“…2) Logotypografia: Logotypografia [66] is one of the first corpora for Large Vocabulary Continuous Speech Recognition in Greek. The dataset contains 33, 136 newscast utterances, or 72 hours of speech.…”
Section: B Including Corpora From Different Domainsmentioning
confidence: 99%