2018
DOI: 10.1007/s10278-018-0085-8
|View full text |Cite
|
Sign up to set email alerts
|

Retrospective Analysis of Clinical Performance of an Estonian Speech Recognition System for Radiology: Effects of Different Acoustic and Language Models

Abstract: The aim of this study was to analyze retrospectively the influence of different acoustic and language models in order to determine the most important effects to the clinical performance of an Estonian language-based non-commercial radiology-oriented automatic speech recognition (ASR) system. An ASR system was developed for Estonian language in radiology domain by utilizing open-source software components (Kaldi toolkit, Thrax). The ASR system was trained with the real radiology text reports and dictations coll… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(5 citation statements)
references
References 16 publications
0
5
0
Order By: Relevance
“…Although neural networks have been used for statistical modeling for decades, in recent years they have become increasingly utilized in the field of speech technology due to development of powerful GPUs. Using more computational power makes it possible to train DNN-based acoustic models on a very large scale, which has led to lower word error rate in comparison to GMMs even with small languages such as Estonian [16]. Yet, deep learning models are still rarely used for automatic voice quality detection in today's phonetic research.…”
Section: Introductionmentioning
confidence: 99%
“…Although neural networks have been used for statistical modeling for decades, in recent years they have become increasingly utilized in the field of speech technology due to development of powerful GPUs. Using more computational power makes it possible to train DNN-based acoustic models on a very large scale, which has led to lower word error rate in comparison to GMMs even with small languages such as Estonian [16]. Yet, deep learning models are still rarely used for automatic voice quality detection in today's phonetic research.…”
Section: Introductionmentioning
confidence: 99%
“…As for the ASR and post-processing part, we were inspired by the development of the Estonian ASR for Radiology [3], since Estonian is a similar-size language having comparable amount of language resources available, but again -RUTA:MED adds the dual workflow and interactive editing support.…”
Section: Related Workmentioning
confidence: 99%
“…Words that were not classified automatically need to be manually classified and transcribed. Words that occur at least 1,000 times in the text corpus (almost 14k words) 4 ,…”
Section: Pronunciation Lexiconmentioning
confidence: 99%
“…ASR systems trained on general-purpose speech and text corpora, however, are not applicable 3 for the very specific language of medical reports. Domain-adapted language model and pronunciation lexicon (both derived from a text corpus of written medical reports) give the most significant boost in ASR accuracy, while a domain-adapted acoustic model (derived from a speech corpus of dictated medical reports) makes a considerable impact as well [4]. The work presented in this paper is a part of an ongoing collaborative project between a language technology research group and the largest hospital in Latvia on Latvian ASR for medical applications.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation