Interspeech 2018 2018
DOI: 10.21437/interspeech.2018-1447
|View full text |Cite
|
Sign up to set email alerts
|

Iterative Learning of Speech Recognition Models for Air Traffic Control

Abstract: Automatic Speech Recognition (ASR) has recently proved to be a useful tool to reduce the workload of air traffic controllers leading to significant gains in operational efficiency. Air Traffic Control (ATC) systems in operation rooms around the world generate large amounts of untranscribed speech and radar data each day, which can be utilized to build and improve ASR models. In this paper, we propose an iterative approach that utilizes increasing amounts of untranscribed data to incrementally build the necessa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
9
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 10 publications
(9 citation statements)
references
References 14 publications
0
9
0
Order By: Relevance
“…[11,12] -that initial system quality directly affects subsequent SST success, we here hypothesize that optimally-performed SST may specifically be much more sensitive to the quality of the LM than the AM or system as a whole: for a sufficient LM may be able to counteract the weaknesses of a poor AM (hence poor frame-level predictions), allowing for SST to make gains in situations otherwise lost to error-propagation (c.f. [13]). By establishing our own semi-supervised pipeline and comparing how varyingquality seed AMs, varying-quality LMs, and, importantly, combinations of such AMs and LMs, affect final Word Error Rate (WER) achieved, we explore how seed acoustic model (AM) quality and, independently, language model (LM) quality impact upon SST success.…”
Section: Introductionmentioning
confidence: 99%
“…[11,12] -that initial system quality directly affects subsequent SST success, we here hypothesize that optimally-performed SST may specifically be much more sensitive to the quality of the LM than the AM or system as a whole: for a sufficient LM may be able to counteract the weaknesses of a poor AM (hence poor frame-level predictions), allowing for SST to make gains in situations otherwise lost to error-propagation (c.f. [13]). By establishing our own semi-supervised pipeline and comparing how varyingquality seed AMs, varying-quality LMs, and, importantly, combinations of such AMs and LMs, affect final Word Error Rate (WER) achieved, we explore how seed acoustic model (AM) quality and, independently, language model (LM) quality impact upon SST success.…”
Section: Introductionmentioning
confidence: 99%
“…Srinivasamurthy et al studied the ASR approach [52] in the ATC domain, in which the extracted prior knowledge is applied to improve the final performance. A semi-supervised approach was developed to achieve the iterative training of the ASR model [53].…”
Section: B Asr In Atcmentioning
confidence: 99%
“…New systems like controller-pilot data link communications (CPDLC), which use text based communication, reduce the load on the voice communication channels [1]. Projects like AcListant and MALORCA 1 aim to support the ATCO by speech recognition systems [2,3]. The problem with developing such systems, is the lack of training…”
Section: Introductionmentioning
confidence: 99%