2017
DOI: 10.48550/arxiv.1711.07274
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Speech recognition for medical conversations

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 9 publications
(10 citation statements)
references
References 9 publications
0
10
0
Order By: Relevance
“…Accurate meeting transcriptions are the one of the processing steps in a pipeline for several tasks like summarization, topic extraction, etc. Similarly, the same transcription system can be used in other domains such as healthcare [198]. Although this task was introduced by NIST in the Rich Transcription Evaluation series back in 2003 [180,188,199], the initial systems had very poor performance, and consequently commercialization of the technology was not possible.…”
Section: Meeting Transcriptionmentioning
confidence: 99%
“…Accurate meeting transcriptions are the one of the processing steps in a pipeline for several tasks like summarization, topic extraction, etc. Similarly, the same transcription system can be used in other domains such as healthcare [198]. Although this task was introduced by NIST in the Rich Transcription Evaluation series back in 2003 [180,188,199], the initial systems had very poor performance, and consequently commercialization of the technology was not possible.…”
Section: Meeting Transcriptionmentioning
confidence: 99%
“…Basically, SR-oriented and SS-oriented studies attempt to build automatic computer systems for interconversion between speech and text in the area of smart healthcare, making human-machine interaction as natural and flexible as human-human interaction [115]. For speech recognition, these efforts encompass the improvement in acoustic modelling [116], [117], language modelling [118], and the whole system pipeline [119], [120] to enhance recognition accuracy. For speech synthesis, recent advancements have been made in investigating and making synthesized speech natural [121], [122], intelligible [123]- [127] and expressive [128]- [130], which will help stimulate the enthusiasm of human-machine interaction [131].…”
Section: Nlp Approachmentioning
confidence: 99%
“…Importantly, we did not perform text normalization specific to each domain. In the domain of medical datasets, we use I2B2'14 (Stubbs and Uzuner, 2015), which consists of identified textual medical notes with PHI tagging, and the Audio Medical Conversations dataset from (Chiu et al, 2017), denoted AMC'17, which contains de-identified audio of doctor-patient conversations and their corresponding manual transcripts. Processing the AMC'17 conversations was facilitated by the fact that it is a de-identified dataset, which provides us with the locations of the PHI in the audio and the transcripts.…”
Section: Datasetsmentioning
confidence: 99%
“…Due to the rise of tele-medicine (Weinstein et al, 2014), clinical records consist of many other types of data, such as audio conversations (Chiu et al, 2017), scanned documents, video, and im-Figure 1: High level audio de-ID pipeline ages. In this work, we direct our attention towards the task of de-identifying clinical audio data.…”
Section: Introductionmentioning
confidence: 99%