MedFilter: Improving Extraction of Task-relevant Utterances through Integration of Discourse Structure and Ontological Knowledge

Khosla, Sopan; Vashishth, Shikhar; Lehman, Jill Fain; Rosé, Carolyn Penstein

doi:10.18653/v1/2020.emnlp-main.626

Cited by 8 publications

(12 citation statements)

References 33 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…After screening the titles and abstracts of these articles, we assessed 144 full-text articles for eligibility. We included 20 articles [19][20][21][22][23][24][25][26][27][28][29][30][31][32][33][34][35][36][37][38] for our analysis (Fig. 1 and Supplementary Table 2).…”

Section: Study Selectionmentioning

confidence: 99%

“…1 and Supplementary Table 2). Of these, ten were conference proceedings [19][20][21]23,27,28,32,38 , seven were workshop proceedings 22,26,29,[34][35][36][37] , two were journal articles 24,25 , and three were Arxiv preprints 30,31,33 .…”

Section: Study Selectionmentioning

confidence: 99%

“…Three studies focused on improving the ASR for clinical conversations as the first step towards accurately extracting information from them 19,21,36 . Eleven studies chose to manually transcribe the conversations and performed NLP tasks on the transcripts 20,22,24,25,27,[30][31][32]34,35,40 . Five studies used input data representative of the input of an implemented digital scribe (ASR transcripts or chat dialogs) 26,28,33,37,38 .…”

Section: Setting and Research Phasementioning

confidence: 99%

“…Settings differed greatly between studies, as most did not define a specific specialty 19,[21][22][23]26,[31][32][33][34][35][36]38 , while others were focused on primary care 20,25,27 , home hemodialysis 24 , orthopedic encounters 37 , cardiology, family medicine, internal medicine 31 , and patient-clinician dialogs via a telemedicine platform 28 . Fifteen studies were performed by or in collaboration with a company [19][20][21]23,[25][26][27][28]30,[33][34][35][36][37] .…”

Section: Setting and Research Phasementioning

confidence: 99%

“…The NLP tasks that were performed could be split into three categories: entity extraction 20,[25][26][27]30,32,35,38 , classification 22,24,[30][31][32][33][34][35] , and summarization 22,24,28,29,31,37 (see Fig. 2 and Supplementary Table 4).…”

Section: Natural Language Processing (Nlp) Tasks and Modelsmentioning

confidence: 99%

See 4 more Smart Citations

The digital scribe in clinical practice: a scoping review and research agenda

et al. 2021

View full text Add to dashboard Cite

The number of clinician burnouts is increasing and has been linked to a high administrative burden. Automatic speech recognition (ASR) and natural language processing (NLP) techniques may address this issue by creating the possibility of automating clinical documentation with a “digital scribe”. We reviewed the current status of the digital scribe in development towards clinical practice and present a scope for future research. We performed a literature search of four scientific databases (Medline, Web of Science, ACL, and Arxiv) and requested several companies that offer digital scribes to provide performance data. We included articles that described the use of models on clinical conversational data, either automatically or manually transcribed, to automate clinical documentation. Of 20 included articles, three described ASR models for clinical conversations. The other 17 articles presented models for entity extraction, classification, or summarization of clinical conversations. Two studies examined the system’s clinical validity and usability, while the other 18 studies only assessed their model’s technical validity on the specific NLP task. One company provided performance data. The most promising models use context-sensitive word embeddings in combination with attention-based neural networks. However, the studies on digital scribes only focus on technical validity, while companies offering digital scribes do not publish information on any of the research phases. Future research should focus on more extensive reporting, iteratively studying technical validity and clinical validity and usability, and investigating the clinical utility of digital scribes.

show abstract

Section: Study Selectionmentioning

confidence: 99%

Section: Study Selectionmentioning

confidence: 99%

Section: Setting and Research Phasementioning

confidence: 99%

Section: Setting and Research Phasementioning

confidence: 99%

Section: Natural Language Processing (Nlp) Tasks and Modelsmentioning

confidence: 99%

See 3 more Smart Citations

The digital scribe in clinical practice: a scoping review and research agenda

et al. 2021

View full text Add to dashboard Cite

show abstract

Data Generation Strategies for Enhanced Atrial Fibrillation Prediction in Pacemaker Patients

Ngom,

Ba,

Sarr

et al. 2023

2023 First International Conference on the Advancements of Artificial Intelligence in African Context (AAIAC)

View full text Add to dashboard Cite

UMLS-KGI-BERT: Data-Centric Knowledge Integration in Transformers for Biomedical Entity Recognition

Mannion

Chevalier

Schwab

et al. 2023

Proceedings of the 5th Clinical Natural Language Processing Workshop

View full text Add to dashboard Cite

Pre-trained transformer language models (LMs) have in recent years become the dominant paradigm in applied NLP. These models have achieved state-of-the-art performance on tasks such as information extraction, question answering, sentiment analysis, document classification and many others. In the biomedical domain, significant progress has been made in adapting this paradigm to NLP tasks that require the integration of domain-specific knowledge as well as statistical modelling of language. In particular, research in this area has focused on the question of how best to construct LMs that take into account not only the patterns of token distribution in medical text, but also the wealth of structured information contained in terminology resources such as the UMLS. This work contributes a data-centric paradigm for enriching the language representations of biomedical transformer-encoder LMs by extracting text sequences from the UMLS. This allows for graph-based learning objectives to be combined with masked-language pre-training. Preliminary results from experiments in the extension of pre-trained LMs as well as training from scratch show that this framework improves downstream performance on multiple biomedical and clinical Named Entity Recognition (NER) tasks. All pre-trained models, data processing pipelines and evaluation scripts will be made publicly available.

show abstract

MedFilter: Improving Extraction of Task-relevant Utterances through Integration of Discourse Structure and Ontological Knowledge

Cited by 8 publications

References 33 publications

The digital scribe in clinical practice: a scoping review and research agenda

The digital scribe in clinical practice: a scoping review and research agenda

Data Generation Strategies for Enhanced Atrial Fibrillation Prediction in Pacemaker Patients

UMLS-KGI-BERT: Data-Centric Knowledge Integration in Transformers for Biomedical Entity Recognition

Contact Info

Product

Resources

About