2021
DOI: 10.3390/make3020015
|View full text |Cite
|
Sign up to set email alerts
|

Templated Text Synthesis for Expert-Guided Multi-Label Extraction from Radiology Reports

Abstract: Training medical image analysis models traditionally requires large amounts of expertly annotated imaging data which is time-consuming and expensive to obtain. One solution is to automatically extract scan-level labels from radiology reports. Previously, we showed that, by extending BERT with a per-label attention mechanism, we can train a single model to perform automatic extraction of many labels in parallel. However, if we rely on pure data-driven learning, the model sometimes fails to learn critical featur… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 8 publications
(6 citation statements)
references
References 30 publications
0
6
0
Order By: Relevance
“…There is potential for exploration with other types of machine learning models that may perform better across different data sources. Schrempf et al ( 18 ) compared the EdIE-R and ALARM+ approaches on their dataset and found similar findings. However, the reason for the differences could relate to other variances in the training data, such as underlying population characteristics.…”
Section: Discussionmentioning
confidence: 71%
See 2 more Smart Citations
“…There is potential for exploration with other types of machine learning models that may perform better across different data sources. Schrempf et al ( 18 ) compared the EdIE-R and ALARM+ approaches on their dataset and found similar findings. However, the reason for the differences could relate to other variances in the training data, such as underlying population characteristics.…”
Section: Discussionmentioning
confidence: 71%
“…Chapman et al ( 23 ), who developed ConTEXT, which looks for contextual features (negation, temporality, or who has experienced the condition, e.g., patient, family, and member), have shown how this may assist annotators when labelling by identifying these uncertain conditions to support classification. Other work, such as that by the ALARM+ authors, considered how a template method could improve understanding of uncertain terminology ( 18 ). They defined terminology that should be used to map to uncertain, positive, and negative entities, and this vocabulary was gathered throughout the annotation.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…However, performance of these models is often lower than what would be required clinically without additional feature engineering 13,15 or fine-tuning on thousands of manually-derived labels 14,16 specific to the task. This likely reflects the fact that medical report text has specific structure and meaning while comprising only a small proportion of the general language used to train these models.…”
Section: Many Published Methods For Extraction Of Multiple Values Use...mentioning
confidence: 99%
“…Machine learning has been used on clinical reports but not specifically for extraction of concepts from echocardiogram reports. Many have used various implementations of BERT (Bidirectional Encoder Representations from Transformers), an early large language model (LLM), to extract radiographic clinical findings 13 , mentions of devices 14 , study characteristics 15 , and result keywords 16 from radiology or pathology reports.…”
Section: Introductionmentioning
confidence: 99%