2019
DOI: 10.48550/arxiv.1904.02633
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Clinically Accurate Chest X-Ray Report Generation

Guanxiong Liu,
Tzu-Ming Harry Hsu,
Matthew McDermott
et al.

Abstract: The automatic generation of radiology reports given medical radiographs has significant potential to operationally and improve clinical patient care. A number of prior works have focused on this problem, employing advanced methods from computer vision and natural language generation to produce readable reports. However, these works often fail to account for the particular nuances of the radiology domain, and, in particular, the critical importance of clinical accuracy in the resulting generated reports. In thi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
32
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(32 citation statements)
references
References 30 publications
0
32
0
Order By: Relevance
“…The dataset contains frontal (anteroposterio (AP), posteroanterior (PA)), and lateral view images. Some previous work makes no distinction between view positions [7,23], but this might lead to a miss-match between some findings between an image and report pair. For example, cardiomegaly can only be accurately assessed in the PA view, while the AP view will exaggerate the heart silhouette due to magnification [28].…”
Section: Datasetsmentioning
confidence: 99%
See 2 more Smart Citations
“…The dataset contains frontal (anteroposterio (AP), posteroanterior (PA)), and lateral view images. Some previous work makes no distinction between view positions [7,23], but this might lead to a miss-match between some findings between an image and report pair. For example, cardiomegaly can only be accurately assessed in the PA view, while the AP view will exaggerate the heart silhouette due to magnification [28].…”
Section: Datasetsmentioning
confidence: 99%
“…Tienet [36] is a pioneering CNN-RNN model with image-text attention mechanism for VLU (e.g., thorax disease classification) and VLG (e.g., report generation) tasks by using ChestXray14 [35] dataset. Liu et al [23] only focuses on the VLG task to generate the radiology report utilizing a CNN-RNN-RNN architecture with a hierarchical generation strategy from MIMIC-CXR [17] and Open-I dataset [11]. Hsu et al [14] focuses on a VLU task, specifically medical image-report retrieval in the MIMIC-CXR dataset, based on supervised and unsupervised methods.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In addition, studies on medical image captioning also have been active [8][9][10]. Baoyu et al [11] introduced a co-attention (CoATT) mechanism for localizing sub-regions in the In addition, the CEDT uses all levels of information of the encoders in order not to lose features extracted from the GLVE when decoding phrases.…”
Section: Related Workmentioning
confidence: 99%
“…In addition, studies on medical image captioning also have been active [8][9][10]. Baoyu et al [11] introduced a co-attention (CoATT) mechanism for localizing sub-regions in the image and generating the corresponding descriptions, as well as a hierarchical LSTM model to generate long paragraphs.…”
Section: Related Workmentioning
confidence: 99%