2021
DOI: 10.1007/978-3-030-69541-5_36
|View full text |Cite
|
Sign up to set email alerts
|

Hierarchical X-Ray Report Generation via Pathology Tags and Multi Head Attention

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
19
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 18 publications
(19 citation statements)
references
References 24 publications
0
19
0
Order By: Relevance
“…Fueled by recent progresses in the closely related computer vision problem of image-based captioning (Vinyals et al, 2015;Tran et al, 2020), there have been a number of research efforts in medical report generation in recent years (Jing et al, 2018(Jing et al, , 2019Li et al, 2018Xue et al, 2018;Yuan et al, 2019;Wang et al, 2018;Lovelace and Mortazavi, 2020;Srinivasan et al, 2020). These methods often perform reasonably well in addressing the language fluency aspect; on the other hand, as is also evidenced in our empirical evaluation, their results are notably less satisfactory in terms of clinical accuracy.…”
Section: Introductionmentioning
confidence: 76%
“…Fueled by recent progresses in the closely related computer vision problem of image-based captioning (Vinyals et al, 2015;Tran et al, 2020), there have been a number of research efforts in medical report generation in recent years (Jing et al, 2018(Jing et al, , 2019Li et al, 2018Xue et al, 2018;Yuan et al, 2019;Wang et al, 2018;Lovelace and Mortazavi, 2020;Srinivasan et al, 2020). These methods often perform reasonably well in addressing the language fluency aspect; on the other hand, as is also evidenced in our empirical evaluation, their results are notably less satisfactory in terms of clinical accuracy.…”
Section: Introductionmentioning
confidence: 76%
“…Dataset bias is a common problem in medical report generation as there are far more sentences describing normalities than abnormalities. To mitigate this bias, Srinivasan [348] propose a hierarchical classification approach using a transformer as a decoder. Specifically, the transformer decoder leverage attention between and across features obtained from reports, images, and tags for effective report generation.…”
Section: Dataset Biasmentioning
confidence: 99%
“…Apart from some familiar topics such as disease detection (Oh et al, 2020;Luo et al, 2020;Lu et al, 2020b;Rajpurkar et al, 2017;Lu et al, 2020a;Ranjan et al, 2018) and lung segmentation (Eslami et al, 2020), the most related computer vision task is the emerging topic of image-based captioning, which aims at generating realistic sentences or topic-related paragraphs to summarize visual contents from images or videos (Vinyals et al, 2015;Xu et al, 2015;Goyal et al, 2017;Rennie et al, 2017;Huang et al, 2019;Feng et al, 2019;Pei et al, 2019;Tran et al, 2020). Not surprisingly, the recent progresses in medical report generation (Jing et al, 2018(Jing et al, , 2019Li et al, 2018Xue et al, 2018;Yuan et al, 2019;Wang et al, 2018;Lovelace and Mortazavi, 2020;Srinivasan et al, 2020;Zhang et al, 2020;Huang et al, 2021;Gasimova et al, 2020;Singh et al, 2019;Nishino et al, 2020) have been particularly influenced by the successes in image-based captioning. The work of (Vinyals et al, 2015;Xu et al, 2015) is among the early approaches in medical report generation, where visual features are extracted by convolution neural networks (CNNs); they are subsequently fed into recurrent neural networks (RNNs) to generate textual descriptions.…”
Section: Image-based Captioning and Medical Report Generationmentioning
confidence: 99%
“…It has been noted by (Jing et al, 2018(Jing et al, , 2019Li et al, 2018) that traditional RNNs are not well suited in generating long sentences and paragraphs (Vaswani et al, 2017;Krause et al, 2017), which renders them insufficient in medical report generation task (Jing et al, 2018). This issue is relieved by either conceiving hierarchical RNN architectures (Krause et al, 2017) (Jing et al, 2018(Jing et al, , 2019Li et al, 2018;Xue et al, 2018;Yuan et al, 2019;Wang et al, 2018;, or resorting to alternative techniques including in particular the recently developed transformer architectures (Vaswani et al, 2017) (Srinivasan et al, 2020;Lovelace and Mortazavi, 2020).…”
Section: Image-based Captioning and Medical Report Generationmentioning
confidence: 99%
See 1 more Smart Citation