2022
DOI: 10.1007/s10115-022-01684-7
|View full text |Cite
|
Sign up to set email alerts
|

Diagnostic captioning: a survey

Abstract: Diagnostic captioning (DC) concerns the automatic generation of a diagnostic text from a set of medical images of a patient collected during an examination. DC can assist inexperienced physicians, reducing clinical errors. It can also help experienced physicians produce diagnostic reports faster. Following the advances of deep learning, especially in generic image captioning, DC has recently attracted more attention, leading to several systems and datasets. This article is an extensive overview of DC. It prese… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
1
0
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(2 citation statements)
references
References 97 publications
(245 reference statements)
0
1
0
1
Order By: Relevance
“…Pathological captioning tasks are being studied recently to automatically generate diagnostic texts based on patient medical images, assist inexperienced doctors, and reduce clinical errors ( 46 ). The typical representative is still PathVQA ( 19 ).…”
Section: Related Studymentioning
confidence: 99%
“…Pathological captioning tasks are being studied recently to automatically generate diagnostic texts based on patient medical images, assist inexperienced doctors, and reduce clinical errors ( 46 ). The typical representative is still PathVQA ( 19 ).…”
Section: Related Studymentioning
confidence: 99%
“…A principal abordagem para gerar relatório médico automático a partir de imagem, usando técnicas de Deep Learning, é o Image Captioning, que em sua abordagem mais simples, consiste em uma rede receber a imagem como entrada e produzir uma palavra por vez assim formando o texto do relatório. Esta técnica é amplamente utilizada no contexto de imagens de raio-x de tórax [Pavlopoulos et al 2022]. As arquiteturas, em sua maioria, seguem o modelo Encoder-Decoder, usando Backbones e redes neurais recorrentes (RNN) [Shin et al 2016] , [Zhang et al 2017].…”
Section: Introduc ¸ãOunclassified