2023
DOI: 10.21203/rs.3.rs-3307492/v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

FFA-GPT: an Interactive Visual Question Answering System for Fundus Fluorescein Angiography

Danli Shi,
Xiaolan Chen,
Weiyi Zhang
et al.

Abstract: Background: While large language models (LLMs) have demonstrated impressive capabilities in question-answering (QA) tasks, their utilization in analyzing ocular imaging data remains limited. We aim to develop an interactive system that harnesses LLMs for report generation and visual question answering in the context of fundus fluorescein angiography (FFA).Methods: Our system comprises two components: an image-text alignment module for report generation and a GPT-based module (Llama 2) for interactive QA. To co… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 25 publications
0
2
0
Order By: Relevance
“…This can be achieved by facilitating access to eye care and supporting clinical decision-making through an objective, data-driven approach. While several studies have explored the use of FFA images for automatic report generation [8,10,22], it has been reported only 52% of the 204 reported ophthalmic databases are available online. [23] Furthermore, there are no publicly available large angiographic datasets for research purposes.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…This can be achieved by facilitating access to eye care and supporting clinical decision-making through an objective, data-driven approach. While several studies have explored the use of FFA images for automatic report generation [8,10,22], it has been reported only 52% of the 204 reported ophthalmic databases are available online. [23] Furthermore, there are no publicly available large angiographic datasets for research purposes.…”
Section: Discussionmentioning
confidence: 99%
“…[8] Chen et al combined a vision transformer with a large language model for FFA report generation and question answering. [10] However, while medical report generation requires substantial and interpretable real-world data for training [11], there is a lack of publicly available FFA and ICGA datasets for report generation. [12] To address these challenges, we have curated a large angiographic dataset that includes both FFA and ICGA images with interpretable labels and proposed baseline methods for angiographic report generation.…”
Section: Introductionmentioning
confidence: 99%