Proceedings of the 27th ACM International Conference on Multimedia 2019
DOI: 10.1145/3343031.3356066
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Disease Detection and Report Generation for Gastrointestinal Tract Examination

Abstract: In this paper, we present a method to automatically identify diseases from videos of gastrointestinal (GI) tract examinations using a Deep Convolutional Neural Network (DCNN) that processes images from digital endoscopes. Our goal is to aid domain experts by automatically detecting abnormalities and generating a report that summarizes the main findings. We have implemented a model that uses two different DCNN architectures to generate our predictions, which are also capable of running on a mobile device 1 . Us… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
13
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
1
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(13 citation statements)
references
References 11 publications
0
13
0
Order By: Relevance
“…We also identify a group of papers [46,48,74,92,95] following a Template based approach. The language component in these works operates programmatically by following if-then rules or other heuristics in order to retrieve, fill and/or combine templates from a database in order to generate a report.…”
Section: Language Componentmentioning
confidence: 99%
See 1 more Smart Citation
“…We also identify a group of papers [46,48,74,92,95] following a Template based approach. The language component in these works operates programmatically by following if-then rules or other heuristics in order to retrieve, fill and/or combine templates from a database in order to generate a report.…”
Section: Language Componentmentioning
confidence: 99%
“…The visual component typically outputs discrete classification labels that the language component processes programmatically. In the case of Harzig et al 2019b [48], image localizations per class are also recovered using CAM [157], and in the case of Han et al [46] the visual component outputs an image segmentation. In both cases the language component includes special localization-based rules or templates, thus incorporating location information in the generated report.…”
Section: Language Componentmentioning
confidence: 99%
“…Both sides of the pylorus must be examined to detect abnormalities like ulcer or erosion [ 5 , 7 ]. With screening GI tract, the physician can ensure that the pylorus can control the motion of food by condensing muscles [ 8 ].…”
Section: Introductionmentioning
confidence: 99%
“…Automatic detection of anatomical landmark has been considered in many previous studies [ 4 , 8 , 10 ].…”
Section: Introductionmentioning
confidence: 99%
“…The dataset was later extended to 8,000 images. Using Kvasir, researchers all over the world have started developing different ML models and AI systems for GI endoscopy [23][24][25][26][27][28][29][30][31][32][33][34][35][36][37][38] . Moreover, the Kvasir dataset has been used to organize international competitions, i.e., the Medico Task at MediaEval in 2017 39 and 2018 40 and the ACM Multimedia 2019 BioMedia Grand Challenge 41 .…”
mentioning
confidence: 99%