INTRODUCTION:
Patients with atrophic gastritis (AG) or gastric intestinal metaplasia (GIM) have elevated risk of gastric adenocarcinoma. Endoscopic screening and surveillance have been implemented in high incidence countries. The study aimed to evaluate the accuracy of a deep convolutional neural network (CNN) for simultaneous recognition of AG and GIM.
METHODS:
Archived endoscopic white light images with corresponding gastric biopsies were collected from 14 hospitals located in different regions of China. Corresponding images by anatomic sites containing AG, GIM, and chronic non-AG were categorized using pathology reports. The participants were randomly assigned (8:1:1) to the training cohort for developing the CNN model (TResNet), the validation cohort for fine-tuning, and the test cohort for evaluating the diagnostic accuracy. The area under the curve (AUC), sensitivity, specificity, and accuracy with 95% confidence interval (CI) were calculated.
RESULTS:
A total of 7,037 endoscopic images from 2,741 participants were used to develop the CNN for recognition of AG and/or GIM. The AUC for recognizing AG was 0.98 (95% CI 0.97–0.99) with sensitivity, specificity, and accuracy of 96.2% (95% CI 94.2%–97.6%), 96.4% (95% CI 94.8%–97.9%), and 96.4% (95% CI 94.4%–97.8%), respectively. The AUC for recognizing GIM was 0.99 (95% CI 0.98–1.00) with sensitivity, specificity, and accuracy of 97.9% (95% CI 96.2%–98.9%), 97.5% (95% CI 95.8%–98.6%), and 97.6% (95% CI 95.8%–98.6%), respectively.
DISCUSSION:
CNN using endoscopic white light images achieved high diagnostic accuracy in recognizing AG and GIM.
Esophagogastroduodenoscopy (EGD) has been widely applied for gastrointestinal (GI) examinations. However, there is a lack of mature technology to evaluate the quality of the EGD inspection process. In this Letter, the authors design a multi-task anatomy detection convolutional neural network (MT-AD-CNN) to evaluate the EGD inspection quality by combining the detection task of the upper digestive tract with ten anatomical structures and the classification task of informative video frames. The authors' model is able to eliminate non-informative frames of the gastroscopic videos and detect the anatomies in real time. Specifically, a sub-branch is added to the detection network to classify NBI images, informative and non-informative images. By doing so, the detected box will be only displayed on the informative frames, which can reduce the false-positive rate. They can determine the video frames on which each anatomical location is effectively examined, so that they can analyse the diagnosis quality. Their method reaches the performance of 93.74% mean average precision for the detection task and 98.77% accuracy for the classification task. Their model can reflect the detailed circumstance of the gastroscopy examination process, which shows application potential in improving the quality of examinations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.