Background: Patients with a history of Helicobacter pylori-negative idiopathic bleeding ulcers have an increased risk of recurring ulcer complications. Aim: To build a machine learning model to identify patients at high risk for recurrent ulcer bleeding. Methods: Data from a retrospective cohort of 22 854 patients (training cohort) diagnosed with peptic ulcer disease in 2007-2016 were analysed to build a model (IPU-ML) to predict recurrent ulcer bleeding. We tested the IPU-ML in all patients with a diagnosis of gastrointestinal bleeding (n = 1265) in 2008-2015 from a different catchment population (independent validation cohort). Any co-morbid conditions which had occurred in >1% of study population were eligible as predictors. Results: Recurrent ulcer bleeding developed in 4772 patients (19.5%) in the training cohort, during a median follow-up period of 2.7 years. IPU-ML model built on six parameters (age, baseline haemoglobin, and presence of gastric ulcer, gastrointestinal diseases, malignancies, and infections) identified patients with bleeding recurrence within 1 year with an area under the receiver operating characteristic curve (AUROC) of 0.648. When we set the IPU-ML cutoff value at 0.20, 27.5% of patients were classified as high risk for rebleeding with a sensitivity of 41.4%, specificity of 74.6%, and a negative predictive value of 91.1%. In the validation cohort, the IPU-ML identified patients with a recurrence ulcer bleeding within 1 year with an AUROC of 0.775, and 84.3% of overall accuracy.
Conclusion:We developed a machine-learning model to identify those patients with a history of idiopathic gastroduodenal ulcer bleeding who are not at high risk for recurrent ulcer bleeding.
This paper explores the bottleneck of feature representations of deep neural networks (DNNs), from the perspective of the complexity of interactions between input variables encoded in DNNs. To this end, we focus on the multi-order interaction between input variables, where the order represents the complexity of interactions. We discover that a DNN is more likely to encode both too simple interactions and too complex interactions, but usually fails to learn interactions of intermediate complexity. Such a phenomenon is widely shared by different DNNs for different tasks. This phenomenon indicates a cognition gap between DNNs and human beings, and we call it a representation bottleneck. We theoretically prove the underlying reason for the representation bottleneck. Furthermore, we propose a loss to encourage/penalize the learning of interactions of specific complexities, and analyze the representation capacities of interactions of different complexities.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.