“…According to the specific lesions, these systems can be classified to handle bleeding [30,31], tumors [32,33], Helicobacter pylori [34], cancer [35,36], Crohn's disease [37] and polyps [38]. Moreover, some other applications include pose detection for endoscopy [39], video segmentation [40] and three-dimensional reconstruction of the digestive wall [41]. For video summarization, as discussed above, most previous works mainly focus on the summarization of structured videos, which have well-defined temporal structures and characteristics for selecting key frames.…”