In the field of medical endoscopy more and more surgeons are changing over to record and store videos of their endoscopic procedures for long-term archival. These endoscopic videos are a good source of information for explanations to patients and follow-up operations. As the endoscope is the "eye of the surgeon", the video shows the same information the surgeon has seen during the operation, and can describe the situation inside the patient much more precisely than an operation report would do. Recorded endoscopic videos can also be used for training young surgeons and in some countries the long-term archival of video recordings from endoscopic procedures is even enforced by law. A major challenge, however, is to efficiently access these very large video archives for later purposes. One problem, for example, is to locate specific images in the videos that show important situations, which are additionally captured as static images during the procedure. This work addresses this problem and focuses on contentbased video retrieval in data from laparoscopic surgery. We propose to use feature signatures, which can appropriately and concisely describe the content of laparoscopic images, and show that by using this content descriptor with an appropriate metric, we are able to efficiently perform content-based retrieval in laparoscopic videos. In a dataset with 600 captured static images from 33 hours recordings, we are able to find the correct video segment for more than 88% of these images.Feature signatures, as for instance investigated in the work of Beecks, 1 are a compact yet efficient way of describing the content of an image. They are obtained by clustering characteristic image features into a compact feature representation that adapts to individual image contents. In this paper, we propose to utilize feature signatures over a 7-dimensional feature space including position, color, and texture information for content-based retrieval in laparoscopic videos. To this end, we investigate different distance-based similarity measures including the Earth Mover's Distance, 2 the Signature Quadratic Form Distance, 3, 4 and the Signature Matching Distance. 5 Through a large evaluation with video data from 33 hours of laparoscopic surgery we show that the proposed signature-based approaches are able to efficiently perform content-based retrieval in laparoscopic videos.
Description of PurposeIn endoscopic surgery, operating surgeons usually record video segments and capture static images during the operation, which they often need after the procedure. 6 However, while the static images can be used for later discussions with patients and colleagues, and can be easily integrated into operation reports, they lack of a complete and precise information, such as movements of operation instruments and the dynamics of performed actions. The recorded videos, on the other hand, contain all these important details, since they show exactly the same images that were visible on the monitors in the operation room, which were also used by the...