ViCoMo: visual context modeling for scene understanding in video surveillance

Creusen, IM Ivo; Javanbakhti, S Solmaz; Loomans, Mjh Marijn; Hazelhoff, Lykele; Roubtsova, NS Nadejda; Zinger, Svitlana

doi:10.1117/1.jei.22.4.041117

Cited by 6 publications

(2 citation statements)

References 38 publications

(45 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…a. Color: the HSV color space is used (Creusen et al, 2013a). b. Texture: a group of Gabor filters are applied.…”

Section: (A) Semantic Region Labelingmentioning

confidence: 99%

See 1 more Smart Citation

Adding Context Information to Video Analysis for Surveillance Applications

Javanbakhti¹,

Sanberg²,

Bao³

et al. 2016

Advances in Multimedia and Interactive Technologies

Self Cite

View full text Add to dashboard Cite

Smart surveillance systems become more meaningful if they both grow in reliability and robustness, while simultaneously offering a higher semantic level of understanding. To achieve a higher level of semantic scene understanding, the objects and their actions have to be interpreted in the given context, so that the extraction of contextual information is required. This chapter explores several techniques for extracting the contextual information such as spatial, motion, depth and co-occurrence, depending on applications. Afterwards, the chapter provides specific case studies to evaluate the usefulness of context information, based on: (1) region labeling of the surroundings of objects, (2) motion analysis of the water for moving ships, (3) traffic sign recognition for safety event evaluation and (4) the use of depth signals for obstacle detection. The chapter shows that the previous cases can be solved in an improved way with respect to robustness and semantic understanding. Case studies indicate up to 6.8% improvement of reliable correct object understanding and the novel possibility of labeling scene events as safe/unsafe depending on the object behavior and the detected surrounding context. In this chapter, it is shown that using contextual information improves automated video surveillance analysis, as it not only improves the reliability of moving object detection, but also enables scene understanding that is far beyond object understanding.

show abstract

“…a. Color: the HSV color space is used (Creusen et al, 2013a). b. Texture: a group of Gabor filters are applied.…”

Section: (A) Semantic Region Labelingmentioning

confidence: 99%

“…Screenshots from this scenario can be seen in Figure 14. These scenarios are described in more detail in (Creusen et al, 2013a). The presented framework is capable to recognize actions in traffic surveillance video.…”

Section: Table 3 Characteristics Of the Used Datasets Figure 12 Examentioning

confidence: 99%