Pixel-wise Ground Truth Annotation in Videos - An Semi-automatic Approach for Pixel-wise and Semantic Object Annotation

Schöning, Julius; Faion, Patrick; Heidemann, Gunther

doi:10.5220/0005823306900697

Cited by 11 publications

(3 citation statements)

References 13 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Nowhere is this more problematic than in semantic segmentation applications where every pixel needs to be annotated accurately. There are many useful tools available to semi-automate the process as reviewed by [20], many of which take advantage of algorithmic approaches such as ORB features [55], polygon morphing [63], semi-automatic Area of Interest (AOI) fitting [55] and all of the above [63].…”

Section: Dataset Annotation and Augmentationmentioning

confidence: 99%

“…Today, the traditional techniques are used when the problem can be simplified so that they can be deployed on low cost microcontrollers or to limit the problem for deep learning techniques by highlighting certain features in data, augmenting data [19] or aiding in dataset annotation [20]. We will discuss later in this paper how many image transformation techniques can be used to improve your neural net training.…”

Section: Advantages Of Traditional Computer Vision Techniquesmentioning

confidence: 99%

See 1 more Smart Citation

Deep Learning vs. Traditional Computer Vision

Mahony

Campbell

Carvalho

et al. 2019

Advances in Intelligent Systems and Computing

704

383

View full text Add to dashboard Cite

Deep Learning has pushed the limits of what was possible in the domain of Digital Image Processing. However, that is not to say that the traditional computer vision techniques which had been undergoing progressive development in years prior to the rise of DL have become obsolete. This paper will analyse the benefits and drawbacks of each approach. The aim of this paper is to promote a discussion on whether knowledge of classical computer vision techniques should be maintained. The paper will also explore how the two sides of computer vision can be combined. Several recent hybrid methodologies are reviewed which have demonstrated the ability to improve computer vision performance and to tackle problems not suited to Deep Learning. For example, combining traditional computer vision techniques with Deep Learning has been popular in emerging domains such as Panoramic Vision and 3D vision for which Deep Learning models have not yet been fully optimised.

show abstract

Section: Dataset Annotation and Augmentationmentioning

confidence: 99%

Section: Advantages Of Traditional Computer Vision Techniquesmentioning

confidence: 99%

Deep Learning vs. Traditional Computer Vision

Mahony

Campbell

Carvalho

et al. 2019

Advances in Intelligent Systems and Computing

704

383

View full text Add to dashboard Cite

show abstract

“…As a solution Brône et al [2011] proposed a "training-by-looking-at"-step to be done prior to the experiments. Semi-automated approaches, such as [Kurzhals et al 2017;Pontillo et al 2010;Schöning et al 2016], on the other hand, do not require any training data.…”

Section: Computer Vision and Data Science Approachesmentioning

confidence: 99%

Neural Networks for Semantic Gaze Analysis in XR Settings

Stubbemann

Dürrschnabel

Refflinghaus

2021

ACM Symposium on Eye Tracking Research and Applications

View full text Add to dashboard Cite

Fig. 1. Experimental design research (right) is one example among many, where augmented and virtual reality technologies are used in combination with eye-tracking. This work addresses how semantic gaze analysis in real-world and virtual-reality settings can be done using convolutional neural networks (left). ©Lena Stubbemann Virtual-reality (VR) and augmented-reality (AR) technology is increasingly combined with eye-tracking. This combination broadens both fields and opens up new areas of application, in which visual perception and related cognitive processes can be studied in interactive but still well controlled settings. However, performing a semantic gaze analysis of eye-tracking data from interactive three-dimensional scenes is a resource-intense task, which so far has been an obstacle to economic use. In this paper we present a novel approach which minimizes time and information necessary to annotate volumes of interest (VOIs) by using techniques from object recognition. To do so, we train convolutional neural networks (CNNs) on synthetic data sets derived from virtual models using image augmentation techniques. We evaluate our method in real and virtual environments, showing that the method can compete with state-of-the-art approaches, while not relying on additional markers or preexisting databases but instead offering cross-platform use.

show abstract

Visual Video Analytics for Interactive Video Content Analysis

Schöning

Heidemann

2018

Advances in Intelligent Systems and Computing

View full text Add to dashboard Cite

Pixel-wise Ground Truth Annotation in Videos - An Semi-automatic Approach for Pixel-wise and Semantic Object Annotation

Cited by 11 publications

References 13 publications

Deep Learning vs. Traditional Computer Vision

Deep Learning vs. Traditional Computer Vision

Neural Networks for Semantic Gaze Analysis in XR Settings

Visual Video Analytics for Interactive Video Content Analysis

Contact Info

Product

Resources

About