2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2020
DOI: 10.1109/cvprw50498.2020.00176
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised Temporal Consistency Metric for Video Segmentation in Highly-Automated Driving

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
15
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
3
2

Relationship

2
6

Authors

Journals

citations
Cited by 33 publications
(15 citation statements)
references
References 30 publications
0
15
0
Order By: Relevance
“…Among online-capable metrics for the overall performance prediction, some only focus at malfunction detection and correction (again involving an ensemble of DNNs) [17], [64], or exploit temporal inconsistency between consecutive predictions [15], which has to be defined in a highly task-specific way. The closest prior work to ours is presumably from Löhdefink et al [18], who propose to train an autoencoder to reconstruct an image on the same data a semantic segmentation DNN is trained on, showing a correlation between both task's metrics.…”
Section: Performance Prediction Of Neural Networkmentioning
confidence: 99%
See 1 more Smart Citation
“…Among online-capable metrics for the overall performance prediction, some only focus at malfunction detection and correction (again involving an ensemble of DNNs) [17], [64], or exploit temporal inconsistency between consecutive predictions [15], which has to be defined in a highly task-specific way. The closest prior work to ours is presumably from Löhdefink et al [18], who propose to train an autoencoder to reconstruct an image on the same data a semantic segmentation DNN is trained on, showing a correlation between both task's metrics.…”
Section: Performance Prediction Of Neural Networkmentioning
confidence: 99%
“…one typically assumes that an offline-measured performance of a DNN is also valid in inference, this is actually not true due to the mentioned environment changes. Meanwhile, less-frequently proposed online-capable algorithms are either task-specific [15], rely on ensembles of DNNs [16], [17], or only show the correlation of a proposed metric to the absolute performance metric without further outlining an online-capable predictive scheme [15], [18]. Naively using the confidence scores of the network itself [19] is not recommended as DNNs often assign a probability of close to one to a single class [20], and even more important, the uncertainty of measurements does not bear predictive power to estimate the absolute DNN performance.…”
mentioning
confidence: 99%
“…However, misalignment of key frames with nearby frames might harm accuracy relative to the original image segmentation models. Another use of optical flow is in [26], where the authors introduce a flow-based consistency measure to evaluate, rather than directly improve, the quality of video semantic segmentation.…”
Section: Related Workmentioning
confidence: 99%
“…This, however, does not factor in motion. As such, researchers incorporate motion estimation (e.g., optical flow) when measuring temporal consistency [16,24,19,33]. However, estimating accurate flow on real-world data can be very challenging, and in many cases, more error-prone and time-consuming than the segmentation task itself.…”
Section: Related Workmentioning
confidence: 99%
“…This approach, however, does not factor in the object movements and changing occlusions. Most of the recent works [16,24,33] utilize motion-based pixel correspondence between two consecutive frames (i.e., optical flow [13]), to measure temporal consistency. More specifically, given two consecutive video frames, the segmentation of one frame is warped to the other based on the estimated flow, and the warped and actual segmentation maps are then compared to measure the segmentation consistency between these two frames.…”
Section: Introductionmentioning
confidence: 99%