2021
DOI: 10.48550/arxiv.2112.08001
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Autoencoder-based background reconstruction and foreground segmentation with background noise estimation

Abstract: Even after decades of research, dynamic scene background reconstruction and foreground object segmentation are still considered as open problems due various challenges such as illumination changes, camera movements, or background noise caused by air turbulence or moving trees. We propose in this paper to model the background of a video sequence as a low dimensional manifold using an autoencoder and to compare the reconstructed background provided by this autoencoder with the original image to compute the foreg… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 36 publications
(45 reference statements)
0
2
0
Order By: Relevance
“…It appears however that the interaction between these two models during training is a very challenging issue, because of the competition between them to reconstruct the image. We handle this problem by training the background model before the foreground model: We observe that the AE-NE model [40], which is dedicated to dynamic background reconstruction and segmentation, is trained without any foreground reconstruction module, and is able to perform an accurate background reconstruction and segmentation not only on videos, but also on frame sequences which are not organized as videos. We then use this model as a pre-trained separate module: This background model is first trained independently from the other parts of the model, and the weights of this background model are then frozen during the training of the foreground model which is described below.…”
Section: A Separate Pre-trained Model For Background Reconstructionmentioning
confidence: 99%
See 1 more Smart Citation
“…It appears however that the interaction between these two models during training is a very challenging issue, because of the competition between them to reconstruct the image. We handle this problem by training the background model before the foreground model: We observe that the AE-NE model [40], which is dedicated to dynamic background reconstruction and segmentation, is trained without any foreground reconstruction module, and is able to perform an accurate background reconstruction and segmentation not only on videos, but also on frame sequences which are not organized as videos. We then use this model as a pre-trained separate module: This background model is first trained independently from the other parts of the model, and the weights of this background model are then frozen during the training of the foreground model which is described below.…”
Section: A Separate Pre-trained Model For Background Reconstructionmentioning
confidence: 99%
“…3 on ObjectsRoom, 6 on ShapeStacks and 10 on CLEVRTEX). On CLEVR, which shows a fixed background, we reduce the number of background training iterations from 500 000 to 2500, as recommended in the AE-NE paper [40], and decrease the fixed background accuracy threshold τ since the background reconstruction is far more accurate when the background is fixed. We use isotropic scaling since all objects have similar widths and heights in these datasets.…”
Section: Quantitative Evaluation On Synthetic Benchmarksmentioning
confidence: 99%