2022
DOI: 10.3390/s22145190
|View full text |Cite
|
Sign up to set email alerts
|

Event Collapse in Contrast Maximization Frameworks

Abstract: Contrast maximization (CMax) is a framework that provides state-of-the-art results on several event-based computer vision tasks, such as ego-motion or optical flow estimation. However, it may suffer from a problem called event collapse, which is an undesired solution where events are warped into too few pixels. As prior works have largely ignored the issue or proposed workarounds, it is imperative to analyze this phenomenon in detail. Our work demonstrates event collapse in its simplest form and proposes colla… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
38
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3
2

Relationship

4
4

Authors

Journals

citations
Cited by 27 publications
(38 citation statements)
references
References 51 publications
0
38
0
Order By: Relevance
“…This allowed us to focus on reformulating the image reconstruction part as a linear inverse problem and propose efficient, explainable solvers. The experiments so far used the flow estimated by methods designed for low degrees-of-freedom (DOF) motions [26], [60], [61], [62], which are often a good approximation and produce very accurate results if the true motion satisfies their assumptions [53], [63]. More complex scenes may require dense optical flow estimation, as given by [27], [28], [31] (i.e., higher DOFs).…”
Section: Combination With State-of-the-art Dense Optical Flow Methodsmentioning
confidence: 99%
“…This allowed us to focus on reformulating the image reconstruction part as a linear inverse problem and propose efficient, explainable solvers. The experiments so far used the flow estimated by methods designed for low degrees-of-freedom (DOF) motions [26], [60], [61], [62], which are often a good approximation and produce very accurate results if the true motion satisfies their assumptions [53], [63]. More complex scenes may require dense optical flow estimation, as given by [27], [28], [31] (i.e., higher DOFs).…”
Section: Combination With State-of-the-art Dense Optical Flow Methodsmentioning
confidence: 99%
“…Zhu et al [58] report that the contrast objective (variance) overfits to the events. This is in part because the warp (4) can describe very complex flow fields, which can push the events to accumulate in few pixels [42]. To mitigate overfitting, we reduce the complexity of the flow field by dividing the image plane into a tile of non-overlapping patches, defining a flow vector at the center of each patch and interpolating the flow on all other pixels (we show the tiles in Sec.…”
Section: Multi-reference Focus Objective Functionmentioning
confidence: 99%
“…The intuitive interpretation is to estimate the motion by recovering the sharp (motion-compensated) image of edge patterns that caused the events. Preliminary work on applying CM to estimate optical flow has reported a problem of overfitting to the data, producing undesired flows that warp events to few pixels or lines [58] (i.e., event collapse [42]). This issue has been tackled by changing the objective function, from contrast to the energy of an average timestamp image [21,34,58], but this loss is not straightforward to interpret and makes training difficult to converge [13].…”
Section: Introductionmentioning
confidence: 99%
“…The effectiveness of EVO is largely due to its mapping module, Event-based Multi-View Stereo (EMVS) [10], which enables 3D reconstruction without the need to recover image intensity, without having to explicitly solve for data association between events, and without the need of a GPU (it is fast on a standard CPU -e.g., speed of 1.20 Mev/s/core [10]). Additionally, EMVS admits an interpretation in terms of event refocusing or event alignment (contrast maximization) [16], which is the state of the art framework to tackle other vision problems [17]- [25]. Our goal is to extend EMVS to the multi-camera setting (i.e., two or more event cameras in a multi-view configuration sharing a common clock), and in particular to the stereo setting, in order to benefit from these advantages and connections (Fig.…”
Section: Monocularmentioning
confidence: 99%