2022
DOI: 10.1109/tip.2022.3162962
|View full text |Cite
|
Sign up to set email alerts
|

Asynchronous Spatio-Temporal Memory Network for Continuous Event-Based Object Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
37
1
2

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 57 publications
(40 citation statements)
references
References 52 publications
0
37
1
2
Order By: Relevance
“…Gehrig et al [135] implement an event-based sampling module in CARLA [327], which renders high-frame-rate images and converts them into dynamic events using ESIM [138]. Li et al [136] utilize V2E [139] to convert videos into dynamic events for object detection, and they directly use existing large-scale annotated labels from KITTI [325] dataset. Lin et al [137] propose an omnidirectional discrete gradient algorithm to convert frames into event streams.…”
Section: B Categorizationmentioning
confidence: 99%
“…Gehrig et al [135] implement an event-based sampling module in CARLA [327], which renders high-frame-rate images and converts them into dynamic events using ESIM [138]. Li et al [136] utilize V2E [139] to convert videos into dynamic events for object detection, and they directly use existing large-scale annotated labels from KITTI [325] dataset. Lin et al [137] propose an omnidirectional discrete gradient algorithm to convert frames into event streams.…”
Section: B Categorizationmentioning
confidence: 99%
“…For instance, single-modal event-based object detection is currently in the experimental phase, where learned methods make up the majority of the state-of-the-art. Learned implementations typically combine and embed events in an image-like representation, which are used with modified frame-based DNN architectures to make them compatible with event data, in either a temporal 29 31 or a nontemporal 32 manner. Recurrent and temporal approaches are typically more suitable for event data, given that events only provide brightness changes and not absolute brightness, at a given point in time, in contrast to frames.…”
Section: Related Workmentioning
confidence: 99%
“…Thus the temporal approaches would incorporate meaningful input history, instead of treating each split, of the input stream, independently, yet at the expense of higher computation. Although the single-modal event-based approaches are promising, their performance typically lags both the frame-based solutions, under normal conditions, as well as the combined solutions, which incorporate both modalities 8 , 29 , 33 35 The main reason behind this is due to the nature of the event data, especially in scenes where there is limited motion.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The highly sparse and fluctuating nature of events poses challenges for conventional object detection techniques based on Artificial Neural Networks (ANNs). While recurrent architectural and algorithmic approaches [44,32] have been proposed for preprocessing events, they often come with high computation costs and increased latency. In contrast, the Spiking Neural Networks (SNNs) are a new type of network inspired by the brain that propagate information through discrete spikes generated from their inherent temporal dynamics.…”
Section: Introductionmentioning
confidence: 99%