2023
DOI: 10.48550/arxiv.2303.11228
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Bimodal SegNet: Instance Segmentation Fusing Events and RGB Frames for Robotic Grasping

Abstract: Object segmentation for robotic grasping under dynamic conditions often faces challenges such as occlusion, low light conditions, motion blur and object size variance. To address these challenges, we propose a Deep Learning network that fuses two types of visual signals, event-based data and RGB frame data. The proposed Bimodal SegNet network has two distinct encoders, one for each signal input and a spatial pyramidal pooling with atrous convolutions. Encoders capture rich contextual information by pooling the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 27 publications
0
2
0
Order By: Relevance
“…Eventbased methods have advantages as they can handle motion blur and high dynamic range [2] but may require labeled images such as the Event-based Semantic Segmentation (ESS) method [28] and have limitations in segmenting small objects [3]. Multiple modalities can be integrated to leverage complementarity, with CMX using transformer-based architecture [14] and Bimodal SegNet [13] fusing RGB with event frames. Although these methods demonstrate promising results, their limitations include overlooking the high temporal resolution of event-based data.…”
Section: Image Segmentation Methodsmentioning
confidence: 99%
“…Eventbased methods have advantages as they can handle motion blur and high dynamic range [2] but may require labeled images such as the Event-based Semantic Segmentation (ESS) method [28] and have limitations in segmenting small objects [3]. Multiple modalities can be integrated to leverage complementarity, with CMX using transformer-based architecture [14] and Bimodal SegNet [13] fusing RGB with event frames. Although these methods demonstrate promising results, their limitations include overlooking the high temporal resolution of event-based data.…”
Section: Image Segmentation Methodsmentioning
confidence: 99%
“…Semantic Segmentation [169], [170], [171] Advancements in semantic segmentation using CNNs, knowledge distillation, and unsupervised domain adaptation. Instance Segmentation [172], [173], [174] Techniques for instance segmentation fusing event-based data with RGB frames, using posterior attention modules, and modality translation and fusion. Efficient Semantic Segmentation [175] Introduction of a computationally efficient semantic segmentation approach using spiking encoder-decoder network.…”
Section: Category Focus Area Studies Key Contributionsmentioning
confidence: 99%