2020
DOI: 10.1109/access.2020.2995886
|View full text |Cite
|
Sign up to set email alerts
|

Accurate and Efficient LIF-Nets for 3D Detection and Recognition

Abstract: 3D object detection and recognition are crucial tasks for many spatiotemporal processing applications, such as computer-aided diagnosis and autonomous driving. Although prevalent 3D Convolution Nets (ConvNets) have continued to improve the accuracy and sensitivity, excessive computing resources are required. In this paper, we propose Leaky Integrate and Fire Networks (LIF-Nets) for 3D detection and recognition tasks. LIF-Nets have rich inter-frame sensing capability brought from membrane potentials, and low po… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

1
2
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
2
1

Relationship

2
6

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 45 publications
1
2
0
Order By: Relevance
“…Therefore, compared with Conv3D and ConvLSTM, ConvLIAF can save a significant amount of computational resources and storage overhead. The impact on accuracy is reported in literature Shi et al ( 2020 ) and Wu et al ( 2021 ), and is also discussed in our experiment section.…”
Section: Generalized Spatiotemporal Processing Via Neural Dynamicssupporting
confidence: 69%
“…Therefore, compared with Conv3D and ConvLSTM, ConvLIAF can save a significant amount of computational resources and storage overhead. The impact on accuracy is reported in literature Shi et al ( 2020 ) and Wu et al ( 2021 ), and is also discussed in our experiment section.…”
Section: Generalized Spatiotemporal Processing Via Neural Dynamicssupporting
confidence: 69%
“…In this paper, we feed the static image a (l=0) ∈ R d0 to our input layer and convert it to the binary map s (1) = H(u (1) − u th ), where u (1) = W (1) a (0) . In the context of SNNs, this direct image input is called direct encoding (Shi et al, 2020;Rueckauer et al, 2017;Zheng et al, 2021;Rathi & Roy, 2020;Lu & Sengupta, 2020). The difference to the SNN's direct encoding is that our encoding outputs the binary map, not the spike train.…”
Section: Encodingmentioning
confidence: 99%
“…Li et al [12] proposed 3D compression with excitation encoder-decoder structure and adopted focal loss as the loss function to solve the imbalance of positive and negative samples. Shi et al [13] proposed a brain-inspired LIF-NET using the spatiotemporal properties of the novel LIF neural network and achieved comparable detection accuracy while reducing computation complexity by over 60%. While the one-stage lung nodule detection methods described above are fast and consume fewer resources, they usually have a higher false-positive rate.…”
Section: Introductionmentioning
confidence: 99%