2017
DOI: 10.48550/arxiv.1705.09587
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Enhancement of SSD by concatenating feature maps for object detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
83
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 69 publications
(83 citation statements)
references
References 0 publications
0
83
0
Order By: Relevance
“…The main feature fusion methods in the field of deep learning are divided into two categories: multi-network feature fusion [3,25,14] and single network feature fusion [11,3]. Multi-network feature fusion is always accompanied by multi-scale features, like Inception Model [25] or a set of multiple independent networks [3].…”
Section: Feature Fusion Methodsmentioning
confidence: 99%
“…The main feature fusion methods in the field of deep learning are divided into two categories: multi-network feature fusion [3,25,14] and single network feature fusion [11,3]. Multi-network feature fusion is always accompanied by multi-scale features, like Inception Model [25] or a set of multiple independent networks [3].…”
Section: Feature Fusion Methodsmentioning
confidence: 99%
“…Small object detection Recently, several ideas has been proposed for detecting small object [10,2,7,8]. Liu et al [10] augmented small object data by reducing the size of large objects for overcoming the not-enough-data problem.…”
Section: Related Workmentioning
confidence: 99%
“…However, it has the limitation of increased model complexity and slow down an speed due to applying deconvolution module to all feature maps. R-SSD [7] combines features of different scales through pooling and deconvolution and obtained improved accuracy and speed compared to DSSD. Li et al [8] uses Generative Adversarial Network(GAN) [5] to generate high-resolution features using low-resolution features as input to GAN.…”
Section: Related Workmentioning
confidence: 99%
“…Although deconvolution approaches have been very successful in the task of extracting the semantic information of the target being detected [8,11,13], adding too much deconvolution layers will inevitably lead to a significant increase in computational time-consuming [8,11], while providing too little semantic information will certainly lose some detection performance [13]. Based on the above consideration, three extension modules are applied in our proposed framework as shown in Figure 1, which have extended right amount of semantic information and can get good enough results without losing too much detection speed.…”
Section: Extension Modulementioning
confidence: 99%