2016
DOI: 10.1007/978-3-319-46493-0_22
|View full text |Cite
|
Sign up to set email alerts
|

A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection

Abstract: Abstract. A unified deep neural network, denoted the multi-scale CNN (MS-CNN), is proposed for fast multi-scale object detection. The MS-CNN consists of a proposal sub-network and a detection sub-network. In the proposal sub-network, detection is performed at multiple output layers, so that receptive fields match objects of different scales. These complementary scale-specific detectors are combined to produce a strong multi-scale object detector. The unified network is learned end-to-end, by optimizing a multi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
963
0
9

Year Published

2017
2017
2022
2022

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 1,314 publications
(974 citation statements)
references
References 41 publications
2
963
0
9
Order By: Relevance
“…Recently, it has also been introduced to remote sensing data analysis, such as remote sensing scene classification [43][44][45] and object detection [49][50][51]. However, its superiority has not been explored in SM retrieval.…”
Section: Discussionmentioning
confidence: 99%
“…Recently, it has also been introduced to remote sensing data analysis, such as remote sensing scene classification [43][44][45] and object detection [49][50][51]. However, its superiority has not been explored in SM retrieval.…”
Section: Discussionmentioning
confidence: 99%
“…Late fusion is used in decision-level, which combines several decision results by voting, finding maximum, or counting mean value [40][41][42][43]. Generally, it takes multiple loss functions to build multiple multi-task optimization problems, and the final output consists of every weighted decision scores.…”
Section: Fusion Strategymentioning
confidence: 99%
“…As distinguished from MS-CNN [25], Multi-PerNet is trained once for the classification layer and regression layer. Thus, the loss function is defined as:…”
Section: Figurementioning
confidence: 99%