2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00237
|View full text |Cite
|
Sign up to set email alerts
|

ScratchDet: Training Single-Shot Object Detectors From Scratch

Abstract: Current state-of-the-art object objectors are fine-tuned from the off-the-shelf networks pretrained on large-scale classification dataset ImageNet, which incurs some additional problems: 1) The classification and detection have different degrees of sensitivity to translation, resulting in the learning objective bias; 2) The architecture is limited by the classification network, leading to the inconvenience of modification. To cope with these problems, training detectors from scratch is a feasible solution. How… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
71
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
5
2
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 141 publications
(71 citation statements)
references
References 29 publications
0
71
0
Order By: Relevance
“…We believe that our idea of adding an extra branch for the center keypoint can be potentially generalized to other existing one-stage approaches (e.g., SSD [27]). Meanwhile, some advanced training strategies [46] can be used for better performance. We leave as our future work.…”
Section: Discussionmentioning
confidence: 99%
“…We believe that our idea of adding an extra branch for the center keypoint can be potentially generalized to other existing one-stage approaches (e.g., SSD [27]). Meanwhile, some advanced training strategies [46] can be used for better performance. We leave as our future work.…”
Section: Discussionmentioning
confidence: 99%
“…(iii) The strides of conv5 x are too coarse to localize objects. DetNet [36] and ScratchDet [91] also discuss this problem and change the strides for object detection. Unlike these works, our finding is that SGD (with other regularization methods) automatically limits the intrinsic dimensionalities of standard ResNet without changing the strides.…”
Section: Eigenspectrum Dynamicsmentioning
confidence: 99%
“…The generator tries to model data distribution by generating fake images using a noise vector input and use these fake images to confuse the discriminator, while the discriminator competes with the generator to identify the real images from fake images. GAN and its variants [155,156,157] There are some works [107,160,161] exploring training object detectors from scratch. Shen et al [107] first proposed a novel framework DSOD (Deeply Supervised Object Detectors) to train detectors from scratch.…”
Section: Othersmentioning
confidence: 99%