2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00647
|View full text |Cite
|
Sign up to set email alerts
|

MegDet: A Large Mini-Batch Object Detector

Abstract: The development of object detection in the era of deep learning, from R-CNN [11], Fast/Faster R-CNN [10, 31] to recent Mask R-CNN [14] and RetinaNet [24], mainly come from novel network, new framework, or loss design. However, mini-batch size, a key factor for the training of deep neural networks, has not been well studied for object detection. In this paper, we propose a Large Mini-Batch Object Detector (MegDet) to enable the training with a large minibatch size up to 256, so that we can effectively utilize a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
225
1

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
2
2
1

Relationship

2
8

Authors

Journals

citations
Cited by 300 publications
(229 citation statements)
references
References 40 publications
3
225
1
Order By: Relevance
“…This implies that our HRNet benefits more from longer training. Table 11 reports the comparison of our network to state-of-the-art single-model object detectors on COCO test-dev without using multi-scale training and multi- scale testing that are done in [65], [77], [88], [93], [102], [103]. In the Faster R-CNN framework, our networks perform better than ResNets with similar parameter and computation complexity: HRNetV2p-W32 vs. ResNet-101-FPN, HRNetV2p-W40 vs. ResNet-152-FPN, HRNetV2p-W48 vs. X-101-64 × 4d-FPN.…”
Section: Coco Object Detectionmentioning
confidence: 99%
“…This implies that our HRNet benefits more from longer training. Table 11 reports the comparison of our network to state-of-the-art single-model object detectors on COCO test-dev without using multi-scale training and multi- scale testing that are done in [65], [77], [88], [93], [102], [103]. In the Faster R-CNN framework, our networks perform better than ResNets with similar parameter and computation complexity: HRNetV2p-W32 vs. ResNet-101-FPN, HRNetV2p-W40 vs. ResNet-152-FPN, HRNetV2p-W48 vs. X-101-64 × 4d-FPN.…”
Section: Coco Object Detectionmentioning
confidence: 99%
“…(ii) The fact that some regions are over-sampled and some are undersampled might have adverse effects on learning, as the size of sample (i.e. batch size) is known to be related to the optimal learning rate [166].…”
Section: Imbalance In Overlapping Bbsmentioning
confidence: 99%
“…Notably, due to limitations on our computational resources, we could not increase the batch size inputted into DeepLabV3+ beyond a size of 8, whereas U-Net could use a batch size up to 32 (Table 3). In theory, a larger mini-batch size should help the network converge to a better minimum and therefore better final accuracy [38]. However, we did not see the benefit of larger batch-size in our model training for U-Net, where a model trained with a batch size of 4 achieved the best performance.…”
Section: Comparing the Performance Of A U-net Architecture Based Deepmentioning
confidence: 71%