OMNIA Faster R-CNN: Detection in the wild through dataset merging and soft distillation

Ramé, Alexandre; Garreau, Emilien; Ben-younes, Hedi; Ollion, Charles

doi:10.48550/arxiv.1812.02611

Cited by 8 publications

(16 citation statements)

References 59 publications

“…Incremental Learning (IL): Gradually adding new categories while trying to limit the catastrophic forgetting [22]. Dataset Merging: [23] Dataset merging is closest work to our study. It proposes to combine datasets by filling the missing annotations of non-overlapping categories.…”

Section: Related Work (mentioning)

confidence: 99%

Model Composition: Can Multiple Neural Networks Be Combined into a Single Network Using Only Unlabeled Data?

Banitalebi-Dehkordi, Kang, Zhang

2021

Preprint

The diversity of deep learning applications, datasets, and neural network architectures necessitates a careful selection of the architecture and data that best match a target application. As an attempt to mitigate this dilemma, this paper investigates the idea of combining multiple trained neural networks using unlabeled data. In addition, combining multiple models into one can speed up inference, result in stronger, more capable models, and allow us to select efficient device-friendly target network architectures. To this end, the proposed method makes use of generation, filtering, and aggregation of reliable pseudo-labels collected from unlabeled data. Our method supports using an arbitrary number of input models with arbitrary architectures and categories. Extensive performance evaluations demonstrated that our method is very effective. For example, for the task of object detection and without using any ground-truth labels, an EfficientDet-D0 trained on Pascal-VOC and an EfficientDet-D1 trained on COCO can be combined into a RetinaNet-ResNet50 model, with a similar mAP as the supervised training. If fine-tuned in a semi-supervised setting, the combined model achieves +18.6%, +12.6%, and +8.1% mAP improvements over supervised training with 1%, 5%, and 10% of labels. Code is released as supplementary [7].
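The generation, filtering, and aggregation of pseudo-labels mentioned in the abstract can be illustrated with a minimal sketch. It is not the cited authors' method: the `Detection` type, the function names, and the 0.5 score threshold are hypothetical, and the filtering here is a plain confidence cut followed by a union of the surviving detections from each model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Detection:
    """One predicted object (hypothetical container for this sketch)."""
    box: tuple    # (x1, y1, x2, y2) in pixels
    label: str    # class name in the source model's label set
    score: float  # confidence in [0, 1]


def filter_confident(detections, min_score=0.5):
    """Keep only detections whose score reaches the (assumed) threshold."""
    return [d for d in detections if d.score >= min_score]


def aggregate_pseudo_labels(per_model_detections, min_score=0.5):
    """Union the confident detections of several models for one image.

    Each model may cover a different set of categories, so the result
    spans the union of their label sets. Sorted by descending score.
    """
    merged = []
    for detections in per_model_detections:
        merged.extend(filter_confident(detections, min_score))
    return sorted(merged, key=lambda d: d.score, reverse=True)


# Usage: two models with different label sets predict on the same image.
voc_like = [Detection((0, 0, 10, 10), "cat", 0.9),
            Detection((5, 5, 8, 8), "dog", 0.2)]
coco_like = [Detection((20, 20, 40, 40), "zebra", 0.7)]
pseudo_labels = aggregate_pseudo_labels([voc_like, coco_like])
```

A real pipeline would also need to reconcile overlapping boxes and categories shared between models, which this sketch leaves out.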

“…This has been done extensively in semi-supervised settings with the use of pseudo-labels. [20] Similarly, the authors of OMNIA [21] enable merging of datasets with different target classes using model predictions as a weakly supervised training signal. Our method of partial backpropagation takes inspiration from the literature.…”

Section: Related Work (mentioning)

confidence: 99%

“…[23] Close to our work, the idea has been applied to histopathology to solve spatially partial segmentation annotations, [24] as well as in OMNIA. [21] Finally, we make use of domain adaptation techniques to alleviate any data distribution shift impact on performances. We refer the reader to the review on domain adaptation for segmentation by Toldo, Marco, et al. [25] Domain adaptation techniques often use a regularization term preventing the network to learn different representation per input space.…”

Section: Related Work (mentioning)

confidence: 99%

“…Another way to use the original baseline model to reduce false positives on the new data is to use its high-confidence predicted areas as ground truth. The intuition behind the method takes inspiration from OMNIA, [21] and works as follows. If the baseline model predicts an area as healthy with a sufficiently high confidence (determined with a fixed threshold), the area can then safely be considered as truly healthy, and used for backpropagation during the training of the new model.…”

Section: Omnia-like (mentioning)

confidence: 99%
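The thresholding rule described in the statement above can be sketched as follows. This is an illustration under stated assumptions, not the cited authors' implementation: the function names, the 0.9 threshold, and the per-pixel defect-probability representation are all hypothetical. Pixels that are neither annotated nor confidently healthy receive zero weight, so they contribute nothing to the loss or its gradient.

```python
import numpy as np


def build_targets(baseline_defect_prob, annotated_defect, threshold=0.9):
    """Build per-pixel targets and loss weights from partial annotations.

    baseline_defect_prob: float array, baseline model's P(defect) per pixel.
    annotated_defect: bool array, True where an operator marked a defect.
    threshold: assumed fixed confidence needed to trust "healthy".
    """
    # Baseline is confident a pixel is healthy when 1 - P(defect) is high.
    confident_healthy = (1.0 - baseline_defect_prob) >= threshold
    # A human annotation always wins over the baseline's opinion.
    confident_healthy &= ~annotated_defect
    target = annotated_defect.astype(np.float64)
    # Only annotated or confidently healthy pixels are used for training.
    weight = (annotated_defect | confident_healthy).astype(np.float64)
    return target, weight


def masked_bce(pred_prob, target, weight, eps=1e-7):
    """Binary cross-entropy averaged over pixels with non-zero weight."""
    p = np.clip(pred_prob, eps, 1.0 - eps)
    loss = -(target * np.log(p) + (1.0 - target) * np.log(1.0 - p))
    total = weight.sum()
    return float((loss * weight).sum() / total) if total > 0 else 0.0


# Usage on a 2x2 image with one annotated defect pixel.
baseline = np.array([[0.05, 0.50], [0.02, 0.95]])
annotated = np.array([[False, False], [True, False]])
target, weight = build_targets(baseline, annotated)
loss = masked_bce(np.full((2, 2), 0.5), target, weight)
```

In this example only the top-left pixel (confidently healthy) and the bottom-left pixel (annotated defect) carry weight; the two uncertain pixels on the right are ignored.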

“…Thus, we experiment four techniques to improve the method and effectively reduce false positives: transfer learning (using the previous model state, i.e. re-training the model from the baseline model), OMNIA [21]-like (using the previous model confident background predictions as ground truth), taking advantage of additional prior knowledge on the healthiness of the mentored images (healthy-flagged mentored images are treated as fully annotated), and domain adaptation through domain-adversarial training (DANN). [2] We show that all four techniques can improve mAP on the mentored evaluation set by around 0.20 points compared to the baseline (Table 1), which is significantly better than our first mentored model.…”

Section: Improving Generalization (mentioning)

confidence: 99%

Active learning using weakly supervised signals for quality inspection

Cordier, Das, Gutierrez

2021

Preprint

Because manufacturing processes evolve fast and production visual aspect can vary significantly on a daily basis, the ability to rapidly update machine vision based inspection systems is paramount. Unfortunately, supervised learning of convolutional neural networks requires a significant amount of annotated images in order to learn effectively from new data. Acknowledging the abundance of continuously generated images coming from the production line and the cost of their annotation, we demonstrate it is possible to prioritize and accelerate the annotation process. In this work, we develop a methodology for learning actively [1] from rapidly mined, weakly (i.e. partially) annotated data, enabling a fast, direct feedback from the operators on the production line and tackling a big machine vision weakness: false positives. These may arise with covariate shift, which happens inevitably due to changing conditions of the data acquisition setup. In that regard, we show domain-adversarial training [2] to be an efficient way to address this issue.

What Can be Seen is What You Get: Structure Aware Point Cloud Augmentation

Hasecke, Alsfasser, Kummert

2022

2022 IEEE Intelligent Vehicles Symposium (IV)

To train a well performing neural network for semantic segmentation, it is crucial to have a large dataset with available ground truth for the network to generalize on unseen data. In this paper we present novel point cloud augmentation methods to artificially diversify a dataset. Our sensor-centric methods keep the data structure consistent with the lidar sensor capabilities. Due to these new methods, we are able to enrich low-value data with high-value instances, as well as create entirely new scenes. We validate our methods on multiple neural networks with the public SemanticKITTI [3] dataset and demonstrate that all networks improve compared to their respective baseline. In addition, we show that our methods enable the use of very small datasets, saving annotation time, training time and the associated costs.
