2021 IEEE International Conference on Image Processing (ICIP) 2021
DOI: 10.1109/icip42928.2021.9506112
|View full text |Cite
|
Sign up to set email alerts
|

Improving Filling Level Classification with Adversarial Training

Abstract: We investigate the problem of classifying -from a single imagethe level of content in a cup or a drinking glass. This problem is made challenging by several ambiguities caused by transparencies, shape variations and partial occlusions, and by the availability of only small training datasets. In this paper, we tackle this problem with an appropriate strategy for transfer learning. Specifically, we use adversarial training in a generic source dataset and then refine the training with a task-specific dataset. We … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
28
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 16 publications
(28 citation statements)
references
References 21 publications
0
28
0
Order By: Relevance
“…Object mass estimation requires the reasoning on different physical properties, especially when the object is a container. Existing perception algorithms use uni-modal or multi-modal data, such as audio, images, and videos, to classify the content type and level as well as the container capacity [4]- [7]. Convolutional neural networks can be trained to classify the content level within a range of containers from a single image when hand occlusions are present [7].…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…Object mass estimation requires the reasoning on different physical properties, especially when the object is a container. Existing perception algorithms use uni-modal or multi-modal data, such as audio, images, and videos, to classify the content type and level as well as the container capacity [4]- [7]. Convolutional neural networks can be trained to classify the content level within a range of containers from a single image when hand occlusions are present [7].…”
Section: Related Workmentioning
confidence: 99%
“…Existing perception algorithms use uni-modal or multi-modal data, such as audio, images, and videos, to classify the content type and level as well as the container capacity [4]- [7]. Convolutional neural networks can be trained to classify the content level within a range of containers from a single image when hand occlusions are present [7]. While the performance is limited by the uni-modal input, the choice of training strategy, e.g.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations