2019
DOI: 10.48550/arxiv.1902.03057
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

OrthographicNet: A Deep Transfer Learning Approach for 3D Object Recognition in Open-Ended Domains

Abstract: Service robots are expected to be more autonomous and efficiently work in human-centric environments. For this type of robots, open-ended object recognition is a challenging task due to the high demand for two essential capabilities: (i) the accurate and real-time response, and (ii) the ability to learn new object categories from very few examples on-site. These capabilities are required for such robots since no matter how extensive the training data used for batch learning, the robot might be faced with an un… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 28 publications
0
2
0
Order By: Relevance
“…The DGFE module helps 3D convolutions hierarchically acquire global information, allowing the network to capture the contextual neighborhood of points. Despite using viewpoints in a predefined sequence, as opposed to any random views by DeepPano (Shi et al, 2015 ), Gan classifier (Varga et al, 2020 ), GPSP-DWRN (Long et al, 2021 ), OrthographicNet (Kasaei, 2019 ), PANORAMA-NN (Sfikas et al, 2017 ), and SeqViews2SeqLabels (Han et al, 2019 ) both of which are multi-view techniques, the method outperforms these approaches, making it suitable for high resolution input. The proposed method also outperforms PolyNet (Yavartanoo et al, 2021 ), a mesh-based 3D representation network that combined the features in a much smaller dimension using PolyShape's multi-resolution structure.…”
Section: Methodsmentioning
confidence: 99%
“…The DGFE module helps 3D convolutions hierarchically acquire global information, allowing the network to capture the contextual neighborhood of points. Despite using viewpoints in a predefined sequence, as opposed to any random views by DeepPano (Shi et al, 2015 ), Gan classifier (Varga et al, 2020 ), GPSP-DWRN (Long et al, 2021 ), OrthographicNet (Kasaei, 2019 ), PANORAMA-NN (Sfikas et al, 2017 ), and SeqViews2SeqLabels (Han et al, 2019 ) both of which are multi-view techniques, the method outperforms these approaches, making it suitable for high resolution input. The proposed method also outperforms PolyNet (Yavartanoo et al, 2021 ), a mesh-based 3D representation network that combined the features in a much smaller dimension using PolyShape's multi-resolution structure.…”
Section: Methodsmentioning
confidence: 99%
“…In the continuation of this work, we will investigate the possibility of using deep transfer learning methods for 3D object recognition in open-ended domains. Some results obtained with a deep transfer learning approach have already been published [38].…”
Section: System Demonstrationmentioning
confidence: 99%