OCRTOC: A Cloud-Based Competition and Benchmark for Robotic Grasping and Manipulation

Liu, Ziyuan; Liu, Wei; Qin, Yuzhe; Xiang, Fanbo; Gou, Minghao; Xin, Songyan; Roa, Máximo A.; Çallı, Berk; Su, Hao; Sun, Yu; Tan, Ping

doi:10.48550/arxiv.2104.11446

Cited by 5 publications

(7 citation statements)

References 32 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Our work is concerned with rearranging objects, an area that has a long history in robotics [31,32,34,51,54] but has recently gained traction in the vision and learning communities [2,19,39,67] thanks to the advances in simulation platforms. The works most relevant to ours are that of Labbé et al [34] and NeRP [51], which also address the rearrangement task with the goal state specified by an image.…”

Section: Related Workmentioning

confidence: 99%

IFOR: Iterative Flow Minimization for Robotic Object Rearrangement

Goyal¹,

Mousavian²,

Paxton³

et al. 2022

Preprint

View full text Add to dashboard Cite

Figure 1. An example of IFOR being applied to real data. The initial and goal scenes are shown on the left.Our approach allows the robot to repeatedly identify transformations that will minimize the flow for various objects between the current and goal scenes. It can then repeatedly grasp, move, and place objects, rotating as necessary, in order to achieve the configuration in the goal scene. The system is trained completely on synthetic data and transfers to the real world in zero-shot manner.

show abstract

Section: Related Workmentioning

confidence: 99%

IFOR: Iterative Flow Minimization for Robotic Object Rearrangement

Goyal¹,

Mousavian²,

Paxton³

et al. 2022

Preprint

View full text Add to dashboard Cite

show abstract

“…To illustrate, we provide the matching evaluations on objects with simulated images outside the training set in Table 7. Twenty unseen objects are selected from the OCRTOC dataset (Liu et al 2021), in which half of them have the same class as YCB-Video objects but with different shapes or textures (seen class), and other objects with novel class have not been seen in training (unseen class). The evaluation protocol is similar to real-real matching in 4.1.…”

Section: Model Efficiency and Generalizationmentioning

confidence: 99%

“…Twenty unseen objects are selected from the OCRTOC dataset (Liu et al 2021), in which half of them have the same class as YCB-Video objects but with different shapes or textures (seen class), which can be seen in Figure 14. The left with novel classes has not been seen in training (unseen class), which are visualized in Figure 15.…”

Section: A5 Model Generalizationmentioning

confidence: 99%

Sim2Real Object-Centric Keypoint Detection and Description

Zhong¹,

Liu²,

Qi³

et al. 2022

Preprint

View full text Add to dashboard Cite

Keypoint detection and description play a central role in computer vision. Most existing methods are in the form of scene-level prediction, without returning the object classes of different keypoints. In this paper, we propose the objectcentric formulation, which, beyond the conventional setting, requires further identifying which object each interest point belongs to. With such fine-grained information, our framework enables more downstream potentials, such as objectlevel matching and pose estimation in a clustered environment. To get around the difficulty of label collection in the real world, we develop a sim2real contrastive learning mechanism that can generalize the model trained in simulation to real-world applications. The novelties of our training method are three-fold: (i) we integrate the uncertainty into the learning framework to improve feature description of hard cases, e.g., less-textured or symmetric patches; (ii) we decouple the object descriptor into two output branches-intra-object salience and inter-object distinctness, resulting in a better pixel-wise description; (iii) we enforce cross-view semantic consistency for enhanced robustness in representation learning. Comprehensive experiments on image matching and 6D pose estimation verify the encouraging generalization ability of our method from simulation to reality. Particularly for 6D pose estimation, our method significantly outperforms typical unsupervised/sim2real methods, achieving a closer gap with the fully supervised counterpart. Additional results and videos can be found at https://zhongcl-thu.github.io/rock/.

show abstract

“…Vehicle Navigation CommonRoad [15] 2017 × × × × × Robot@Home [16] 2017 × × × × × Multi-Agent Path-Find Benchmark [17] 2019 × × × × × MAVBench [18] 2020 × × × × × BARN [19] 2020 × × × Bench-MR [20] 2021 × × × × PathBench [21] 2021 × × × General Robotics OMPLBenchmarks [22] 2015 × × × × × Robobench [23] 2016 × × Roboturk (Teleoperation database) [24] 2019 × × × RLBench [25] 2020 × OCRTOC [26] 2021 × Robot Manipulation ACRV picking benchmark [2] 2017 × × RoboNet [27] 2019 × × × GraspNet [28] 2020 × × × × × × Brown Planning Benchmarks [29] 2020 × × Aerial Manipulation [30] 2020 × × × Bimanual Manipulation Benchmark [31] 2020 × × In-hand manipulation benchmark [32] 2020 × × × × ProbRobScene [33] 2021…”

Section: Sensed Representation Articulated Robotsmentioning

confidence: 99%

“…The second category of datasets is focused on general robotics. These works aim at covering broad robotic categories like providing datasets and tools for remote teleoperation [24] or object rearrangement [26]. While many papers are concentrating on learning-based approaches [25], there is also a trend towards more reproducibility, for example by using containerization [21] to ease comparison over different operating systems or configurations.…”

Section: Sensed Representation Articulated Robotsmentioning

confidence: 99%

MotionBenchMaker: A Tool to Generate and Benchmark Motion Planning Datasets

Chamzas,

Quintero-Peña,

Kingston

et al. 2021

Preprint

View full text Add to dashboard Cite

Recently, there has been a wealth of development in motion planning for robotic manipulation-new motion planners are continuously proposed, each with their own unique strengths and weaknesses. However, evaluating new planners is challenging and researchers often create their own ad-hoc problems for benchmarking, which is time-consuming, prone to bias, and does not directly compare against other state-of-the-art planners. We present MOTIONBENCHMAKER, an open-source tool to generate benchmarking datasets for realistic robot manipulation problems. MOTIONBENCHMAKER is designed to be an extensible, easyto-use tool that allows users to both generate datasets and benchmark them by comparing motion planning algorithms. Empirically, we show the benefit of using MOTIONBENCHMAKER as a tool to procedurally generate datasets which helps in the fair evaluation of planners. We also present a suite of 40 prefabricated datasets, with 5 different commonly used robots in 8 environments, to serve as a common ground to accelerate motion planning research.

show abstract

OCRTOC: A Cloud-Based Competition and Benchmark for Robotic Grasping and Manipulation

Cited by 5 publications

References 32 publications

IFOR: Iterative Flow Minimization for Robotic Object Rearrangement

IFOR: Iterative Flow Minimization for Robotic Object Rearrangement

Sim2Real Object-Centric Keypoint Detection and Description

MotionBenchMaker: A Tool to Generate and Benchmark Motion Planning Datasets

Contact Info

Product

Resources

About