Abstract:This paper presents a cooperative object transportation technique using deep reinforcement learning (DRL) based on curricula. Previous studies on object transportation highly depended on complex and intractable controls, such as grasping, pushing, and caging. Recently, DRL-based object transportation techniques have been proposed, which showed improved performance without precise controller design. However, DRL-based techniques not only take a long time to learn their policies but also sometimes fail to learn.… Show more
“…Manko et al [ 77 ] used CNN-based DRL architecture for multi-robot collaborative transportation where the objective is to carry an object from the start to the goal location. Eoh and Park [ 79 ] proposed a curriculum-based deep reinforcement learning method for training robots to cooperatively transport an object. In a curriculum-based RL, past experiences are organized and sorted to improve training efficiency [ 252 ].…”
Section: Multi-robot System Applications Of Multi-agent Deep Reinforc...mentioning
Deep reinforcement learning has produced many success stories in recent years. Some example fields in which these successes have taken place include mathematics, games, health care, and robotics. In this paper, we are especially interested in multi-agent deep reinforcement learning, where multiple agents present in the environment not only learn from their own experiences but also from each other and its applications in multi-robot systems. In many real-world scenarios, one robot might not be enough to complete the given task on its own, and, therefore, we might need to deploy multiple robots who work together towards a common global objective of finishing the task. Although multi-agent deep reinforcement learning and its applications in multi-robot systems are of tremendous significance from theoretical and applied standpoints, the latest survey in this domain dates to 2004 albeit for traditional learning applications as deep reinforcement learning was not invented. We classify the reviewed papers in our survey primarily based on their multi-robot applications. Our survey also discusses a few challenges that the current research in this domain faces and provides a potential list of future applications involving multi-robot systems that can benefit from advances in multi-agent deep reinforcement learning.
“…Manko et al [ 77 ] used CNN-based DRL architecture for multi-robot collaborative transportation where the objective is to carry an object from the start to the goal location. Eoh and Park [ 79 ] proposed a curriculum-based deep reinforcement learning method for training robots to cooperatively transport an object. In a curriculum-based RL, past experiences are organized and sorted to improve training efficiency [ 252 ].…”
Section: Multi-robot System Applications Of Multi-agent Deep Reinforc...mentioning
Deep reinforcement learning has produced many success stories in recent years. Some example fields in which these successes have taken place include mathematics, games, health care, and robotics. In this paper, we are especially interested in multi-agent deep reinforcement learning, where multiple agents present in the environment not only learn from their own experiences but also from each other and its applications in multi-robot systems. In many real-world scenarios, one robot might not be enough to complete the given task on its own, and, therefore, we might need to deploy multiple robots who work together towards a common global objective of finishing the task. Although multi-agent deep reinforcement learning and its applications in multi-robot systems are of tremendous significance from theoretical and applied standpoints, the latest survey in this domain dates to 2004 albeit for traditional learning applications as deep reinforcement learning was not invented. We classify the reviewed papers in our survey primarily based on their multi-robot applications. Our survey also discusses a few challenges that the current research in this domain faces and provides a potential list of future applications involving multi-robot systems that can benefit from advances in multi-agent deep reinforcement learning.
“…For instance, Luo et al [78] exploit a curriculum that gradually adjusts the precision requirements for multigoal reach experiments and show that it improves performance in a faster way. Eoh et al [29], on the other hand, employ a curriculum learning approach for challenging multirobot object transportation tasks that gradually increases both the transportation distance and number of robots involved. Leyendecker et al [70] propose a combination of reward curriculum and domain randomization to develop a robust sim-to-real transferable policy to execute a manipulation task in an industrial setup.…”
ecent successes aside, reinforcement learning (RL) still faces significant challenges in its application to the real-world robotics domain. Guiding the learning process with additional knowledge offers a potential solution, thus leveraging the strengths of data-and knowledge-driven approaches. However, this field of research encompasses several disciplines and hence would benefit from a structured overview.In this article, we propose a concept of guided RL that provides a systematic approach toward accelerating the training process and improving performance for real-world robotics settings. We introduce a taxonomy that structures guided RL approaches and shows how different sources of knowledge can be integrated into the learning pipeline in a practical way. Based on this, we describe available approaches in this field and quantitatively evaluate their specific impact in terms of efficiency, effectiveness, and sim-to-real transfer within the robotics domain. However, learning control policies in such a way naturally requires many interactions with the environment. This emphasizes the importance of both collecting highquality samples and exploring the search space in a sample-efficient manner. While directly learning on real robots is appealing, it comes along with substantial challenges, such as high sample cost, partial observability, and safety constraints [28]. Hence, simulators are often
“…(2) At each grasping point, the roll and pitch angle of the end effector is the same for the cooperative aerial manipulators. (3) Configurations of the payload are previously given, so aerial manipulators know the grasping point of the payload. However, the exact relative distances and heading angles between each robot are unknown.…”
Section: Kinematic Parameter Estimationmentioning
confidence: 99%
“…Among them, cooperative UAVs are widely exploited to handle a heavy or large payload [ 2 ] beyond the limits of a robot’s transportation capabilities. Recently, researchers have developed cooperative mobile manipulators [ 2 , 3 , 4 , 5 , 6 , 7 , 8 , 9 ] by exploiting grasping capability. However, due to several issues including the complexity associated with multiple aerial robots, they have focused on solving a control and coordination problem.…”
This paper presents an integrated framework that integrates the kinematic and dynamic parameter estimation of an irregular object with non-uniform mass distribution for cooperative aerial manipulators. Unlike existing approaches, including impedance-based control which requires expensive force/torque sensors or the first-order-momentum-based estimator which is weak to noise, this paper suggests a method without such sensor and strong to noise by exploiting the decentralized dynamics and sliding-mode-momentum observer. First, the kinematic estimator estimates the relative distances of multiple aerial manipulators by using translational and angular velocities between aerial robots. By exploiting the distance estimation, the desired trajectories for each aerial manipulator are set. Second, the dynamic parameter estimation is performed for the mass of the common object and the vector between the end-effector frame and the center of mass of the object. Finally, the proposed framework is validated with simulations using aerial manipulators combined with two degrees-of-freedom robotic arms using a noisy measurement. Throughout the simulation, we can decrease the mass estimation error by 60% compared to the existing first-order momentum-based method. In addition, a comparison study shows that the proposed method satisfactorily estimates an arbitrary center-of-mass of an unknown payload in noisy environments.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.