The paper presents the results of research on the topic of using reinforcement learning algorithms in drone control systems, with the aim of improving the quality and increasing the speed of this type of systems, their distribution and implementation in the relevant areas in Ukraine. The following main stages are highlighted and described: review of training systems with reinforcement, determination of the main parameters according to which training will be conducted, comparison of results obtained on different networks. After analyzing the results, it was found that creating a drone position stabilization system using reinforcement learning is a relevant and appropriate task today, and the most effective tool for this is the use of reinforcement learning in combination with deep neural networks. Drone settings contain many parameters. Selecting these parameters and learning how to control the drone takes a lot of time. Drone pilots usually rely on their own experience and intuition when flying. This article examines the use of deep reinforcement learning to assist the pilot in typical or complex situations, as well as to extend the life of drones and avoid out-of-state situations. A general model represents an algorithm with input parameters equal to those required to represent the possible states and output parameters of the system sufficient to describe the possible actions. The algorithm automatically selects different models according to different parameters. It is determined that the algorithm can successfully start work with a low-efficiency model template and show good model performance and adjust the parameters of the number of layers, policy, entropy ratio, etc. This shows the potential for further application of these algorithms for designing drones. The result obtained during the execution of this work was a system that allows to simplify the process of choosing a deep learning algorithm with reinforcement in any created simulation environment for an agent of any complexity simulated in the Unreal Engine 4 game engine. The drone setup master must correctly formulate the task that the drone must perform, determine the main requirements for performance and the main possible bad options for performance. As a result of training, the drone will be able to stabilize itself from different positions, which will help to avoid emergency situations. This work can be widely applied in modern realities.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.