The development of computer vision and the wide applicability of its applied components determine the relevance of research in this field of science. One of the most interesting tasks of computer vision is to monitor the behavior of people, which includes the analysis of their actions and carried out for various purposes. Examples of use are systems for monitoring compliance with safety regulations and the wearing of personal protective equipment by workers in factories, systems such as “smart home”, which track actions, systems for monitoring the condition of people in medical or social institutions, home systems for monitoring the condition of the elderly, which are able to notify relatives in cases of emergency situations. There is no comprehensive program that can solve the described problem and its variations, regardless of the scope of application. Therefore, the development of its prototype, which is a module that solves the human action recognition problem in the video, is an important problem. The article describes the creation of a software module that solves the human action recognition problem in a video. An overview of existing data sets suitable for training a neural network is provided, and data collection and processing for a custom dataset for actions of four different classes is described. The key features of the stages of creating, training and testing a neural network with the LSTM (Long Short-Term Memory) architecture, as well as options for its practical application, are described below. The developed module is quite flexible, there is a possibility to increase the number of classes of recognized actions depending on the scope of its application, as well as the possibility of integration with other modules for monitoring the behavior of people who have a similar device.