In this article, we wrote not a large program to solve tasks for detection and tracking objects in real-time. The program was written in Python programming language. For object detection, a convolutional neural network was used with YOLOV3 architecture. A preliminary analysis was carried out of several variations of YOLO with CNN models. In the article, we justify why we want to use YOLO, and what it is and how to use and process the model output. We will also present the code in the form of a flowchart and as a result of the program's performance, we will show a picture of the program's operation in real-time, which was launched at one of the live lectures at the University.