The power sector is one of the most important engineering sectors, and it relies on large amounts of equipment, often spread over wide areas, that must be properly maintained. Recent advances in deep learning make it possible to develop applications that automate the power line inspection process, replacing activities that were previously performed manually. In addition to novel algorithms, however, this approach requires specialized datasets: collections that have been properly curated and labeled with the help of experts in the field. For visual inspection processes, these data are mainly images of various types. This paper consists of two main parts. The first part presents information about datasets used in machine learning, especially deep learning. The need to create domain-specific datasets is justified using the example of data collected on power infrastructure objects, and selected repositories of different collections are compared. In addition, selected collections of digital image data are characterized in more detail. The second part discusses the use of an original dataset containing 2630 high-resolution labeled images of power line insulators and comments on the potential applications of this collection.