Supervising the process of garbage collection and transportation is a very crucial task with significance implication to the reusing and the recycle of the garbage. Previous works mainly focus on the detection of certain kind of garbage instead of overseeing the whole process. This paper proposed a supervision approach based on the improved YOLOv3 to decrease the incidence of unexpected mixing of the different kinds of garbage during the transportation and the collection which will cause the inferior performance of waste classification. Firstly, to reduce the parameters and arithmetic operations of YOLOv3 and improving the model's detection speed, we displaced the standard convolution in YOLOv3 by depthwise separable convolution. Secondly, to solve the problem that YOLOv3 has poor location accuracy and performs poorly in muti-targets, triplet attention is introduced into the backbone, which increases almost no parameters to automatically learn cross-dimensional interactions, enhance the effective feature channel weights, and strengthen the feature extraction capability. Finally, we built a dataset of waste classification supervision using the images provided by a city environmental protection bureau on which we did massive experiments. The experimental result shows that compared with other detection algorithms, the improved YOLO v3 algorithm has better performance. The mAP is 98.5%, which is 0.7% and 1.1% higher than the YOLOv5l and the EfficientDet-B0, respectively, and the average detection speed of the model is 14.6ms/it, which meets the requirements of regulatory real-time and environment complexity of the supervision system.