“…Before training, we used K-means cluster mothed to define the sizes of the anchor boxes. We set k=9, after experiment, the result showed 9 different size of anchor boxes, they were (10,25), (12,44), (12,38), (14,23), (16,32), (18,55), (19,22), (24,26), (44, 35), while the pixel size of the image was fixed to 416×416.…”