Despite the considerable advancements in automated identification methods of highway hidden distress with ground-penetrating radar (GPR) images, there still exist challenges in realizing automated identification of highway hidden distress owing to the quantity, variability, and reliability of the distress samples and diversity of classification models. Firstly, the dataset collected contains 31,640 samples categorized into four categories: interlayer debonding, interlayer loosening, interlayer water seepage, and structural loosening from 1500 km highway, for obtaining larger enough samples and covering the variable range of distress samples. Secondly, the distresses were labeled by experienced experts, and the labels were verified with drilled cores to ensure their reliability. Lastly, 18 exemplary convolutional neural network (CNN) models from 8 different architectures were evaluated using evaluation metrics such as precision, recall, and f1-score. Further, confusion matrix and Grad-CAM techniques were utilized to analyze these models. The experimental results show that VGG13 performed most prominently and stably, while the lightweight network SqueezeNet1_1 performed particularly well with a batch size of 64. Furthermore, this study indicates that models with fewer layers can achieve comparable or better performance than deeper models.