Crack detection on roads is essential nowadays because it has a significant impact on ensuring the safety and reliability of road infrastructure. Thus, it is necessary to create more effective and precise crack detection techniques. A safer road network and a better driving experience for all road users can result from the implementation of the ROAD (Robotics-Assisted Onsite Data Collecting) system for spotting road cracks using deep learning and robots. The suggested solution makes use of a robot vision system’s capabilities to gather high-quality data about the road and incorporates deep learning methods for automatically identifying cracks. Among the tested algorithms, Xception stands out as the most accurate and predictive model, with an accuracy of over 90% during the validation process and a mean square error of only 0.03. In contrast, other deep neural networks, such as DenseNet201, InceptionResNetV2, MobileNetV2, VGG16, and VGG19, result in inferior accuracy and higher losses. Xception also achieves high accuracy and recall scores, indicating its capability to accurately identify and classify different data points. The high accuracy and superior performance of Xception make it a valuable tool for various machine learning tasks, including image classification and object recognition.