Keeping large-scale transportation infrastructure networks, such as railway networks, operational under all conditions is one of the major challenges today. The budgetary constraints for maintenance purposes and the network dimension are two of the main factors that make the management of a transportation network such a challenging task. Accordingly, aiming to assist the management of a transportation network, a data-driven model is proposed for stability condition prediction of embankment slopes. For such purpose, the highly flexible learning capabilities of Artificial Neural Networks (ANN) and Support Vector Machines (SVM) were used to fit data-driven models for Earthwork Hazard Category (EHC) prediction. Moreover, the data-driven models were created using visual information that is easy to collect during routine inspections. The proposed models were addressed following two different data modeling strategies: nominal classification and regression. Moreover, to overcome the problem of imbalanced data (since typically good conditions are much common than bad ones), three training sampling approaches were explored: no resampling, SMOTE and Oversampling. The achieved modeling results are presented and discussed, comparing the predictive performance of ANN and SVM algorithms, as well as the effect of the sampling approaches. A comparison between nominal classification and regression strategies was also carried out. Moreover, aiming a better understanding of the proposed data-driven 1 Tinoco et al., May 29, 2018 models, a detailed sensitivity analysis was applied, allowing to quantify the relative importance of each model input, as well as measuring their global effect on the prediction of embankments stability conditions.