“…Over the last decades a large amount of machine learning methods have emerged. Among the most widely used are decision trees (Breiman, 1984;Leibovici et al, 2011;Qi and Zhu, 2011;Zhang et al, 2009), artificial neural networks (Baykan and Yilmaz, 2010;Bue and Stepinski, 2006;Canty, 2009;Dubois et al, 2007;Mas and Flores, 2008;Pavel et al, 2011), support vector machines (Lima et al, 2012;Mountrakis et al, 2011;Petropoulos et al, 2012;Yu et al, 2012;Zuo and Carranza, 2011) and classifier ensembles (Breiman, 1996;Rodriguez-Galiano et al, 2012a, just to mention a few. These methods start from very diverse conceptual bases, although all of them have a series of shared advantages: (i) ability to learn complex patterns, considering nonlinear relationships between explanatory and dependent variables; (ii) generalization ability, hence applicable to incomplete or noisy databases; (iii) possibility to incorporate a priori information; and (iv) integration of different types of data in the analysis due to the absence of assumptions about the data used (e.g.…”