Assessment of the ongoing activity of volcanoes is one of the key factors to reduce volcanic risks. In this paper, two Machine Learning (ML) approaches are presented to classify volcanic activity using multivariate geophysical data, namely the Decision Tree (DT) and K-Nearest Neighbours (KNN). The models were implemented using a data set recorded at Mount Etna (Italy), in the period 01 January 2011 -31December 2015, encompassing lava fountain events and intense Strombolian activity. Here a data set consisting of five geophysical features, namely the root-mean-square of seismic tremor (RMS) and its source depth, counts of clustered infrasonic events, radar RMS backscattering power and tilt derivative, was considered. Model performances were assessed by using a set of statistical indices commonly considered for classification approaches. Results show that between the investigated approaches the DT model is the most appropriate for classification of volcano activity and is suitable for early warning systems applications. Furthermore, the comparison with a different classifier approach, reported in literature, based on Bayesian Network (BN), is performed. FIGURE 2. From top to bottom: a) time series showing the state of Mount Etna during 01 January 2011 -31 December 2015; b) seismic RMS amplitudes; c) depth of volcanic tremor source centroid; d) number of distances of spatially clustered infrasonic events; e) signal obtained by the Doppler radar VOLDORAD 2B; f) tilt derivative of the signal recorded by the CBD station [after Cannavò et al. 2017].FIGURE 3. Example to classify the volcanic activity in terms of three features: average seismic RMS amplitude (rms); radar signal (radar); tilt derivative (tilt).
NUMERICAL RESULTSAs mentioned in the section 2, in this study two kinds of classifiers were considered, namely the DTs and the KNNs. In more detail, as a first stage, for each kind of HAJIAN ET AL. 6 FIGURE 4. Schematic of the Leave-One-Out cross validation method used in this study.