Summary
Infrasound monitoring plays an important role in the framework of the surveillance of Mt. Etna, Europe’s largest active volcano. Compared to seismic monitoring, which is particularly effective for buried sources, infrasound signals mirror the activity of shallow sources like Strombolian explosions or degassing. The interpretation of infrasound signals is difficult to the untrained eye, as we have to account for volcanic and non-volcanic sources. The problem of handling large and complex data sets can be tackled with machine learning, namely pattern recognition techniques. Here we focus on so-called “Unsupervised Learning”, where we identify groups of patterns being similar to each other. The degree of similarity is based on a metric measuring the distance among the features of the patterns. This work aims at the identification of typical regimes of infrasound radiation and their relation to the state of volcanic activity at Mt. Etna. For this goal we defined features describing any infrasound pattern. These features were obtained using wavelet transform. We applied “Self-Organizing Maps” (SOM) to the features projecting them to a 2D representation space—the “map”. An intriguing aspect of SOM resides in the fact that the position of the patterns on the map can be expressed by a color code, in a manner that similar patterns are assigned a similar color code. This simplified representation of multivariate patterns allows to follow the development of their characteristics with time efficiently. During a training phase we considered a reference data set, which encompassed a large variety of scenarios. We identified typical groups of patterns which correspond to a specific regime of activity, being representative of the state of the volcano or noise conditions. These groups form areas on the 2D maps. In a second step we considered a test data set, which was not used during the training phase. Applying the same pre-processing as for the training data, we blindly assigned the test patterns to the regimes found before, identifying the one whose color code is most similar to the one calculated to the test pattern. We are thus able to assess the validity of the prediction. The classification scheme presented provides a reliable assessment of the state of activity and adds useful and supplementary details to the results of the real-time automatic system in operation at INGV-OE. This is of particular importance when no visible information of the volcanic activity is available either for unfavorable meteorological conditions or during night time.