“…In order to deal with such domains, five mainstream approaches can be pursued [ 10 ]: - Feature generation and/or feature engineering, where numerical features are extracted ad-hoc from structured patterns (e.g., using their properties or via measurements) and can be further merged according to different strategies (e.g., in a multi-modal way [ 11 ]);
- Ad-hoc dissimilarities in the input space, where custom dissimilarity measures are designed in order to process structured patterns directly in the input domain without moving towards Euclidean (or metric) spaces. Common—possibly parametric—edit distances include the Levenshtein distance [ 12 ] for sequence domains and graph edit distances [ 13 ] for graphs domains;
- Embedding via information granulation and granular computing [ 3 , 14 , 15 , 16 , 17 , 18 , 19 , 20 , 21 , 22 , 23 , 24 , 25 ];
- Dissimilarity representations [ 26 , 27 , 28 ], where structured patterns are embedded in the Euclidean space according to their pairwise dissimilarities;
- Kernel methods, where the mapping between the original input space and the Euclidean space exploits positive-definite kernel functions [ 29 , 30 , 31 , 32 , 33 ].
…”