“…They aim to select a reduced subset of features for predicting the target class while increasing the predictive accuracy of the classifier [17]. Although many methods address this problem [6,12,15,17,18,22,32], only a few of them explore hierarchical information to improve their effectiveness [11,20,24,29,30,31]. Existing hierarchical feature selection methods usually find a suitable subset of features by keeping the features with higher relevance values and removing redundancy among hierarchically related features.…”
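
The general idea in the last sentence of the excerpt (keep the more relevant features and drop redundant ones that are hierarchically related) can be illustrated with a minimal sketch. This is not the procedure of any of the cited methods. It assumes a hypothetical `parents` mapping that encodes the feature hierarchy, precomputed relevance scores (the values below are made up for illustration), and a simple greedy rule that treats any ancestor/descendant pair as redundant.

```python
from typing import Dict, List, Set


def ancestors(feature: str, parents: Dict[str, List[str]]) -> Set[str]:
    """Return every ancestor of `feature` in the hierarchy (a tree or a DAG)."""
    seen: Set[str] = set()
    stack = list(parents.get(feature, []))
    while stack:
        node = stack.pop()
        if node not in seen:
            seen.add(node)
            stack.extend(parents.get(node, []))
    return seen


def select_features(relevance: Dict[str, float],
                    parents: Dict[str, List[str]]) -> List[str]:
    """Greedy sketch (an assumption, not a published algorithm):
    visit features by decreasing relevance and keep a feature only if
    no already-kept feature is its ancestor or its descendant."""
    kept: List[str] = []
    # Ties in relevance are broken alphabetically to keep the result deterministic.
    for feature in sorted(relevance, key=lambda f: (-relevance[f], f)):
        redundant = any(
            other in ancestors(feature, parents)
            or feature in ancestors(other, parents)
            for other in kept
        )
        if not redundant:
            kept.append(feature)
    return kept


# Toy hierarchy (hypothetical): root -> a -> {a1, a2}, root -> b
parents = {"a": ["root"], "b": ["root"], "a1": ["a"], "a2": ["a"]}
# Illustrative relevance scores, e.g. from any feature-class association measure.
relevance = {"root": 0.1, "a": 0.6, "a1": 0.8, "a2": 0.3, "b": 0.5}

selected = select_features(relevance, parents)
print(selected)
```

In this toy run, `a` and `root` are discarded because the more relevant `a1` lies on the same path of the hierarchy, while `b` and `a2` are kept because neither is an ancestor or descendant of a kept feature. Real methods differ in how they measure relevance and in which hierarchical relations they treat as redundant.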