“…To address this issue, research efforts have been directed towards various essential aspects like feature selection [48,49,50,21], training data selection [51,15], and classifier selection/fusion [52,53]. Among them, feature selection is considered especially applicable in big data analysis because it eliminates features with little predictive information, which also reduces the dimensionality of data and allows the learning algorithms to operate faster and more effectively [50]. In addition, research shows that a well designed feature selection method can not only handle high-dimensional data sets, but also successfully enhance classification performance in coping with imbalanced data [49,21].…”