Summary
From the last few years, we have witnessed the fourth generation industrial revolution (Industry 4.0), impact of which will be seen in the years to come in various disciplines such as healthcare, transportation, IoT, smart grid, autonomous vehicles, and image processing. These applications in Industry 4.0 may have data in the form of images, speech signals, videos having high dimensions containing multiple dimensions to represent data along different axis. So, the complexity of data processing increases with an increase in the dimensions of the dataset. Complexity can be viewed in terms of detecting and exploiting the relationships among different features of the dataset. These complexities among different attributes can be reduced with the help of dimensionality reduction techniques. These techniques reduce the dimensions from the original input dataset to a lower dimensional dataset. Dimensionality reduction methods are broadly categorized into two types as feature extraction and feature selection. In feature selection method, out of the original set, a subset of features are identified to get a smaller subset which can be used to build the model whereas, the feature extraction method reduces the dataset of high dimensions to a lower dimension space, that is, a space with a less number of features having different values in comparison to the original dataset. Keeping focus on these points, in this article, we have compared and analyzed different data dimensionality reduction techniques which reduce the dimensions of a large and complex dataset during data processing. In addition, we have discussed various data dimensionality reduction techniques and compared these techniques with respect to various parameters. The comparison among various techniques provides insights to the readers about the applicability of a specific technique to the stand‐alone or a group of applications.