A PCA-based variable ranking and selection approach for electric energy load forecasting

Bezerra, Francisco Elânio; Grassi, Flavio; Dias, Cleber Gustavo; Pereira, Fábio Henrique

doi:10.1108/ijesm-12-2019-0009

Cited by 5 publications

(5 citation statements)

References 49 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…With respect to attribute selection, although the works of Giordano et al [10], Chang et al [11], and Bezerra et al [19] have addressed industrial applications, none of them address the application of attribute selection techniques to prioritize the selection of the most relevant sensors for data collection.…”

Section: Resultsmentioning

confidence: 99%

“…Subsequently, PCA identifies additional orthogonal directions, each capturing subsequent levels of variability, creating subsequent principal components. This iterative process continues until the principal components encapsulate all significant variations in the data [18][19][20].…”

Section: Feature Selectionmentioning

confidence: 99%

“…Performs dimensionality reduction by projecting the data into a new feature space composed of the most significant principal components [18][19][20].…”

Section: Principal Component Analysismentioning

confidence: 99%

“…In this work, PCA was applied to the data as indicated by [19,20] to verify each variable's percentage of total variance using the main components. To do this, we employed Sklearn's PCA Decomposition library in Python, which has a function called ex-plained_variance_ratio.…”

Section: Feature Selection and Comparison Of Techniques (Step 2)mentioning

confidence: 99%

See 3 more Smart Citations

Impacts of Feature Selection on Predicting Machine Failures by Machine Learning Algorithms

Bezerra,

Oliveira Neto,

Cervi

et al. 2024

Applied Sciences

Self Cite

View full text Add to dashboard Cite

In the context of Industry 4.0, managing large amounts of data is essential to ensure informed decision-making in intelligent production environments. It enables, for example, predictive maintenance, which is essential for anticipating and identifying causes of failures in machines and equipment, optimizing processes, and promoting proactive management of human, financial, and material resources. However, generating accurate information for decision-making requires adopting suitable data preprocessing and analysis techniques. This study explores the identification of machine failures based on synthetic industrial data. Initially, we applied the feature selection techniques Principal Component Analysis (PCA), Minimum Redundancy Maximum Relevance (mRMR), Neighborhood Component Analysis (NCA), and Denoising Autoencoder (DAE) to the collected data and compared their results. In the sequence, a comparison among three widely known machine learning classifiers, namely Random Forest (RF), Support Vector Machine (SVM), and Multilayer Perceptron neural network (MLP), was conducted, with and without considering feature selection. The results showed that PCA and RF were superior to the other techniques, allowing the classification of failures with rates of 0.98, 0.97, and 0.98 for the accuracy, precision, and recall metrics, respectively. Thus, this work contributes by solving an industrial problem and detailing techniques to identify the most relevant variables and machine learning algorithms for predicting machine failures that negatively impact production planning. The findings provided by this study can assist industries in giving preference to employing sensors and collecting data that can contribute more effectively to machine failure predictions.

show abstract

Section: Resultsmentioning

confidence: 99%

Section: Feature Selectionmentioning

confidence: 99%

“…Performs dimensionality reduction by projecting the data into a new feature space composed of the most significant principal components [18][19][20].…”

Section: Principal Component Analysismentioning

confidence: 99%

Section: Feature Selection and Comparison Of Techniques (Step 2)mentioning

confidence: 99%

See 2 more Smart Citations

Impacts of Feature Selection on Predicting Machine Failures by Machine Learning Algorithms

Bezerra,

Oliveira Neto,

Cervi

et al. 2024

Applied Sciences

Self Cite

View full text Add to dashboard Cite

show abstract

“…There are many current modeling methods for power load forecasting, and power load forecasting models with good performance have emerged [3]. Traditional models include models such as linear regression and moving averages, which collect and analyze the historical data of power loads, from which they find the changing law between power loads and factors and fit the future value of power loads according to the changing law [4][5][6]. The traditional model assumes that the power load is a fixed and unchanging trend, such as an upward trend or a downward trend [7][8][9].…”

Section: Introductionmentioning

confidence: 99%

Traceless Kalman filter threshold estimation for distributed power loads

Bao,

Zhang,

2023

Applied Mathematics and Nonlinear Sciences

View full text Add to dashboard Cite

Load forecasting is an important part of smart grid construction, energy management, and sustainable design of power systems, and has a great impact on the reliable operation of power grids, facility planning and other decisions. In this paper, we utilize the traceless transform of the UKF algorithm to obtain the sigma feature points generated by the mean value of the high-frequency wavelet components of the power signals and combine them with the EKF algorithm. We put forward a novel neural network hybrid Kalman TUKF algorithm, which will be used to carry out simulation experiments on the distributed electric loads and to estimate the threshold value of the loads in the numerical experiments. The results show that in comparison with the actual measurements, the TUKF algorithm improves by 34.7% in the RMSE metrics, 38.7% in the MAE metrics, and 40.6% in the MAPE metrics compared to the PFWNN. The TUKF algorithm is closer to the real curves and has the best prediction performance for all the time intervals of the prediction. The change of the threshold value has a more obvious effect on the prediction accuracy, and the best effect is in the scale δ = 0.5, i.e., the threshold frequency is selected as the middle value of the intermediate frequency.

show abstract

Assessment of spatial and temporal variations in water quality using multivariate statistical analysis in the Munroe Island, Kerala, India

Arya

Biju

Benchamin

2023

Acta Ecologica Sinica

View full text Add to dashboard Cite

A PCA-based variable ranking and selection approach for electric energy load forecasting

Cited by 5 publications

References 49 publications

Impacts of Feature Selection on Predicting Machine Failures by Machine Learning Algorithms

Impacts of Feature Selection on Predicting Machine Failures by Machine Learning Algorithms

Traceless Kalman filter threshold estimation for distributed power loads

Assessment of spatial and temporal variations in water quality using multivariate statistical analysis in the Munroe Island, Kerala, India

Contact Info

Product

Resources

About