2021
DOI: 10.3390/s21206715
|View full text |Cite
|
Sign up to set email alerts
|

Improved Random Forest Algorithm Based on Decision Paths for Fault Diagnosis of Chemical Process with Incomplete Data

Abstract: Fault detection and diagnosis (FDD) has received considerable attention with the advent of big data. Many data-driven FDD procedures have been proposed, but most of them may not be accurate when data missing occurs. Therefore, this paper proposes an improved random forest (RF) based on decision paths, named DPRF, utilizing correction coefficients to compensate for the influence of incomplete data. In this DPRF model, intact training samples are firstly used to grow all the decision trees in the RF. Then, for e… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 13 publications
(6 citation statements)
references
References 69 publications
0
6
0
Order By: Relevance
“…Zhang et al [105] used fuzzy neural network to establish the normal behavior model of key equipment parameters, calculated the residual between the sample set data and the normal model, constructed a multivariate Gaussian distribution model of the residual, and set the abnormal state threshold by using the contour of Gaussian probability density. In addition, methods such as SVM [106] , least squares SVM [107] , random forest [108] , and gradient boosting [109] are also often applied to anomaly detection.…”
Section: Data-driven Methodsmentioning
confidence: 99%
“…Zhang et al [105] used fuzzy neural network to establish the normal behavior model of key equipment parameters, calculated the residual between the sample set data and the normal model, constructed a multivariate Gaussian distribution model of the residual, and set the abnormal state threshold by using the contour of Gaussian probability density. In addition, methods such as SVM [106] , least squares SVM [107] , random forest [108] , and gradient boosting [109] are also often applied to anomaly detection.…”
Section: Data-driven Methodsmentioning
confidence: 99%
“…Likewise, another mathematical model based on N-manifolds may also be considered [51]. On the other hand, such procedures may not be really accurate when some parameters are missing, hence random forests based on decision paths are also proposed, leading to the proposition of an improved random forest based on decision paths [52]. With respect to decision trees, the use of multi-sensor data fusion with different fusion layers offers interesting solutions.…”
Section: Fault Diagnosis and Detectionmentioning
confidence: 99%
“…The random forest (RF) classifier is an ensemble learning algorithm that uses a combination of decision trees and implements voting laws for achieving statistical learning. 43 The RF classifier can deal with the nonlinear data, and it has found various applications, such as sample identification, feature selection, and classification of various machineries. [44][45][46] The above-prepared datasets (dataset I, II, and III) are supplied as input to the RF algorithm to detect the defects and discriminate among the gearbox's health states.…”
Section: Random Forest Algorithmmentioning
confidence: 99%