2018
DOI: 10.1088/1757-899x/383/1/012042
|View full text |Cite
|
Sign up to set email alerts
|

Intelligent classification model for railway signal equipment fault based on SMOTE and ensemble learning

Abstract: Abstract. In this paper, we propose a novel intelligent classification model to classify the railway signal equipment fault based on SMOTE and ensemble learning. To tackle the imbalanced fault text data, the model uses SMOTE algorithm to generate the minority railway signal equipment fault class data randomly, making the data balanced. Then the model adopts the base classifier, such as Logistic Regression, Multinomial Naive Bayes, SVM and the ensemble classifier, such as GBDT, Random Forests to classify the da… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 15 publications
(14 citation statements)
references
References 6 publications
0
14
0
Order By: Relevance
“…It was used to generate the fault data randomly for making the data balanced. Result showed that the sample has significantly improved the error classification [10].…”
Section: Related Workmentioning
confidence: 95%
“…It was used to generate the fault data randomly for making the data balanced. Result showed that the sample has significantly improved the error classification [10].…”
Section: Related Workmentioning
confidence: 95%
“…In literature [11], for fault data generated by high-speed rail on-board equipment, a topic model is used to extract the feature information in the text, and the classification is completed by the Bayesian structure learning algorithm HDBN_SL. In literature [12], for unbalanced fault text data, the electrical service signal equipment fault text data was transformed into feature vectors by TF-IDF (Term Frequency-Inverse Document Frequency), and a voting-based approach was used to achieve integrated multiclassifier learning classification. Literature [13] used TF-IDF to transform on-board fault logs into feature vectors, built an ensemble classifier under the bagging framework, and used KELM (Kernel-Based Extreme Learning Machine) as the basic classifier to complete the text classification task; with the rise of deep learning in the field of NLP, it has also been applied to the field of text processing for rail transportation.…”
Section: Related Workmentioning
confidence: 99%
“…First of all, the Chinese text content should be segmented. In this paper, the Jieba word segmentation tool based on the professional corpus and the common corpus is used to segment the signal fault text [13], and the auxiliary words such as "de," "Le," and other words that cannot represent the document features are cleaned up, and then, the TF-IDF is used to calculate the weight of the vocabulary set to form a vocabulary weight matrix, and the number of each vocabulary is counted to form a vocabulary dictionary. e TF-IDF weight matrix is the number of documents and the vocabulary of all documents, so it has a large dimension.…”
Section: High-speed Railway Signal Fault Textmentioning
confidence: 99%