2007
DOI: 10.1109/tpwrs.2006.888990
|View full text |Cite
|
Sign up to set email alerts
|

Power Distribution Fault Cause Identification With Imbalanced Data Using the Data Mining-Based Fuzzy Classification $E$-Algorithm

Abstract: Power distribution systems have been significantly affected by many outage-causing events. Good fault cause identification can help expedite the restoration procedure and improve the system reliability. However, the data imbalance issue in many realworld data sets often degrades the fault cause identification performance. In this paper, the-algorithm, which is extended from the fuzzy classification algorithm by Ishibuchi et al. to alleviate the effect of imbalanced data constitution, is applied to Duke Energy … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
51
0
1

Year Published

2011
2011
2019
2019

Publication Types

Select...
8
2

Relationship

0
10

Authors

Journals

citations
Cited by 114 publications
(53 citation statements)
references
References 20 publications
1
51
0
1
Order By: Relevance
“…Among others, imbalanced datasets typically appear in the identification of anomalies in certain systems designed to work under steady conditions. Some examples are: Water Distribution Systems [116], the detection of oil spills from satellite images [136], the identification of power distribution fault causes [231] and the prediction of pre-term births [95]. This issue is growing in importance since it appears more and more in most real domains, especially in systems where data from the usual scenario are abundant while data from abnormal ones are scarce.…”
Section: Transformations Required To Fit Technical Assumptions Of Datmentioning
confidence: 99%
“…Among others, imbalanced datasets typically appear in the identification of anomalies in certain systems designed to work under steady conditions. Some examples are: Water Distribution Systems [116], the detection of oil spills from satellite images [136], the identification of power distribution fault causes [231] and the prediction of pre-term births [95]. This issue is growing in importance since it appears more and more in most real domains, especially in systems where data from the usual scenario are abundant while data from abnormal ones are scarce.…”
Section: Transformations Required To Fit Technical Assumptions Of Datmentioning
confidence: 99%
“…For using a GFS with an imbalanced dataset, the standard fitness function has to be altered, or else changes must be effected on the dataset for raising the importance of misclassifying instances of the minority class. Both techniques have been well studied in the context of GFSs: there are works that deal with the use of costs in fuzzy classifiers for imbalanced datasets 14,44,47,48,52 , and other studies suggest employing a preprocessing step in order to balance the training data before the training 3,19,20,21,22 . In this last respect, we can highlight the re-sampling procedure named "Synthetic Minority Oversampling Technique" or SMOTE 7 ; it has been shown that SMOTE is one the most efficient preprocessing algorithms for imbalanced data in relation with Fuzzy Rule Based Systems (FRBS) 19 .…”
Section: Introductionmentioning
confidence: 99%
“…They also provide a stark contrast to the WBCD results, which tend to increase in accuracy when more attributes are used, despite some attributes being poorer. Table 10 shows the calculated g-mean values for animal, tree and lightning-caused faults when the same data is classified using the Naive Bayes approach and the algorithms created by Xu et al The latter consist of an Artificial Neural Network (ANN) [20], an Artificial Immune Recognition System (AIRS) [18], a fuzzy algorithm called the EAlgorithm [19] and a Fuzzy Artificial Immune Recognition System (FAIRS) [25]). FAIRS is a hybrid system that incorporates the quick searching capability and memory mechanism of the AIRS algorithm and the inference rule extracting capability of the E-algorithm.…”
Section: Experimental Results Using the Duke Outage Datasetmentioning
confidence: 99%