2009
DOI: 10.1145/1656274.1656278
|View full text |Cite
|
Sign up to set email alerts
|

The WEKA data mining software

Abstract: This paper presents the performance of a classifier built using the stackingC algorithm in nine different data sets. Each data set is generated using a sampling technique applied on the original imbalanced data set. Five new sampling techniques are proposed in this paper (i.e., SMOTERandRep, Lax Random Oversampling, Lax Random Undersampling, Combined-Lax Random Oversampling Undersampling, and Combined-Lax Random Undersampling Oversampling) that were based on the three sampling techniques (i.e., Random Undersam… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

11
8,298
0
331

Year Published

2012
2012
2023
2023

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 15,689 publications
(8,640 citation statements)
references
References 20 publications
11
8,298
0
331
Order By: Relevance
“…We previously evaluated [5] the ability of the extracted features to differentiate epileptic from non-epileptic epochs by examining several classification algorithms implemented by the WEKA machine learning toolkit software [24] including the BayesNet [24,26], RandomCommittee, RandomForest [27], IBk [28] and SMO [29,30] with RBF kernel. Since the overall highest accuracy was achieved by the BayesNet classifier, we now evaluate the examined fusion schemes with respect to the BayesNet.…”
Section: Methodsology For Classification Of Generalized Epileptic Andmentioning
confidence: 99%
“…We previously evaluated [5] the ability of the extracted features to differentiate epileptic from non-epileptic epochs by examining several classification algorithms implemented by the WEKA machine learning toolkit software [24] including the BayesNet [24,26], RandomCommittee, RandomForest [27], IBk [28] and SMO [29,30] with RBF kernel. Since the overall highest accuracy was achieved by the BayesNet classifier, we now evaluate the examined fusion schemes with respect to the BayesNet.…”
Section: Methodsology For Classification Of Generalized Epileptic Andmentioning
confidence: 99%
“…This approach could lead to a better comprehension about the role of spectrum intensity in the quality of classification. Three different classifiers were used: Naïve Bayes, k-nearest neighbor (kNN) and a decision tree C4.5 (J48), all of them implemented on open source software WEKA [44]. All classifiers were tested for both excitation wavelengths in the exact same conditions.…”
Section: Methodsmentioning
confidence: 99%
“…As input variables for the development of the classification model, the docking score and the CoMSIA similarity fields, steric, electrostatic, hydrophobic, H-bond donor and H-bond acceptor were used. Random tree (RT) classification technique implemented in WEKA 66 program was used to discriminate between the different classes. Decision trees represent a supervised approach to classification with its simple structure consisting of root, nodes, branches and leaves.…”
Section: ■ Materials and Methodsmentioning
confidence: 99%