2013
DOI: 10.1007/s10618-012-0299-1
|View full text |Cite
|
Sign up to set email alerts
|

Ensemble-based noise detection: noise ranking and visual performance evaluation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
66
0
1

Year Published

2013
2013
2024
2024

Publication Types

Select...
6
2

Relationship

2
6

Authors

Journals

citations
Cited by 70 publications
(67 citation statements)
references
References 25 publications
0
66
0
1
Order By: Relevance
“…In the first three steps of the workflow depicted in Figure 4 the outlier documents are identified and extracted (instead of OntoGen we use NoiseRank [25] as implemented in TextFlows). The goal of this phase is to extract a set of outlier documents from the whole corpus of input documents.…”
Section: New Methodology and Its Implementation As A Repeatable Workfmentioning
confidence: 99%
See 2 more Smart Citations
“…In the first three steps of the workflow depicted in Figure 4 the outlier documents are identified and extracted (instead of OntoGen we use NoiseRank [25] as implemented in TextFlows). The goal of this phase is to extract a set of outlier documents from the whole corpus of input documents.…”
Section: New Methodology and Its Implementation As A Repeatable Workfmentioning
confidence: 99%
“…In contrast to the approach for outlier detection with OntoGen, described in Methods section, NoiseRank component implements a different strategy [25]. Here, classifiers are used to detect atypical documents in categorized document corpora, which can be considered as outliers of their own document category.…”
Section: New Methodology and Its Implementation As A Repeatable Workfmentioning
confidence: 99%
See 1 more Smart Citation
“…For each chart type we created a specific template including individual functionality available for a certain type of performance visualization. For example, in PR space charts we included the novel F-isoline evaluation approach [10] which enables to simultaneously visually evaluate algorithm performance in terms of recall, precision and the F -measure. As additional novelty, for ROC curve charts a corresponding PR curve chart can be created (and vice versa), since PR curves give a more informative picture of the algorithm's performance when dealing with highly skewed datasets, which provides additional insight for algorithms design, as discussed in [2].…”
Section: The Vipercharts Platformmentioning
confidence: 99%
“…These inconsistencies can be either errors, absent information or unknown values [2]. Whereas noise needs to be identified and treated, secure data in a dataset must be preserved [3]. The term secure data usually refers to instances that are core of the knowledge necessary to build accurate learning models.…”
Section: Introductionmentioning
confidence: 99%