2019
DOI: 10.3389/fphy.2019.00194
|View full text |Cite
|
Sign up to set email alerts
|

Outlier Mining Methods Based on Graph Structure Analysis

Abstract: Outlier detection in high-dimensional datasets is a fundamental and challenging problem across disciplines that has also practical implications, as removing outliers from the training set improves the performance of machine learning algorithms. While many outlier mining algorithms have been proposed in the literature, they tend to be valid or efficient for specific types of datasets (time series, images, videos, etc.). Here we propose two methods that can be applied to generic datasets, as long as there is a m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
11
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
3
2

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(19 citation statements)
references
References 57 publications
0
11
0
Order By: Relevance
“…The second algorithm, described in Ref. 10 uses Isomap to learn the low-dimensional manifold in which the images are embedded, and assigns to each image an outlier score by comparing the geodesic distances with the distances in the low-dimensional space. Figure 2 shows the Image Map obtained from the two features returned by Isomap.…”
Section: Unsupervised Machine Learning Algorithmsmentioning
confidence: 99%
See 2 more Smart Citations
“…The second algorithm, described in Ref. 10 uses Isomap to learn the low-dimensional manifold in which the images are embedded, and assigns to each image an outlier score by comparing the geodesic distances with the distances in the low-dimensional space. Figure 2 shows the Image Map obtained from the two features returned by Isomap.…”
Section: Unsupervised Machine Learning Algorithmsmentioning
confidence: 99%
“…Figure 4 shows that the performance of the algorithm, measured by the correlation coefficient between the feature returned by the ordering algorithm and the feature provided by manual expert annotation (mean angle), improves when the images that contain artifacts (outliers detected by the outlier detection algorithms proposed in Ref. 10 ) are removed from the database. The different lines show results with different methods of outlier identification and the colored region indicates results when the images removed are randomly selected.…”
Section: Unsupervised Machine Learning Algorithmsmentioning
confidence: 99%
See 1 more Smart Citation
“…Many methods for outlier detection have been proposed in the literature (see, e.g. [2][3][4][5][6] and references therein), some of them, based on distances that can be computed between elements of the dataset [7][8][9][10][11]. In outlier detection via graph methods, distance-based outlier mining is based on a fully connected graph structure in which the nodes represent the elements of the dataset and the connections between them are quantified by a distance measure.…”
Section: Introductionmentioning
confidence: 99%
“…The potential of graph-based methods in combination with machine learning algorithms is addressed in Ref. [5]. The authors explore two methods to detect outliers, with applications to highdimensional datasets.…”
mentioning
confidence: 99%