2020
DOI: 10.1186/s40537-020-00288-8
|View full text |Cite
|
Sign up to set email alerts
|

Tracing outliers in the dataset of Drosophila suzukii records with the Isolation Forest method

Abstract: Invasive alien species (IAS) are one of the leading threats to native wildlife, human health and food safety/production [42, 43]. The ongoing increase in worldwide trade is facilitating the spread of IAS, causing significant ecological and economic impact. Understanding the spatio-temporal spread of invasion of IAS is crucial to allow prevention

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 36 publications
0
4
0
Order By: Relevance
“…In this paper, we have used the Isolation Forest method for outlier detection. The Isolation Forest method is a machine learning algorithm based on Random Forest and a Decision Trees classifier used to classify anomalies in large data sets [21]. Isolation Forest is an unsupervised machine learning algorithm for anomaly detection.…”
Section: Outlier Rejectionmentioning
confidence: 99%
“…In this paper, we have used the Isolation Forest method for outlier detection. The Isolation Forest method is a machine learning algorithm based on Random Forest and a Decision Trees classifier used to classify anomalies in large data sets [21]. Isolation Forest is an unsupervised machine learning algorithm for anomaly detection.…”
Section: Outlier Rejectionmentioning
confidence: 99%
“…Isolation Forest is an unsupervised anomaly detection method, which is suitable for continuous data. Isolation Forest uses a binary search tree structure (also known as "isolated tree") to isolate samples and detect outliers through the isolation of sample points, which is different from the anomaly detection algorithm that expresses the degree of alienation between samples by quantitative indexes such as distance and density [27]. Because most of the attention values in the TGAM are in the correct range, the number of outliers is small and there is a significant difference from most samples.…”
Section: The Design Of Data Transmission Via the Tgammentioning
confidence: 99%
“…Currently, the main approaches to addressing the issue of abnormal values are anomaly detection and removal. Existing methods for detecting and removing outliers include projection-based techniques: employing technologies like LSH 25 and spatial filling curves 26 to transform the original data into a new space structure with reduced complexity, where outlier scores are defined based on the characteristics of the new space; Isolation Forest method, 27,28 which measures data anomalies through randomized partitioning decision trees; and the Local Outlier Factor (LOF) method: 29 evaluating outliers by comparing the density of data points with their neighbouring points. However, these existing outlier removal methods often assume that data follows a specific distribution, which may not hold true in real-world scenarios.…”
Section: Introductionmentioning
confidence: 99%