Features and performance of some outlier detection methods

Barbato, Giulio; Barini, Emanuele Modesto; Genta, Gianfranco; Levi, Raffaello

doi:10.1080/02664763.2010.545119

Cited by 157 publications

(89 citation statements)

References 27 publications

“…Therefore, the indentation size effect results to be negligible in the investigated macro hardness range [9]. In conclusion, it is possible to estimate the measurement reproducibility of the indentation modulus according the three methods for evaluating S. In case of LEM and authors' methods, once identified and eliminated outliers [31], the standard deviations of the values of E IT are taken as measurement reproducibility; these values are respectively 6.74 kN/mm 2 and 7.14 kN/mm 2 . Instead, in case of PLM, it is considered the standard deviation of residuals with respect to the model in Eq.…”

Section: 2 (mentioning)

confidence: 97%

Measurement of elastic modulus by instrumented indentation in the macro-range: Uncertainty evaluation

Cagliero, Barbato, Maizza et al. 2015

International Journal of Mechanical Sciences

“…• Based on the sample mean and standard variance, IQR is less vulnerable to extreme values than other filtering methods because it uses quartiles that are resistant to extreme values (Walfish, 2006); (Barbato, Barini, Genta & Levi, 2011).…”

Section: Outlier Filtering (mentioning)

confidence: 99%

Building High-Quality Auction Fraud Dataset

Elshaar, Sadaoui 2019

CIS

Given the magnitude of online auction transactions, it is difficult to safeguard consumers from dishonest sellers, such as shill bidders. To date, the application of Machine Learning Techniques (MLTs) to auction fraud has been limited, unlike their applications for combatting other types of fraud. Shill Bidding (SB) is a severe auction fraud, which is driven by modern-day technologies and clever scammers. The difficulty of identifying the behavior of sophisticated fraudsters and the unavailability of training datasets hinder the research on SB detection. In this study, we developed a high-quality SB dataset. To do so, first, we crawled and preprocessed a large number of commercial auctions and bidders' history as well. We thoroughly preprocessed both datasets to make them usable for the computation of the SB metrics. Nevertheless, this operation requires a deep understanding of the behavior of auctions and bidders. Second, we introduced two new SB patterns and implemented other existing SB patterns. Finally, we removed outliers to improve the quality of training SB data.

• Bidders attempt to detect SB by themselves by tracking many of the competitor's behavior and communicating their suspicions to eBay. Very recently, the bidders' IDs and history are no longer available on eBay. We believe this new policy about blocking the bidding history is to not be able to discover SB activities.

• Buyers are the most affected by SB since they pay much more for the items. The price is driven up by disingenuous bidders with no intention of ever winning the bid. For instance, NBC News disclosed that a bidder paid $1,825 for a nearly complete set of 1959 Topps baseball cards on eBay (nbcnews.com nd). However, two undercover detectives determined that the purchaser ended up paying an extra $531 for the cards due to SB
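The IQR filtering named in the citation statement above can be illustrated with a minimal sketch. It is not taken from either paper: the function name `iqr_filter`, the fence multiplier `k=1.5` (the conventional Tukey value), the "inclusive" quartile convention, and the sample data are all assumptions made for illustration.

```python
import statistics


def iqr_filter(values, k=1.5):
    """Split values into (kept, flagged) using fences at Q1 - k*IQR and Q3 + k*IQR.

    k=1.5 is the conventional choice, assumed here; the cited papers may
    use a different multiplier or quartile definition.
    """
    # statistics.quantiles with n=4 returns the three quartile cut points.
    q1, _median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    kept = [v for v in values if low <= v <= high]
    flagged = [v for v in values if v < low or v > high]
    return kept, flagged


# Hypothetical sample: seven clustered readings and one extreme value.
sample = [9.8, 9.9, 10.0, 10.1, 10.2, 10.0, 9.9, 25.0]
kept, flagged = iqr_filter(sample)
print("kept:", kept)
print("flagged:", flagged)
```

Because the fences are built from quartiles, the single extreme reading does not move them much, which is the resistance property the statement refers to.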

“…Evidence of rather abusive exclusion of outliers appears for both IQR and Chauvenet's methods [16], suggesting an empirical distribution shape radically deviating from the original data set. On the other hand, by retaining a few borderline experimental values, modified IQR, Grubbs' and the extreme values method manage to preserve some typical features of the original data set, while at the same time effectively shielding estimates of main population parameters (central tendency and scatter) from outlier-induced bias.…”

Section: Identification and Treatment Of Outliers (mentioning)

confidence: 99%

“…Fig. 3 shows normal probability plots of values of coded gravity acceleration; at either tail a number of discordant results may be observed, mainly traceable to such anomalies as referred to above. Outliers are identified according to Chauvenet's, Grubbs', and IQR (interquartile range) methods; another based upon extreme value distribution [15], and a modification of IQR method were also considered [16], see also [2], [17].…”

Section: Metrological Problems In Absolute Measurement Of Gravity Acc (mentioning)

confidence: 99%

Treatment of Experimental Data with Discordant Observations: Issues in Empirical Identification of Distribution

Barbato, Genta, Germak et al. 2012

Measurement Science Review

Self Cite

Performances of several methods currently used for detection of discordant observations are reviewed, considering a set of absolute measurements of gravity acceleration exhibiting some peculiar features. Along with currently used methods, a criterion based upon distribution of extremes is also relied upon to provide references; a modification of a simple, broadly used method is mentioned, improving performances while retaining inherent ease of use. Identification of distributions underlying experimental data may entail a substantial uncertainty component, particularly when sample size is small, and no mechanistic models are available. A pragmatic approach is described, providing estimation to a first approximation of overall uncertainty, covering both estimation of parameters, and identification of distribution shape.
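For readers unfamiliar with Chauvenet's criterion, one of the methods named in the citation statements above, a minimal sketch of its textbook form follows. It assumes approximately normal data and flags a value when the expected number of observations at least as extreme, n · P(|Z| ≥ z), falls below 0.5. The function name `chauvenet_flags` and the sample data are hypothetical, and this is not the implementation used in the cited papers.

```python
import math
import statistics


def chauvenet_flags(values):
    """Return one boolean per value: True if Chauvenet's criterion flags it.

    Textbook form, assuming normality: flag a value when
    n * P(|Z| >= z) < 0.5, with z its standardized distance from the mean.
    """
    n = len(values)
    mean = statistics.fmean(values)
    sd = statistics.stdev(values)  # sample standard deviation
    flags = []
    for v in values:
        z = abs(v - mean) / sd
        # Two-sided normal tail probability: P(|Z| >= z) = erfc(z / sqrt(2)).
        tail_prob = math.erfc(z / math.sqrt(2))
        flags.append(n * tail_prob < 0.5)
    return flags


# Hypothetical sample: seven clustered readings and one extreme value.
sample = [9.8, 9.9, 10.0, 10.1, 10.2, 10.0, 9.9, 25.0]
flags = chauvenet_flags(sample)
print([v for v, flagged in zip(sample, flags) if flagged])
```

Unlike quartile-based fences, this criterion relies on the sample mean and standard deviation, which are themselves pulled by extreme values.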
