2014
DOI: 10.1007/978-3-319-03095-1_14
|View full text |Cite
|
Sign up to set email alerts
|

Outliers Detection in Regression Analysis Using Partial Least Square Approach

Abstract: Abstract. Identifying abnormal behavior in the chosen dataset is essential for improving the quality of the given dataset and decreasing the impact of abnormal values/patterns in the knowledge discovery process. Outlier detection may be established in many data mining techniques. In this paper Regression analysis have been used to detect the outliers. Partial Least Square approach is mainly used in regression analysis. Laser dataset has been used to find out the outliers. The main objective is used for constru… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 7 publications
0
3
0
Order By: Relevance
“…OD is a key tool in safeguarding data quality and improving the model performance [43]. PLSR has been used for OD by excluding the data points not well described by the model to improve the calibration performance [44][45][46]. It is known that the presence of outliers strongly influences the PLSR method, and the models obtained may not describe the data well and impact overall model effectiveness [47].…”
Section: Discussionmentioning
confidence: 99%
“…OD is a key tool in safeguarding data quality and improving the model performance [43]. PLSR has been used for OD by excluding the data points not well described by the model to improve the calibration performance [44][45][46]. It is known that the presence of outliers strongly influences the PLSR method, and the models obtained may not describe the data well and impact overall model effectiveness [47].…”
Section: Discussionmentioning
confidence: 99%
“…It deviates from other objects such that it makes suspicious to be generated by a different mechanism [10]. Subject of outlier is either an elimination of disturbance such as noise reduction [10][11][12][13][14], or an interest of detection such as crime detection [15][16][17][18][19][20][21][22][23]. This paper focuses on the former view.…”
Section: Introductionmentioning
confidence: 99%
“…TP was the summation of inorganic, organic, and dissolved forms of phosphorus; and TN was the summation of nitrate, nitrite (NO2 -), ammonia (NH3), and organic nitrogen. An interquartile range (IQR) criteria with a factor of two (i.e., median ± 2*IQR) was used to remove outliers, which can distort the goodness of model fit (Aggarwal and Yu, 2001;Devarakonda et al, 2014). The sample size (n) of the temporal datasets ranged from 16 to 241 among the different sites (Table 1).…”
Section: Datasetsmentioning
confidence: 99%