2021
DOI: 10.1155/2021/9436582
|View full text |Cite
|
Sign up to set email alerts
|

An Efficient Algorithm for the Detection of Outliers in Mislabeled Omics Data

Abstract: High dimensionality and noise have made it difficult to detect related biomarkers in omics data. Through previous study, penalized maximum trimmed likelihood estimation is effective in identifying mislabeled samples in high-dimensional data with mislabeled error. However, the algorithm commonly used in these studies is the concentration step (C-step), and the C-step algorithm that is applied to robust penalized regression does not ensure that the criterion function is gradually optimized iteratively, because t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 42 publications
0
1
0
Order By: Relevance
“…Outlier observations not only strongly impact parameter estimation but also variable selection (Alfons et al, 2013). Therefore, methods that are robust to observations that deviate from the remaining observations in the same group have been proposed, in particular, sparse modifications of the popular Least Trimmed Squares (LTS) robust estimator (Rousseeuw, 2013;Rousseeuw & Driessen, 2006) have been successful applied to high-dimensional data, namely gene expression cancer data (Alfons et al, 2013;Jensch et al, 2022;Segaert et al, 2019;Sun et al, 2021).…”
Section: Introductionmentioning
confidence: 99%
“…Outlier observations not only strongly impact parameter estimation but also variable selection (Alfons et al, 2013). Therefore, methods that are robust to observations that deviate from the remaining observations in the same group have been proposed, in particular, sparse modifications of the popular Least Trimmed Squares (LTS) robust estimator (Rousseeuw, 2013;Rousseeuw & Driessen, 2006) have been successful applied to high-dimensional data, namely gene expression cancer data (Alfons et al, 2013;Jensch et al, 2022;Segaert et al, 2019;Sun et al, 2021).…”
Section: Introductionmentioning
confidence: 99%