2018
DOI: 10.1002/pst.1903
|View full text |Cite
|
Sign up to set email alerts
|

Understanding the influence of individual variables contributing to multivariate outliers in assessments of data quality

Abstract: Summary Mahalanobis distance is often recommended to identify patients or clinical sites that are considered unusual in clinical trials. Patients extreme in one or more covariates may be considered outliers in that they reside some distance from the multivariate mean, which can be thought of as the center of the data cloud. Less often discussed, patients whose data are believed to be “too good to be true” are located near the centroid as inliers. In order to efficiently investigate these anomalies for potentia… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 11 publications
0
4
0
Order By: Relevance
“…Thus, it is possible to avoid inaccurate conclusions from results of the study. Therefore, good statistical practices must be followed with sophisticated techniques, such as those presented in this work related to detection of influential data and outliers, as well as other possible inconsistencies in the data; see the studies presented in [48,49], which support our discussion in terms of data quality and analytics in medicine. Thus, our study can be a knowledge addition to the toolkit of diverse practitioners, including medical doctors, applied statisticians, and data scientists.…”
Section: Conclusion Discussion and Future Researchmentioning
confidence: 61%
“…Thus, it is possible to avoid inaccurate conclusions from results of the study. Therefore, good statistical practices must be followed with sophisticated techniques, such as those presented in this work related to detection of influential data and outliers, as well as other possible inconsistencies in the data; see the studies presented in [48,49], which support our discussion in terms of data quality and analytics in medicine. Thus, our study can be a knowledge addition to the toolkit of diverse practitioners, including medical doctors, applied statisticians, and data scientists.…”
Section: Conclusion Discussion and Future Researchmentioning
confidence: 61%
“…However, it is difficult to investigate the causes that the site is out of normal. Mason et al [ 17 ] and Zink et al [ 18 ] tackle the problem of investigating the contribution of the individual variables to each multivariate outlier after detection of abnormalities. Zink et al [ 18 ] illustrated the application of the contribution plot [ 19 ] to RBM for real multicenter clinical trials.…”
Section: Discussionmentioning
confidence: 99%
“…Mason et al [ 17 ] and Zink et al [ 18 ] tackle the problem of investigating the contribution of the individual variables to each multivariate outlier after detection of abnormalities. Zink et al [ 18 ] illustrated the application of the contribution plot [ 19 ] to RBM for real multicenter clinical trials. Mason et al [ 17 ] proposed to calculate the magnitude of the contribution using decomposition of Hotelling's T 2 statistic into one dimensional variate.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation