2015
DOI: 10.1021/acs.jcim.5b00294

Data Quality in the Human and Environmental Health Sciences: Using Statistical Confidence Scoring to Improve QSAR/QSPR Modeling

Abstract: A greater number of toxicity data are becoming publicly available allowing for in silico modelling. However, questions often arise as how to incorporate data quality and how to deal with contradicting data if more than a single datum point is available for the same compound. In this study, two well-known and studied QSAR/QSPR models for skin permeability and aquatic toxicology have been investigated in the context of statistical data quality. In particular, the potential benefits of the incorporation …

citations
Cited by 10 publications
(6 citation statements)
references
References 38 publications
“…The greater the number of replicas, the lower the correlation coefficients became. Since this study was a single trial with simple random noise, the results do not contradict those of the previous works. The duplication of data should be treated more carefully than it is in the simple duplication.…”
Section: Results (supporting)
confidence: 52%
“…Cortes‐Ciriano et al. suggested that the use of multiple replica data sets permutated by random noise could improve the QSAR accuracy. In the replica PCR method, the experimental data and docking scores are replicated by the permutation of 5% noise.…”
Section: Methods (mentioning)
confidence: 99%
“…[8,9] Other researchers concluded that high-quality data are crucial for adequately predicting quantitative structure-activity relationship (QSAR) models [10,11]. The demand for reliability of predictions emerged already when the first QSAR models and expert systems appeared. Hence, the assessment of prediction reliability was addressed by many researchers.…”
Section: Introduction (mentioning)
confidence: 99%
“…This is a conservative approach where the lowest dose associated with a given toxic response is used. It is worth mentioning that the presence of multiple, comparable values for the same chemical can increase confidence in the data, and this may be expressed as a confidence score (CS); the use of such data will consequently improve the robustness of the model [39].…”
Section: Introduction (mentioning)
confidence: 99%