2008
DOI: 10.1016/j.mrfmmm.2007.11.005
|View full text |Cite
|
Sign up to set email alerts
|

Correlating observed odds ratios from lung cancer case–control studies to SNP functional scores predicted by bioinformatic tools

Abstract: Bioinformatic tools are widely utilized to predict functional single nucleotide polymorphisms (SNPs) for genotyping in molecular epidemiological studies. However, the extent to which these approaches are mirrored by epidemiological findings has not been fully explored. In this study, we first surveyed SNPs examined in case-control studies of lung cancer, the most extensively-studied cancer type. We then computed SNP functional scores using four popular bioinformatics tools: SIFT, PolyPhen, SNPs3D, and PMut, an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2009
2009
2018
2018

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 22 publications
(15 citation statements)
references
References 62 publications
0
15
0
Order By: Relevance
“…A perfect prediction would result in a vertical line (infinite slope) at the origin and an AUC of 1, in contrast to a completely random prediction that would result in a line with a slope of 1 and an AUC of 0.5. Other measures to evaluate the ability of prediction methods to prioritize the impact of mutations include the balanced accuracy, which is the average of the sensitivity and specificity, 25 the F1 score, which is the harmonic mean of precision and recall, 122 the Matthews correlation coefficient (MMC), 93 the Spearman's rank correlation coefficient, 123 the Kendall tau rank correlation coefficient, 124 and the scaledependent metric root-mean-square deviation (RMSD). 61 It is important to be cautious when attempting to objectively compare methods, and only new, unpublished data should be included in a validation set in order to keep the methods on equal footing.…”
Section: Availability and Comparisonsmentioning
confidence: 99%
“…A perfect prediction would result in a vertical line (infinite slope) at the origin and an AUC of 1, in contrast to a completely random prediction that would result in a line with a slope of 1 and an AUC of 0.5. Other measures to evaluate the ability of prediction methods to prioritize the impact of mutations include the balanced accuracy, which is the average of the sensitivity and specificity, 25 the F1 score, which is the harmonic mean of precision and recall, 122 the Matthews correlation coefficient (MMC), 93 the Spearman's rank correlation coefficient, 123 the Kendall tau rank correlation coefficient, 124 and the scaledependent metric root-mean-square deviation (RMSD). 61 It is important to be cautious when attempting to objectively compare methods, and only new, unpublished data should be included in a validation set in order to keep the methods on equal footing.…”
Section: Availability and Comparisonsmentioning
confidence: 99%
“…The highest agreement was observed between 1-SIFT and PolyPhen-2 with a relative ratio agreement of 2/3, followed by the agreement between VarioWatch and 1-SIFT with a relative ratio agreement of 3/5, and the lowest agreement was seen between VarioWatch and PolyPhen-2 with a relative ratio agreement of 1/4. While previous studies have investigated the correlations between the 1-SIFT scores and PolyPhen-2 scores with Spearman's rank correlation coefficients, and significant correlations were found between these two scores (Zhu et al, 2008). According to be called 'deleterious' or 'damaging' by two programs at least, rs1051740 (EPHX1), rs1138272 (GSTP1) and rs6712954 (SERPINE2) are more likely to be potentially damaging than other variants which are likely benign.…”
Section: Resultsmentioning
confidence: 98%
“…The tools that were utilized for identifying the coding nsSNPs include Sorting Intolerant from Tolerant (SIFT) (Ng and Henikoff 2003; Shen et al 2006), Polymorphism Phenotyping (PolyPhen) (Johnson et al 2005; Zhu et al 2008), SNPeffect (Reumers et al 2005, 2006), large-scale annotation of coding nsSNPs (LS-SNP) (Karchin et al 2005; Ryan et al 2009) and SNPs3D (Yue et al 2006). …”
Section: Methodsmentioning
confidence: 99%