Objective To determine whether the sensitivity and specificity of SNP chips are adequate for detecting rare pathogenic variants in a clinically unselected population. Design Retrospective, population based diagnostic evaluation. Participants 49 908 people recruited to the UK Biobank with SNP chip and next generation sequencing data, and an additional 21 people who purchased consumer genetic tests and shared their data online via the Personal Genome Project. Main outcome measures Genotyping (that is, identification of the correct DNA base at a specific genomic location) using SNP chips versus sequencing, with results split by frequency of that genotype in the population. Rare pathogenic variants in the BRCA1 and BRCA2 genes were selected as an exemplar for detailed analysis of clinically actionable variants in the UK Biobank, and BRCA related cancers (breast, ovarian, prostate, and pancreatic) were assessed in participants through use of cancer registry data. Results Overall, genotyping using SNP chips performed well compared with sequencing; sensitivity, specificity, positive predictive value, and negative predictive value were all above 99% for 108 574 common variants directly genotyped on the SNP chips and sequenced in the UK Biobank. However, the likelihood of a true positive result decreased dramatically with decreasing variant frequency; for variants that are very rare in the population, with a frequency below 0.001% in UK Biobank, the positive predictive value was very low and only 16% of 4757 heterozygous genotypes from the SNP chips were confirmed with sequencing data. Results were similar for SNP chip data from the Personal Genome Project, and 20/21 individuals analysed had at least one false positive rare pathogenic variant that had been incorrectly genotyped. For pathogenic variants in the BRCA1 and BRCA2 genes, which are individually very rare, the overall performance metrics for the SNP chips versus sequencing in the UK Biobank were: sensitivity 34.6%, specificity 98.3%, positive predictive value 4.2%, and negative predictive value 99.9%. Rates of BRCA related cancers in UK Biobank participants with a positive SNP chip result were similar to those for age matched controls (odds ratio 1.31, 95% confidence interval 0.99 to 1.71) because the vast majority of variants were false positives, whereas sequence positive participants had a significantly increased risk (odds ratio 4.05, 2.72 to 6.03). Conclusions SNP chips are extremely unreliable for genotyping very rare pathogenic variants and should not be used to guide health decisions without validation.
BACKGROUND: Accurate penetrance of monogenic disorders is often unknown due to a phenotype-first approach to genetic testing. Here, we use a genotype-first approach in four large cohorts with different ascertainment contexts to accurately estimate penetrance of the three commonest causes of monogenic diabetes, Maturity Onset Diabetes of the Young (MODY). We contrast HNF1A-MODY / HNF4A-MODY which causes an age-related progressive diabetes and GCK-MODY, which causes life-long mild hyperglycaemia. METHODS: We analysed clinical and genetic sequencing data from four different cohorts: 1742 probands referred for clinical MODY testing; 2194 family members of the MODY probands; 132,194 individuals from an American hospital-based cohort; and 198,748 individuals from a UK population-based cohort. RESULTS: Age-related penetrance of diabetes for pathogenic variants in HNF1A and HNF4A was substantially lower in the clinically unselected cohorts compared to clinically referred probands (ranging from 32% to 98% at age 40yrs for HNF1A, and 21% to 99% for HNF4A). The background rate of diabetes, but not clinical features or variant type, explained the reduced penetrance in the unselected cohorts. In contrast, penetrance of mild hyperglycaemia for pathogenic GCK variants was similarly high across cohorts (ranging from 89 to 97%) despite substantial variation in the background rates of diabetes. CONCLUSIONS: Ascertainment context is crucial when interpreting the consequences of monogenic variants for age-related variably penetrant disorders. This finding has important implications for opportunistic screening during genomic testing.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.