In silico predictive software makes it possible to assess the effect of amino acid substitutions on the structure or function of a protein without conducting functional studies. The accuracy of in silico pathogenicity prediction tools has not previously been assessed for variants associated with autosomal recessive deafness 1A (DFNB1A). Here, we identify the in silico tools that give the most accurate clinical significance predictions for missense variants of the GJB2 (Cx26), GJB6 (Cx30), and GJB3 (Cx31) connexin genes associated with DFNB1A. To evaluate the accuracy of the selected tools (SIFT, FATHMM, MutationAssessor, PolyPhen-2, CONDEL, MutationTaster, MutPred, Align GVGD, and PROVEAN), we tested nine missense variants whose clinical significance had previously been confirmed in a large cohort of deaf patients and control groups from the Sakha Republic (Eastern Siberia, Russia): Cx26: p.Val27Ile, p.Met34Thr, p.Val37Ile, p.Leu90Pro, p.Glu114Gly, p.Thr123Asn, and p.Val153Ile; Cx30: p.Glu101Lys; Cx31: p.Ala194Thr. We compared the tools on these variants by accuracy (Ac), sensitivity (Se), and specificity (Sp). As additional quality indicators, we used the correlation coefficient (r) and the area under the Receiver Operating Characteristic (ROC) curve (AUC). The largest AUC was obtained by three programs: SIFT (AUC = 0.833, p = 0.046), PROVEAN (AUC = 0.833, p = 0.046), and MutationAssessor (AUC = 0.833, p = 0.002). The most accurate predictions were given by two of the tested programs, SIFT and PROVEAN (Ac = 89%, Se = 67%, Sp = 100%, r = 0.75, AUC = 0.833). The results of this study may be applicable to the analysis of novel missense variants of the GJB2 (Cx26), GJB6 (Cx30), and GJB3 (Cx31) connexin genes.
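The reported quality indicators can be reproduced from a 2×2 confusion matrix. The sketch below is illustrative only: the abstract does not give the underlying counts, so the split used here (3 pathogenic and 6 benign variants, with one pathogenic variant missed) is an assumption chosen because it is consistent with the reported Ac, Se, and Sp. Treating r as the Matthews correlation coefficient and AUC as the single-threshold value (Se + Sp) / 2 are likewise assumptions, and the function name `binary_metrics` is hypothetical.

```python
from math import sqrt


def binary_metrics(tp, fn, tn, fp):
    """Compute accuracy, sensitivity, specificity, Matthews correlation,
    and the single-threshold AUC from confusion-matrix counts."""
    total = tp + fn + tn + fp
    accuracy = (tp + tn) / total
    sensitivity = tp / (tp + fn)  # fraction of pathogenic variants detected
    specificity = tn / (tn + fp)  # fraction of benign variants called benign
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Matthews correlation; defined as 0 when any marginal total is zero.
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    # For a classifier with one decision threshold, the ROC curve has a
    # single interior point and its area equals the mean of Se and Sp.
    auc_single_point = (sensitivity + specificity) / 2
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "mcc": mcc,
        "auc": auc_single_point,
    }


# Assumed counts (not stated in the abstract): 2 of 3 pathogenic variants
# predicted pathogenic, and all 6 benign variants predicted benign.
metrics = binary_metrics(tp=2, fn=1, tn=6, fp=0)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Under these assumed counts the accuracy is 8/9, the sensitivity 2/3, the specificity 1, and the single-threshold AUC 5/6, which agree with the rounded values reported for SIFT and PROVEAN; the Matthews correlation comes out at about 0.756, close to the reported r = 0.75.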
Pathogenic variants in the GJB2 gene, which encodes connexin 26, are known to be a major cause of hearing impairment (HI). More than 300 allelic variants have been identified in the GJB2 gene. The spectrum and allelic frequencies of GJB2 variants vary significantly among ethnic groups worldwide. Until now, the spectrum and frequency of the pathogenic variants in exon 1, exon 2, and the flanking intronic regions of the GJB2 gene have not been described thoroughly in the Sakha Republic (Yakutia), which is located in a subarctic region of Russia. Complete sequencing of the non-coding and coding regions of the GJB2 gene was performed in 393 patients with HI (296 Yakuts, 51 Russians, and 46 of mixed or other ethnicities) and in 187 normal-hearing individuals from the Yakut (n = 107) and Russian (n = 80) populations. In the total sample (n = 580), we found 12 allelic variants of the GJB2 gene, 8 of which were recessive pathogenic variants. Ten genotypes with biallelic recessive pathogenic variants in the GJB2 gene (in a homozygous or compound heterozygous state) were found in 192 of the 393 patients (48.85%). The most frequent GJB2 pathogenic variant in the Yakut patients was c.-23+1G>A (51.82%), followed by c.109G>A (2.37%) and c.35delG (1.64%). Among the Russian patients, the most frequent pathogenic variants were c.35delG (22.34%), c.-23+1G>A (5.31%), and c.313_326del14 (2.12%). The carrier frequencies of the c.-23+1G>A and c.109G>A pathogenic variants in the Yakut control group were 10.20% and 2.80%, respectively. The carrier frequencies of c.35delG and c.101T>C were identical (2.5%) in the Russian control group. The contribution of GJB2 pathogenic variants to HI in the population of the Sakha Republic (48.85%) was the highest among all previously studied regions of Asia.
We suggest that the extensive accumulation of the c.-23+1G>A pathogenic variant in the indigenous Yakut population (92.20% of all mutant chromosomes in patients) and its extremely high carrier frequency in the control group (10.20%) may indicate a selective advantage for c.-23+1G>A carriers living in a subarctic climate.