Background
Waardenburg syndrome (WS) is a rare genetic disorder mainly characterized by hearing loss and pigmentary abnormalities. Currently, seven causative genes have been identified for WS, but clinical genetic testing results show that 38.9% of WS patients remain molecularly unexplained. In this study, we performed multi-data integration analysis through protein-protein interaction and phenotype-similarity to comprehensively decipher the potential causative factors of undiagnosed WS. In addition, we explored the association between genotypes and phenotypes in WS with the manually collected 443 cases from published literature.
Results
We predicted two possible WS pathogenic genes (KIT, CHD7) through multi-data integration analysis, which were further supported by gene expression profiles in single cells and phenotypes in gene knockout mouse. We also predicted twenty, seven, and five potential WS pathogenic variations in gene PAX3, MITF, and SOX10, respectively. Genotype-phenotype association analysis showed that white forelock and telecanthus were dominantly present in patients with PAX3 variants; skin freckles and premature graying of hair were more frequently observed in cases with MITF variants; while aganglionic megacolon and constipation occurred more often in those with SOX10 variants. Patients with variations of PAX3 and MITF were more likely to have synophrys and broad nasal root. Iris pigmentary abnormality was more common in patients with variations of PAX3 and SOX10. Moreover, we found that patients with variants of SOX10 had a higher risk of suffering from auditory system diseases and nervous system diseases, which were closely associated with the high expression abundance of SOX10 in ear tissues and brain tissues.
Conclusions
Our study provides new insights into the potential causative factors of WS and an alternative way to explore clinically undiagnosed cases, which will promote clinical diagnosis and genetic counseling. However, the two potential disease-causing genes (KIT, CHD7) and 32 potential pathogenic variants (PAX3: 20, MITF: 7, SOX10: 5) predicted by multi-data integration in this study are all computational predictions and need to be further verified through experiments in follow-up research.