Background: While multiple genetic causes of movement disorders have been identified in the past decade, modifying factors of disease expression are still largely unknown for most conditions. X-linked dystonia-parkinsonism (XDP) is an inherited neurodegenerative disease caused by a SINE-VNTR-Alu (SVA)-type retrotransposon insertion that contains a hexanucleotide repeat within an intron of the TAF1 gene. To date, four putative genetic modifiers explain about 65% of variance in age at onset in XDP. However, additional genetic modifiers are conceivably at play in XDP and may include mismatches of the SVA hexanucleotide repeat motif. We aim to identify additional genetic modifiers of XDP expressivity and age at onset (AAO).
Methods: Third-generation sequencing of PCR amplicons from XDP patients (n=202) was performed to assess potential repeat interruption and instability. Repeat-primed PCR and Cas9-mediated targeted enrichment were used to confirm the presence of identified repeat mismatches.
Results: An increased frequency of deletions at the beginning of the hexanucleotide repeat (CCCTCT)n domain was found. Specifically, three deletions at positions 11, 14, and 17 of the TAF1 SVA repeat motif of somatic mosaic origins were detected in different combinations. The most common one was three deletions (1-2-3) at a median frequency 0.425 (IQR:0.42-0.43) and deletions within positions 11 and 14 (1-2-wt) at a median frequency 0.128 (IQR:0.12-0.13). The frequency of deletions at positions 11 and 14 correlated with repeat number (r=-0.48, p=9.5x10-13) and AAO (r=0.34, p=9.5x10-7). The association with AAO still stands when including other modifier genotypes (MSH3 and PMS2) in a regression model. However, the association dissipates when including repeat numbers.
Conclusion: We present a novel mosaic repeat motif deletion within the hexanucleotide repeat (CCCTCT)n domain of TAF1 SVA. Our study illustrates: 1) the importance of somatic mosaic genotypes; 2) the biological plausibility of multiple modifiers (both germline and somatic) that can have additive effects on repeat instability; 3) that these variations may remain undetected without assessment of single molecules.