Polymorphic inversions contribute to adaptation and phenotypic variation. However, large multi-centric association studies of inversions remain challenging. We present
scoreInvHap
, a method to genotype inversions from SNP data for genome-wide association studies (GWASs), overcoming important limitations of current methods and outperforming them in accuracy and applicability.
scoreInvHap
calls individual inversion-genotypes from a similarity score to the SNPs of experimentally validated references. It can be used on different sources of SNP data, including those with low SNP coverage such as exome sequencing, and is easily adaptable to genotype new inversions, either in humans or in other species. We present 20 human inversions that can be reliably and easily genotyped with
scoreInvHap
to discover their role in complex human traits, and illustrate a first genome-wide association study of experimentally-validated human inversions.
scoreInvHap
is implemented in R and it is freely available from
Bioconductor
.
PurposePatients with Fanconi anaemia (FA), a rare DNA repair genetic disease, exhibit chromosome fragility, bone marrow failure, malformations and cancer susceptibility. FA molecular diagnosis is challenging since FA is caused by point mutations and large deletions in 22 genes following three heritability patterns. To optimise FA patients’ characterisation, we developed a simplified but effective methodology based on whole exome sequencing (WES) and functional studies.Methods68 patients with FA were analysed by commercial WES services. Copy number variations were evaluated by sequencing data analysis with RStudio. To test FANCA missense variants, wt FANCA cDNA was cloned and variants were introduced by site-directed mutagenesis. Vectors were then tested for their ability to complement DNA repair defects of a FANCA-KO human cell line generated by TALEN technologies.ResultsWe identified 93.3% of mutated alleles including large deletions. We determined the pathogenicity of three FANCA missense variants and demonstrated that two FANCA variants reported in mutations databases as ‘affecting functions’ are SNPs. Deep analysis of sequencing data revealed patients’ true mutations, highlighting the importance of functional analysis. In one patient, no pathogenic variant could be identified in any of the 22 known FA genes, and in seven patients, only one deleterious variant could be identified (three patients each with FANCA and FANCD2 and one patient with FANCE mutations)ConclusionWES and proper bioinformatics analysis are sufficient to effectively characterise patients with FA regardless of the rarity of their complementation group, type of mutations, mosaic condition and DNA source.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.