Array-based technologies have been used to detect chromosomal copy number changes (aneuploidies) in the human genome. Recent studies identified numerous copy number variants (CNV) and some are common polymorphisms that may contribute to disease susceptibility. We developed, and experimentally validated, a novel computational framework (QuantiSNP) for detecting regions of copy number variation from BeadArray™ SNP genotyping data using an Objective Bayes Hidden-Markov Model (OB-HMM). Objective Bayes measures are used to set certain hyperparameters in the priors using a novel re-sampling framework to calibrate the model to a fixed Type I (false positive) error rate. Other parameters are set via maximum marginal likelihood to prior training data of known structure. QuantiSNP provides probabilistic quantification of state classifications and significantly improves the accuracy of segmental aneuploidy identification and mapping, relative to existing analytical tools (Beadstudio, Illumina), as demonstrated by validation of breakpoint boundaries. QuantiSNP identified both novel and validated CNVs. QuantiSNP was developed using BeadArray™ SNP data but it can be adapted to other platforms and we believe that the OB-HMM framework has widespread applicability in genomic research. In conclusion, QuantiSNP is a novel algorithm for high-resolution CNV/aneuploidy detection with application to clinical genetics, cancer and disease association studies.
Gillespie syndrome (GS) is characterized by bilateral iris hypoplasia, congenital hypotonia, non-progressive ataxia, and progressive cerebellar atrophy. Trio-based exome sequencing identified de novo mutations in ITPR1 in three unrelated individuals with GS recruited to the Deciphering Developmental Disorders study. Whole-exome or targeted sequence analysis identified plausible disease-causing ITPR1 mutations in 10/10 additional GS-affected individuals. These ultra-rare protein-altering variants affected only three residues in ITPR1: Glu2094 missense (one de novo, one co-segregating), Gly2539 missense (five de novo, one inheritance uncertain), and Lys2596 in-frame deletion (four de novo). No clinical or radiological differences were evident between individuals with different mutations. ITPR1 encodes an inositol 1,4,5-triphosphate-responsive calcium channel. The homo-tetrameric structure has been solved by cryoelectron microscopy. Using estimations of the degree of structural change induced by known recessive- and dominant-negative mutations in other disease-associated multimeric channels, we developed a generalizable computational approach to indicate the likely mutational mechanism. This analysis supports a dominant-negative mechanism for GS variants in ITPR1. In GS-derived lymphoblastoid cell lines (LCLs), the proportion of ITPR1-positive cells using immunofluorescence was significantly higher in mutant than control LCLs, consistent with an abnormality of nuclear calcium signaling feedback control. Super-resolution imaging supports the existence of an ITPR1-lined nucleoplasmic reticulum. Mice with Itpr1 heterozygous null mutations showed no major iris defects. Purkinje cells of the cerebellum appear to be the most sensitive to impaired ITPR1 function in humans. Iris hypoplasia is likely to result from either complete loss of ITPR1 activity or structure-specific disruption of multimeric interactions.
Inherited retinal degeneration (IRD) is a common cause of visual impairment (prevalence B1/3500). There is considerable phenotype and genotype heterogeneity, making a specific diagnosis very difficult without molecular testing. We investigated targeted capture combined with next-generation sequencing using Nimblegen 12plex arrays and the Roche 454 sequencing platform to explore its potential for clinical diagnostics in two common types of IRD, retinitis pigmentosa and cone-rod dystrophy. 50 patients (36 unknowns and 14 positive controls) were screened, and pathogenic mutations were identified in 25% of patients in the unknown, with 53% in the early-onset cases. All patients with new mutations detected had an age of onset o21 years and 44% had a family history. Thirty-one percent of mutations detected were novel. A de novo mutation in rhodopsin was identified in one early-onset case without a family history. Bioinformatic pipelines were developed to identify likely pathogenic mutations and stringent criteria were used for assignment of pathogenicity. Analysis of sequencing metrics revealed significant variability in capture efficiency and depth of coverage. We conclude that targeted capture and next-generation sequencing are likely to be very useful in a diagnostic setting, but patients with earlier onset of disease are more likely to benefit from using this strategy. The mutation-detection rate suggests that many patients are likely to have mutations in novel genes.
Early B cell factor 3 (EBF3) is an atypical transcription factor that is thought to influence the laminar formation of the cerebral cortex. Here, we report that de novo mutations in EBF3 cause a complex neurodevelopmental syndrome. The mutations were identified in two large-scale sequencing projects: the UK Deciphering Developmental Disorders (DDD) study and the Canadian Clinical Assessment of the Utility of Sequencing and Evaluation as a Service (CAUSES) study. The core phenotype includes moderate to severe intellectual disability, and many individuals exhibit cerebellar ataxia, subtle facial dysmorphism, strabismus, and vesicoureteric reflux, suggesting that EBF3 has a widespread developmental role. Pathogenic de novo variants identified in EBF3 include multiple loss-of-function and missense mutations. Structural modeling suggested that the missense mutations affect DNA binding. Functional analysis of mutant proteins with missense substitutions revealed reduced transcriptional activities and abilities to form heterodimers with wild-type EBF3. We conclude that EBF3, a transcription factor previously unknown to be associated with human disease, is important for brain and other organ development and warrants further investigation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.