While advances in genome sequencing technology make population-scale genomics a possibility, current approaches for analysis of these data rely upon parallelization strategies that have limited scalability, are complex to implement and lack reproducibility. Churchill, a balanced regional parallelization strategy, overcomes these challenges, fully automating the multiple steps required to go from raw sequencing reads to variant discovery. Through implementation of novel deterministic parallelization techniques, Churchill allows computationally efficient analysis of a high-depth whole genome sample in less than two hours. The method is highly scalable, enabling full analysis of the 1000 Genomes raw sequence dataset in a week using cloud resources. http://churchill.nchri.org/. Electronic supplementary material: The online version of this article (doi:10.1186/s13059-014-0577-x) contains supplementary material, which is available to authorized users.
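The abstract does not spell out how the "balanced regional" split is made, so the following is only a minimal sketch of one plausible reading: treat the chromosomes as one concatenated coordinate space and cut it into nearly equal-sized regions that may span chromosome boundaries, so each parallel worker gets a similar amount of sequence. The function name `balanced_regions`, the toy chromosome names and lengths, and the half-open 0-based coordinates are all assumptions made for illustration, not Churchill's actual implementation.

```python
from typing import Dict, List, Tuple

# (chromosome, start, end) with 0-based, half-open coordinates (an assumption)
Interval = Tuple[str, int, int]


def balanced_regions(chrom_lengths: Dict[str, int], n_regions: int) -> List[List[Interval]]:
    """Cut the concatenated genome into n_regions spans of nearly equal size.

    Hypothetical helper: a region may cover pieces of several chromosomes.
    The result depends only on the inputs, so repeated runs give the same split.
    """
    total = sum(chrom_lengths.values())
    # Region boundaries in global (concatenated) coordinates.
    bounds = [total * i // n_regions for i in range(n_regions + 1)]
    regions: List[List[Interval]] = [[] for _ in range(n_regions)]
    offset = 0  # global coordinate at which the current chromosome starts
    for chrom, length in chrom_lengths.items():
        for i in range(n_regions):
            lo = max(bounds[i], offset)
            hi = min(bounds[i + 1], offset + length)
            if lo < hi:  # region i overlaps this chromosome
                regions[i].append((chrom, lo - offset, hi - offset))
        offset += length
    return regions


# Toy genome with made-up lengths, split three ways.
toy_genome = {"chrA": 100, "chrB": 50, "chrC": 50}
toy_split = balanced_regions(toy_genome, 3)
```

In this toy case the second region covers the end of `chrA` and the start of `chrB`, which is the point of the design: region sizes stay balanced regardless of how uneven the chromosome lengths are.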
Background: There is tremendous potential for genome sequencing to improve clinical diagnosis and care once it becomes routinely accessible, but this will require formalizing research methods into clinical best practices in the areas of sequence data generation, analysis, interpretation and reporting. The CLARITY Challenge was designed to spur convergence in methods for diagnosing genetic disease starting from clinical case history and genome sequencing data. DNA samples were obtained from three families with heritable genetic disorders and genomic sequence data were donated by sequencing platform vendors. The challenge was to analyze and interpret these data with the goals of identifying disease-causing variants and reporting the findings in a clinically useful format. Participating contestant groups were solicited broadly, and an independent panel of judges evaluated their performance. Results: A total of 30 international groups were engaged. The entries reveal a general convergence of practices on most elements of the analysis and interpretation process. However, even given this commonality of approach, only two groups identified the consensus candidate variants in all disease cases, demonstrating a need for consistent fine-tuning of the generally accepted methods. There was greater diversity of the final clinical report content and in the patient consenting process, demonstrating that these areas require additional exploration and standardization. Conclusions: The CLARITY Challenge provides a comprehensive assessment of current practices for using genome sequencing to diagnose and report genetic diseases. There is remarkable convergence in bioinformatic techniques, but medical interpretation and reporting are areas that require further development by many groups.
Purpose: Somatic activating variants in the PI3K-AKT pathway cause vascular malformations with and without overgrowth. We previously reported an individual with capillary and lymphatic malformation harboring a pathogenic somatic variant in PIK3R1, which encodes three PI3K complex regulatory subunits. Here, we investigate PIK3R1 in a large cohort with vascular anomalies and identify an additional 16 individuals with somatic mosaic variants in PIK3R1. Methods: Affected tissue from individuals with vascular lesions and overgrowth recruited from a multisite collaborative network was studied. Next-generation sequencing targeting coding regions of cell-signaling and cancer-associated genes was performed followed by assessment of variant pathogenicity. Results: The phenotypic and variant spectrum associated with somatic variation in PIK3R1 is reported herein. Variants occurred in the inter-SH2 or N-terminal SH2 domains of all three PIK3R1 protein products. Phenotypic features overlapped those of the PIK3CA-related overgrowth spectrum (PROS). These overlapping features included mixed vascular malformations, sandal toe gap deformity with macrodactyly, lymphatic malformations, venous ectasias, and overgrowth of soft tissue or bone. Conclusion: Somatic PIK3R1 variants sharing attributes with cancer-associated variants cause complex vascular malformations and overgrowth. The PIK3R1-associated phenotypic spectrum overlaps with PROS. These data extend understanding of the diverse phenotypic spectrum attributable to genetic variation in the PI3K-AKT pathway.
Exome sequencing (ES) has become an important tool in pediatric genomic medicine, improving identification of disease-associated variation due to assay breadth. Depth is also afforded by ES, enabling detection of lower-frequency mosaic variation compared to Sanger sequencing in the studied tissue, thus enhancing diagnostic yield. Within a pediatric tertiary-care hospital, we report two years of clinical ES data from probands evaluated for genetic disease to assess diagnostic yield, characteristics of causal variants, and prevalence of mosaicism among disease-causing variants. Exome-derived, phenotype-driven variant data from 357 probands were analyzed concurrently with parental ES data, when available. Blood was the source of nucleic acid. Sequence read alignments were manually reviewed for all assessed variants. Sanger sequencing was used for suspected de novo or mosaic variation. Clinical provider notes were reviewed to determine concordance between laboratory-reported data and the ordering provider's interpretation of variant-associated disease causality. Laboratory-derived diagnostic yield and provider-substantiated diagnoses had 91.4% concordance. The cohort returned 117 provider-substantiated diagnoses among 115 probands for a diagnostic yield of 32.2%. De novo variants represented 64.9% of disease-associated variation within trio analyses. Among the 115 probands, five harbored disease-associated somatic mosaic variation. Two additional probands were observed to inherit a disease-associated variant from an unaffected mosaic parent. Among inheritance patterns, de novo variation was the most frequent disease etiology. Somatic mosaicism is increasingly recognized as a significant contributor to genetic disease, particularly with increased sequence depth attainable from ES. This report highlights the potential and importance of detecting mosaicism in ES.
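The reported diagnostic yield can be checked from the counts given in the abstract, assuming the yield is the number of probands with at least one provider-substantiated diagnosis (115) divided by all probands analyzed (357). The helper name `percent` is introduced here only for illustration.

```python
def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Counts taken from the abstract: 115 diagnosed probands out of 357 analyzed.
diagnostic_yield = percent(115, 357)
print(diagnostic_yield)
```

The yield is counted per proband rather than per diagnosis, which is why the 117 diagnoses (two probands presumably carrying more than one) do not enter the numerator under this assumption.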