BACKGROUND-Duplications and deletions in the human genome can cause disease or predispose persons to disease. Advances in technologies to detect these changes allow for the routine identification of submicroscopic imbalances in large numbers of patients.
De novo mutations (DNMs) in protein-coding genes are a well-established cause of developmental disorders (DD). However, known DD-associated genes only account for a minority of the observed excess of such DNMs. To identify novel DD-associated genes, we integrated healthcare and research exome sequences on 31,058 DD parent-offspring trios, and developed a simulation-based statistical test to identify gene-specific enrichments of DNMs. We identified 285 significantly DD-associated genes, including 28 not previously robustly associated with DDs. Despite detecting more DD-associated genes than in any previous study, much of the excess of DNMs of protein-coding genes remains unaccounted for. Modelling suggests that over 1,000 novel DD-associated genes await discovery, many of which are likely to be less penetrant than the currently known genes. Research access to clinical diagnostic datasets will be critical for completing the map of dominant DDs.
Alveolar capillary dysplasia with misalignment of pulmonary veins (ACD/MPV) is a rare, neonatally lethal developmental disorder of the lung with defining histologic abnormalities typically associated with multiple congenital anomalies (MCA). Using array CGH analysis, we have identified six overlapping microdeletions encompassing the FOX transcription factor gene cluster in chromosome 16q24.1q24.2 in patients with ACD/MPV and MCA. Subsequently, we have identified four different heterozygous mutations (frameshift, nonsense, and no-stop) in the candidate FOXF1 gene in unrelated patients with sporadic ACD/MPV and MCA. Custom-designed, high-resolution microarray analysis of additional ACD/MPV samples revealed one microdeletion harboring FOXF1 and two distinct microdeletions upstream of FOXF1, implicating a position effect. DNA sequence analysis revealed that in six of nine deletions, both breakpoints occurred in the portions of Alu elements showing eight to 43 base pairs of perfect microhomology, suggesting replication error Microhomology-Mediated Break-Induced Replication (MMBIR)/Fork Stalling and Template Switching (FoSTeS) as a mechanism of their formation. In contrast to the association of point mutations in FOXF1 with bowel malrotation, microdeletions of FOXF1 were associated with hypoplastic left heart syndrome and gastrointestinal atresias, probably due to haploinsufficiency for the neighboring FOXC2 and FOXL1 genes. These differences reveal the phenotypic consequences of gene alterations in cis.
Genomic rearrangements involving AUTS2 (7q11.22) are associated with autism and intellectual disability (ID), although evidence for causality is limited. By combining the results of diagnostic testing of 49,684 individuals, we identified 24 microdeletions that affect at least one exon of AUTS2, as well as one translocation and one inversion each with a breakpoint within the AUTS2 locus. Comparison of 17 well-characterized individuals enabled identification of a variable syndromic phenotype including ID, autism, short stature, microcephaly, cerebral palsy, and facial dysmorphisms. The dysmorphic features were more pronounced in persons with 3'AUTS2 deletions. This part of the gene is shown to encode a C-terminal isoform (with an alternative transcription start site) expressed in the human brain. Consistent with our genetic data, suppression of auts2 in zebrafish embryos caused microcephaly that could be rescued by either the full-length or the C-terminal isoform of AUTS2. Our observations demonstrate a causal role of AUTS2 in neurocognitive disorders, establish a hitherto unappreciated syndromic phenotype at this locus, and show how transcriptional complexity can underpin human pathology. The zebrafish model provides a valuable tool for investigating the etiology of AUTS2 syndrome and facilitating gene-function analysis in the future.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.