Genome-wide identification of the genetic basis of amyotrophic lateral sclerosisHighlights d Machine learning method identifies risk genes by integrating GWASs and epigenetic data d Discovered ALS risk genes lead to a 5-fold increase in recovered heritability d Genetic and experimental support for initiation of ALS pathogenesis in the distal axon d Convergent genetic and experimental data establish KANK1 as a new ALS gene
Amyotrophic lateral sclerosis (ALS) is an archetypal complex disease centered on progressive death of motor neurons. Despite heritability estimates of 52%, GWAS studies have discovered only seven genome-wide significant hits, which are relevant to <10% of ALS patients. To increase the power of gene discovery, we integrated motor neuron functional genomics with ALS genetics in a hierarchical Bayesian model called RefMap. Comprehensive transcriptomic and epigenetic profiling of iPSC-derived motor neurons enabled RefMap to systematically fine-map genes and pathways associated with ALS. As a significant extension of the known genetic architecture of ALS, we identified a group of 690 candidate ALS genes, which is enriched with previously discovered risk genes. Extensive conservation, transcriptome and network analyses demonstrated the functional significance of these candidate genes in motor neurons and disease progression. In particular, we observed a genetic convergence on the distal axon, which supports the prevailing view of ALS as a distal axonopathy. Of the new ALS genes we discovered, we further characterized KANK1 that is enriched with coding and noncoding, common and rare ALS-associated genetic variation. Modelling patient mutations in human neurons reduced KANK1 expression and produced neurotoxicity with disruption of the distal axon. RefMap can be applied broadly to increase the discovery power in genetic association studies of human complex traits and diseases.
Summary
Amyotrophic lateral sclerosis (ALS) is an incurable neurodegenerative disease. CAV1 and CAV2 organize membrane lipid rafts (MLRs) important for cell signaling and neuronal survival, and overexpression of CAV1 ameliorates ALS phenotypes
in vivo
. Genome-wide association studies localize a large proportion of ALS risk variants within the non-coding genome, but further characterization has been limited by lack of appropriate tools. By designing and applying a pipeline to identify pathogenic genetic variation within enhancer elements responsible for regulating gene expression, we identify disease-associated variation within
CAV1/CAV2
enhancers, which replicate in an independent cohort. Discovered enhancer mutations reduce
CAV1/CAV2
expression and disrupt MLRs in patient-derived cells, and CRISPR-Cas9 perturbation proximate to a patient mutation is sufficient to reduce
CAV1/CAV2
expression in neurons. Additional enrichment of ALS-associated mutations within
CAV1
exons positions
CAV1
as an ALS risk gene. We propose
CAV1/CAV2
overexpression as a personalized medicine target for ALS.
Highlights d Machine learning method identifies risk genes by integrating GWASs and epigenetic data d Discovered ALS risk genes lead to a 5-fold increase in recovered heritability d Genetic and experimental support for initiation of ALS pathogenesis in the distal axon d Convergent genetic and experimental data establish KANK1 as a new ALS gene
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.