Heritability, the proportion of phenotypic variance explained by genetic factors, can be estimated from pedigree data 1 , but such estimates are uninformative with respect to the underlying genetic architecture. Analyses of data from genome-wide association studies (GWAS) on unrelated individuals have shown that for human traits and disease, approximately one-third to two-thirds of heritability is captured by common SNPs 2-5 . It is not known whether the remaining heritability is due to the imperfect tagging of causal variants by common SNPs, in particular if the causal variants are rare, or other reasons such as overestimation of heritability from pedigree data. Here we show that pedigree heritability for height and body mass index (BMI) appears to be fully recovered from whole-genome sequence (WGS) data on 21,620 unrelated individuals of European ancestry. We assigned 47.1 million genetic variants to groups based upon their minor allele frequencies (MAF) and linkage disequilibrium (LD) with variants nearby, and estimated and partitioned variation accordingly. The estimated heritability was 0.79 (SE 0.09) for height and 0.40 (SE 0.09) for BMI, consistent with pedigree estimates. Low-MAF variants in low LD with neighbouring variants were enriched for heritability, to a greater extent for protein altering variants, consistent with negative selection thereon. Cumulatively variants in the MAF range of 0.0001 to 0.1 explained 0.54 (SE 0.05) and 0.51 (SE 0.11) of heritability for height and BMI, respectively. Our results imply that the still missing heritability of complex traits and disease is accounted for by rare variants, in particular those in regions of low LD.
Fine-mapping to plausible causal variation may be more effective in multi-ancestry cohorts, particularly in the MHC, which has population-specific structure. To enable such studies, we constructed a large (
n
= 21,546) HLA reference panel spanning five global populations based on whole-genome sequences. Despite population specific long-range haplotypes, we demonstrated accurate imputation at G-group resolution (94.2%, 93.7%, 97.8% and 93.7% in Admixed African (AA), East Asian (EAS), European (EUR) and Latino (LAT) populations). Applying HLA imputation to genome-wide association study (GWAS) data for HIV-1 viral load in three populations (EUR, AA and LAT), we obviated effects of previously reported associations from population-specific HIV studies and discovered a novel association at position 156 in HLA-B. We pinpointed the MHC association to three amino acid positions (97, 67 and 156) marking three consecutive pockets (C, B and D) within the HLA-B peptide binding groove, explaining 12.9% of trait variance.
Summary
Silent cerebral infarct (SCI) is the most commonly recognized cause of neurological injury in sickle cell anaemia (SCA). We tested the hypothesis that magnetic resonance angiography (MRA)-defined vasculopathy is associated with SCI. Furthermore, we examined genetic variations in glucose-6-phosphate dehydrogenase (G6PD) and HBA (α-globin) genes to determine their association with intracranial vasculopathy in children with SCA. Magnetic resonance imaging (MRI) of the brain and MRA of the cerebral vasculature were available in 516 paediatric patients with SCA, enrolled in the Silent Infarct Transfusion (SIT) Trial. All patients were screened for G6PD mutations and HBA deletions. SCI were present in 41.5%(214 of 516) of SIT Trial children. The frequency of intracranial vasculopathy with and without SCI was 15.9% and 6.3%, respectively (p<0.001). Using a multivariate logistic regression model, only the presence of a SCI was associated with increased odds of vasculopathy (p=0.0007, odds ratio (OR) 2.84; 95% Confidence Interval (CI)=1.55-5.21). Among male patients with SCA, G6PD status was associated with vasculopathy (p=0.04, OR 2.78; 95% CI=1.04-7.42), while no significant association was noted for HBA deletions. Intracranial vasculopathy was observed in a minority of children with SCA, and when present, was associated with G6PD status in males and SCI.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.