As an example of optimizing population-specific genotyping assays using a whole-genome sequence reference set, we detail the approach that followed to design the Axiom-NL array which is characterized by an improved imputation backbone based on the Genome of the Netherlands (GoNL) reference sequence and, compared with earlier arrays, a more comprehensive inclusion of SNPs on chromosomes X, Y, and the mitochondria. Common variants on the array were selected to be compatible with the Illumina Psych Array and the Affymetrix UK Biobank Axiom array. About 3.5% of the array (23 977 markers) represents SNPs from the GWAS catalog, including SNPs at FTO, APOE, Ion-channels, killer-cell immunoglobulin-like receptors, and HLA. Around 26 000 markers associated with common psychiatric disorders are included, as well as 6705 markers suggested to be associated with fertility and twinning. The platform can thus be used for risk profiling, detection of new variants, as well as ancestry determination. Results of coverage tests in 249 unrelated subjects with GoNL-based sequence data show that after imputation with 1000G as a reference, the median concordance between original and imputed genotypes is above 98%. The median imputation quality R 2 for MAF thresholds of 0.001, 0.01, 0.05, and 40.05 are 0.05, 0.28, 0.80, 0.99, respectively, for the 1000G imputed SNPs, with a similar quality for the autosomes and X chromosome, showing a good genome-wide coverage for association studies after imputation. 3,4 to hormones, 5 personality, 6 educational attainment, 7 and lifestyle characteristics. [8][9][10] The major technology behind these successes is the relatively cheap genotyping, in comparison with full genome sequencing, of DNA samples on genotyping arrays with 300 K-5 M single-nucleotide polymorphisms (SNPs), followed by imputation of the unmeasured SNPs. Initially, the contents of these arrays were determined by the manufacturers, but recently companies also allow researchers to select the variants on an array. Here, we focus on the Axiom array, a genotyping solution from Affymetrix, Inc., which provides a high throughput platform for high-density SNP genotyping on a diverse range of sample types. This array has been used for several large population-wide genome-screening projects including the UK biobank 11 and the GERA cohorts. 12 We describe a similar custommade Axiom array for the Netherlands population, the Axiom-NL that allows good imputation and enhances association, and risk score analysis on DNA samples collected within Dutch Biobanks, such as the large number of Biobanks collaborating in BBMRI-NL. 13 Notwithstanding the application to this specific population, the SNP selection procedures and coverage testing can provide a general guideline for customizing the Axiom array in other populations for which valid reference sequence genomes are available.
MATERIALS AND METHODS
SNP selection for the Axiom-NL arrayAn overview of the SNP selection is provided in Figure 1, a stepwise procedure of the selection is given in the Supple...