2015
DOI: 10.1186/s13742-015-0047-8
|View full text |Cite
|
Sign up to set email alerts
|

Second-generation PLINK: rising to the challenge of larger and richer datasets

Abstract: BackgroundPLINK 1 is a widely used open-source C/C++ toolset for genome-wide association studies (GWAS) and research in population genetics. However, the steady accumulation of data from imputation and whole-genome sequencing studies has exposed a strong need for faster and scalable implementations of key functions, such as logistic regression, linkage disequilibrium estimation, and genomic distance evaluation. In addition, GWAS and population-genetic data now frequently contain genotype likelihoods, phase inf… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

7
9,371
0
21

Year Published

2015
2015
2023
2023

Publication Types

Select...
10

Relationship

0
10

Authors

Journals

citations
Cited by 9,709 publications
(9,399 citation statements)
references
References 39 publications
7
9,371
0
21
Order By: Relevance
“…The input files for fastStructure were generated using PLINK v1.9 (Chang et al., 2015). We executed the program using the default settings with simple prior and tested multiple K values ranging from 1 to 6.…”
Section: Methodsmentioning
confidence: 99%
“…The input files for fastStructure were generated using PLINK v1.9 (Chang et al., 2015). We executed the program using the default settings with simple prior and tested multiple K values ranging from 1 to 6.…”
Section: Methodsmentioning
confidence: 99%
“…Q‐values were visualized in R (R Core Team, 2016). To have an overall estimate on population divergence, we calculated in PLINK 1.9 (Chang et al., 2015) the average genomewide pairwise F ST (Weir & Cockerham, 1984) between A. m. iberiensis, A. m. carnica and A. m. ligustica and between A. m. iberiensis and combined A. m. carnica with A. m. ligustica (C‐lineage).…”
Section: Methodsmentioning
confidence: 99%
“…We measured linkage disequilibrium by calculating the squared correlation coefficient between unphased genotypes, using PLINK version 1.90b3.45 (ref. 69 ). We started with three datasets: 'neutral' loci (a subsample of SNPs that showed no significant correlation with depth); 'type 1' outliers (the 9 SNPs that are differentiated between 1,000 m and 1,800 m, and which code for non-synonymous changes within coding genes); and 'type 2' outliers (the remaining 337 SNPs that are differentiated between 1,000 m and 1,800 m).…”
Section: Nature Ecology and Evolutionmentioning
confidence: 99%