2011
DOI: 10.1186/1753-6561-5-s9-s100
|View full text |Cite
|
Sign up to set email alerts
|

A LASSO-based approach to analyzing rare variants in genetic association studies

Abstract: Genetic markers with rare variants are spread out in the genome, making it necessary and difficult to consider them in genetic association studies. Consequently, wisely combining rare variants into “composite” markers may facilitate meaningful analyses. In this paper, we propose a novel approach of analyzing rare variant data by incorporating the least absolute shrinkage and selection operator technique. We applied this method to the Genetic Analysis Workshop 17 data, and our results suggest that this new appr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2011
2011
2016
2016

Publication Types

Select...
5

Relationship

2
3

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 14 publications
0
6
0
Order By: Relevance
“…Each of the 200 simulated data sets includes the following information for each individual: case-control status, three continuous quantitative traits (Q1, Q2, Q4), and three phenotypic features (Age, Smoking status, and Sex). We use a multidimensional scaling analysis based on genome-wide pairwise identity-by-state distances computed in PLINK [ 14 ] to determine three main continental population strata: African (Luhya, Luhya-additional, Yoruba-1, Yoruba-2, Yoruba-additional), Asian (Denver Chinese, Denver Chinese-additional, Han Chinese-1, Han Chinese-2, Han Chinese-additional, Japanese-1, Japanese-2, Japanese-additional), and European (CEPH-1, CEPH-2, Tuscan, and Tuscan-additional) [ 15 , 16 ]. We then generate three binary features to include in our model, assigning patients to their corresponding Asian, European, and African populations.…”
Section: Methodsmentioning
confidence: 99%
“…Each of the 200 simulated data sets includes the following information for each individual: case-control status, three continuous quantitative traits (Q1, Q2, Q4), and three phenotypic features (Age, Smoking status, and Sex). We use a multidimensional scaling analysis based on genome-wide pairwise identity-by-state distances computed in PLINK [ 14 ] to determine three main continental population strata: African (Luhya, Luhya-additional, Yoruba-1, Yoruba-2, Yoruba-additional), Asian (Denver Chinese, Denver Chinese-additional, Han Chinese-1, Han Chinese-2, Han Chinese-additional, Japanese-1, Japanese-2, Japanese-additional), and European (CEPH-1, CEPH-2, Tuscan, and Tuscan-additional) [ 15 , 16 ]. We then generate three binary features to include in our model, assigning patients to their corresponding Asian, European, and African populations.…”
Section: Methodsmentioning
confidence: 99%
“…Three work groups [Brennan et al, 2011; Niu et al, 2011; Wang et al, 2011] aimed to generate a linear combination of genotypes within a gene and relied on various regression techniques to test for association of the trait with individual genes. Jung et al [2011] used the count of rare variants within a gene as the surrogate for the gene, but in their association test this count became the response variable and the phenotype became the independent variable in a zero-inflated Poisson regression model.…”
Section: Methods For Collapsing Rare Variantsmentioning
confidence: 99%
“…Brennan et al [2011] considered a two-step approach of analyzing rare variant data by incorporating the least absolute shrinkage and selection operator (LASSO) technique (reviewed by Dasgupta et al [2011]). In the first step, Brennan and colleagues c nducted a gene-level screening on SNPs within each gene using the LASSO method.…”
Section: Methods For Collapsing Rare Variantsmentioning
confidence: 99%
“…Commonly used methods include the collapsing, simple-sum, and weighted-sum methods [25]. They first collapse rare variants and then implement a LASSO (least absolute shrinkage and selection operator) [68], partial least squares regression (PLS) model [9], or other supporting statistical methods using the common variants and the collapsed rare variants [10]. …”
Section: Introductionmentioning
confidence: 99%