2022
DOI: 10.1101/2022.04.15.22273922
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Linking Genotype to Phenotype: Further Exploration of Mutations in SARS-CoV-2 Associated with Mild or Severe Outcomes

Abstract: We previously interrogated the relationship between SARS-CoV-2 genetic mutations and associated patient outcomes using publicly available data downloaded from GISAID in October 2020 [1]. Using high-level patient data included in some GISAID submissions, we were able to aggregate patient status values and differentiate between severe and mild COVID-19 outcomes. In our previous publication, we utilized a logistic regression model with an L1 penalty (Lasso regularization) and found several statistically significa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 20 publications
0
2
0
Order By: Relevance
“…Some studies have used methods to analyze one variant at a time, including chi-square or Fisher exact tests that do not account for genetic correlations (2, 3), multiple logistic regression with covariates to capture genetic background like in many human GWAS (4, 14, 15), and phylogeny-based association tests (6). Other studies have used analytical approaches to investigate multiple variant loci at a time, including multiple logistic regression with multiple variants as features (5, 17), neural network approaches (7, 8), random forests (16, 17), and XGBoost (7, 8).…”
Section: Introductionmentioning
confidence: 99%
“…Some studies have used methods to analyze one variant at a time, including chi-square or Fisher exact tests that do not account for genetic correlations (2, 3), multiple logistic regression with covariates to capture genetic background like in many human GWAS (4, 14, 15), and phylogeny-based association tests (6). Other studies have used analytical approaches to investigate multiple variant loci at a time, including multiple logistic regression with multiple variants as features (5, 17), neural network approaches (7, 8), random forests (16, 17), and XGBoost (7, 8).…”
Section: Introductionmentioning
confidence: 99%
“…However, over time even well-performing classifiers began to perform worse than they did before in updated studies, in particular after Omicron began to emerge [ 36 ]. One approach did find a mutational signature using random forests based on mild/severe disease classifications, but the mutations it found like V1176F and L5F in the Spike gene have not been validated [ 37 ].…”
Section: Introductionmentioning
confidence: 99%