2021
DOI: 10.1155/2021/9939134
|View full text |Cite|
|
Sign up to set email alerts
|

[Retracted] Identifying COVID‐19‐Specific Transcriptomic Biomarkers with Machine Learning Methods

Abstract: COVID-19, a severe respiratory disease caused by a new type of coronavirus SARS-CoV-2, has been spreading all over the world. Patients infected with SARS-CoV-2 may have no pathogenic symptoms, i.e., presymptomatic patients and asymptomatic patients. Both patients could further spread the virus to other susceptible people, thereby making the control of COVID-19 difficult. The two major challenges for COVID-19 diagnosis at present are as follows: (1) patients could share similar symptoms with other respiratory i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
20
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
9

Relationship

2
7

Authors

Journals

citations
Cited by 20 publications
(20 citation statements)
references
References 82 publications
(110 reference statements)
0
20
0
Order By: Relevance
“…However, the field of genomics is struggling with the successful implementation of gold standard machine learning (ML) algorithms for clinically proven reproducible computational predictions [ 38 ]. It is necessary to automate the process of gene-variant data annotation, expression and simulation to produce timely presentable results [ 117 ]. Current limitations in this context imply gaps among clinics and fundamental basic and applied research; difficulties in getting exigent approvals and timeliness of data availability; levels of granularity in clinical information; and application of appropriate modeling strategies that allow learning in the data continuum [ 118 ].…”
Section: Discussionmentioning
confidence: 99%
“…However, the field of genomics is struggling with the successful implementation of gold standard machine learning (ML) algorithms for clinically proven reproducible computational predictions [ 38 ]. It is necessary to automate the process of gene-variant data annotation, expression and simulation to produce timely presentable results [ 117 ]. Current limitations in this context imply gaps among clinics and fundamental basic and applied research; difficulties in getting exigent approvals and timeliness of data availability; levels of granularity in clinical information; and application of appropriate modeling strategies that allow learning in the data continuum [ 118 ].…”
Section: Discussionmentioning
confidence: 99%
“…The method compares the value of the original features to the significance achievable at random, as indicated by their permuted copies, and gradually removes unnecessary features to stabilize the test. In the last few years, Boruta has been widely used in processing biological data ( Chen et al, 2021a ; Huang et al, 2021a ; Zhou et al, 2022 ).…”
Section: Methodsmentioning
confidence: 99%
“…High-throughput sequencing and data analysis provide convenience for understanding the immune cell characteristics of COVID-19 ( Chen et al, 2021a ; Li et al, 2021a ; Stephenson et al, 2021 ; Zhang et al, 2021 ). Based on the single-cell profiling of gene expression and surface proteins of 696,109 peripheral blood immune cells from 102 patients with COVID-19 having different disease severity and 41 control individuals, we used a machine learning statistical analysis to explore the expression characteristics of various immune cells in patients with COVID-19 and immune molecules related to the COVID-19 immunity mechanism.…”
Section: Introductionmentioning
confidence: 99%
“…Three sets of top 50 feature genes were selected by the three feature selection methods. Thereafter, we set up a classifier to filter optimal feature genes by methods described previously ( 18 ). A random forest classifier was constructed based on the python package “skfeature” ( 17 ).…”
Section: Methodsmentioning
confidence: 99%