Objective
We aimed to identify biomarkers for atrial fibrillation (AF) using machine learning methods and to characterize immune cell infiltration in patients with AF.
Methods
Two AF-related datasets (GSE41177 and GSE79768) were downloaded from the Gene Expression Omnibus (GEO) database and merged for further analysis. Differentially expressed genes (DEGs) were identified using the "limma" package in R. Candidate biomarkers for AF were selected with two machine learning methods, the LASSO regression algorithm and the SVM-RFE algorithm. Receiver operating characteristic (ROC) curves were used to assess the diagnostic performance of the biomarkers, which was further validated in an independent dataset (GSE14975). In addition, CIBERSORT was used to estimate the proportion of infiltrating immune cells in each sample, and Spearman correlation was used to explore the relationship between the biomarkers and immune cells.
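As an illustration of the ROC step, the area under the ROC curve (AUC) for a single biomarker can be computed as the probability that a randomly chosen AF sample has a higher expression value than a randomly chosen SR sample. The following is a minimal sketch in plain Python, not the authors' R workflow; the function name `roc_auc` and the expression values are hypothetical and chosen only for illustration.

```python
def roc_auc(scores, labels):
    """AUC via the Mann-Whitney formulation.

    scores: expression values of one candidate gene (hypothetical data).
    labels: 1 for AF, 0 for sinus rhythm (SR).
    Returns the fraction of (AF, SR) pairs in which the AF sample has the
    higher score; tied pairs count as one half.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one sample of each class")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical expression values for one gene in eight atrial samples.
expression = [7.9, 8.4, 6.1, 8.8, 6.5, 7.2, 6.0, 8.1]
is_af = [1, 1, 0, 1, 0, 1, 0, 0]

print(roc_auc(expression, is_af))  # 14 of 16 pairs ranked correctly: 0.875
```

An AUC near 1 indicates that the gene separates AF from SR samples well, whereas a value near 0.5 indicates no discrimination.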
Results
A total of 129 DEGs were identified, and CYBB, CXCR2, and S100A4 were identified as key biomarkers of AF by the LASSO regression and SVM-RFE algorithms. In both the training dataset and the validation dataset, CYBB, CXCR2, and S100A4 showed favorable diagnostic performance. Immune infiltration analysis indicated that, compared with sinus rhythm (SR) samples, atrial samples from patients with AF contained higher proportions of gamma delta T cells, neutrophils, and resting mast cells, whereas the proportion of follicular helper T cells was lower. Correlation analysis demonstrated that CYBB, CXCR2, and S100A4 were significantly correlated with infiltrating immune cells.
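The correlation between a biomarker and an immune cell type can be illustrated with the Spearman coefficient, which is the Pearson correlation of the rank-transformed values. The following is a minimal sketch in plain Python under that definition; the helper names and the example values (gene expression and an estimated neutrophil fraction) are hypothetical and are not data from the study.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman rho: Pearson correlation computed on average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical values for five samples.
gene_expression = [6.0, 6.5, 7.2, 7.9, 8.4]
neutrophil_fraction = [0.02, 0.05, 0.04, 0.09, 0.11]

print(round(spearman(gene_expression, neutrophil_fraction), 3))  # 0.9
```

Because the coefficient depends only on ranks, it captures monotonic associations without assuming a linear relationship between expression and cell fraction.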
Conclusions
This study suggests that CYBB, CXCR2, and S100A4 are key biomarkers of AF that correlate with infiltrating immune cells, and that infiltrating immune cells play pivotal roles in AF.