Bipolar disorder diagnosis(BPD) is associated with great imprecision and uncertainty and requires a reliable diagnostic measure. Recently, machine learning techniques, named support vector machine (SVM), random forest, and K-nearest neighbor (KNN), have been combined with neuroimaging methods to help diagnose BPD. This study aimed to predict BPD using SVM, RF, and KNN classifier models based on graph theory values of the whole brain's global function and gray matter volume using the data integration method.
Methodology: In this study, we used data from 49 patients with bipolar disorder and 49 healthy. In this method, we used the global efficiency scale and brain gray matter volume for integration into the concatenation method.
Results: For the combined dataset, the SVM model had an accuracy of 0.85, a sensitivity of 0.92, and a specificity of 0.78, The random forest model had an accuracy of 0.89, a sensitivity of 0.88, and a specificity of 0.91; the KNN model had an accuracy of 0.82, a sensitivity of 0.84, and a specificity of 0.80. In the global-efficiency dataset, the SVM model had an accuracy of 0.85, a sensitivity of 0.82, and a specificity of 0.87; The random forest model had an accuracy of 0.82 a sensitivity of 0.78, and a specificity of 0.86; The KNN model has an achieved of 0.78, a sensitivity of 0.79, and a specificity of 0.81. In the gray matter volume dataset, all three models (SVM, Random Forest, and KNN) performed similarly, with accuracies, sensitivities, and specificities ranging from 0.51 to 0.52.
The results show that the combined gray matter and global-efficiency data group yielded the highest accuracy for all three models. The random forest model consistently performed well on all datasets and demonstrated robustness when handling brain imaging data. However, note that the performance of the models varies depending on the data type used. Therefore, model selection should be tailored to the specific characteristics of the given dataset.