BackgroundBipolar disorder (BD) is easy to be misdiagnosed as major depressive disorder (MDD), which may contribute to a delay in treatment and affect prognosis. Circadian rhythm dysfunction is significantly associated with conversion from MDD to BD. So far, there has been no study that has revealed a relationship between circadian rhythm gene polymorphism and MDD-to-BD conversion. Furthermore, the prediction of MDD-to-BD conversion has not been made by integrating multidimensional data. The study combined clinical and genetic factors to establish a predictive model through machine learning (ML) for MDD-to-BD conversion.MethodBy following up for 5 years, 70 patients with MDD and 68 patients with BD were included in this study at last. Single nucleotide polymorphisms (SNPs) of the circadian rhythm genes were selected for detection. The R software was used to operate feature screening and establish a predictive model. The predictive model was established by logistic regression, which was performed by four evaluation methods.ResultsIt was found that age of onset was a risk factor for MDD-to-BD conversion. The younger the age of onset, the higher the risk of BD. Furthermore, suicide attempts and the number of hospitalizations were associated with MDD-to-BD conversion. Eleven circadian rhythm gene polymorphisms were associated with MDD-to-BD conversion by feature screening. These factors were used to establish two models, and 4 evaluation methods proved that the model with clinical characteristics and SNPs had the better predictive ability.ConclusionThe risk factors for MDD-to-BD conversion have been found, and a predictive model has been established, with a specific guiding significance for clinical diagnosis.