We aimed to develop a machine learning algorithm to screen for depression and assess severity based on data from wearable devices. Methods: We used a wearable device that calculates steps, energy expenditure, body movement, sleep time, heart rate, skin temperature, and ultraviolet light exposure. Depressed patients and healthy volunteers wore the device continuously for the study period. The modalities were compared hourly between patients and healthy volunteers. XGBoost was used to build machine learning models and 10-fold cross-validation was applied for the validation. Results: Forty-five depressed patients and 41 healthy controls participated, creating a combined 5,250 days' worth of data. Heart rate, steps, and sleep were significantly different between patients and healthy volunteers in some comparisons. Similar differences were also observed longitudinally when patients' symptoms improved. Based on seven days' data, the model identified symptomatic patients with 0.76 accuracy and predicted Hamilton Depression Rating Scale-17 scores with a 0.61 correlation coefficient. Skin temperature, sleep time-related features, and the correlation of those modalities were the most significant features in machine learning. Limitations: The small number of subjects who participated in this study may have weakened the statistical significance of the study. There are differences in the demographic data among groups although we performed a correction for multiple comparisons. Validation in independent datasets was not performed, although 10-fold cross validation with the internal data was conducted.
Conclusion:The results indicated that utilizing wearable devices and machine learning may be useful in identifying depression as well as assessing severity.
Background
Previous studies have attempted to characterize depression using electroencephalography (EEG), but results have been inconsistent. New noise reduction technology allows EEG acquisition during conversation.
Methods
We recorded EEG from 40 patients with depression as they engaged in conversation using a single-channel EEG device while conducting real-time noise reduction and compared them to those of 40 healthy subjects. Differences in EEG between patients and controls, as well as differences in patients’ depression severity, were examined using the ratio of the power spectrum at each frequency. In addition, the effects of medications were examined in a similar way.
Results
In comparing healthy controls and depression patients, significant power spectrum differences were observed at 3 Hz, 4 Hz, and 10 Hz and higher frequencies. In the patient group, differences in the power spectrum were observed between asymptomatic patients and healthy individuals, and between patients of each respective severity level and healthy individuals. In addition, significant differences were observed at multiple frequencies when comparing patients who did and did not take antidepressants, antipsychotics, and/or benzodiazepines. However, the power spectra still remained significantly different between non-medicated patients and healthy individuals.
Limitations
The small sample size may have caused Type II error. Patients’ demographic characteristics varied. Moreover, most patients were taking various medications, and cannot be compared to the non-medicated control group.
Conclusion
A study with a larger sample size should be conducted to gauge reproducibility, but the methods used in this study could be useful in clinical practice as a biomarker of depression.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.