Depression has become one of the most common mental illnesses, and the widespread use of social media provides new ideas for detecting various mental illnesses. The purpose of this study is to use machine learning technology to detect users of depressive patients based on user-shared content and posting behaviors in social media. At present, the existing research mostly uses a single detection method, and the unbalanced class distribution often leads to a low recognition rate. In addition, a large number of irrelevant or redundant features in high-dimensional data sets interfere with the accuracy of recognition. To solve this problem, this paper proposes a hybrid feature selection and stacking ensemble strategy for depression user detection. First, recursive elimination method and extremely randomized trees method are used to calculate feature importance and mutual information value, calculate feature weight vector, and select the optimal feature subset according to the feature weight. Second, naive bayes, k-nearest neighbor, regularized logistic regression and support vector machine are used as base learners, and a simple logistic regression algorithm is used as a combination strategy to build a stacking model. Experimental results show that compared with other machine learning algorithms, the proposed hybrid method, which integrates feature selection and ensemble, has a higher accuracy of 90.27% in identifying online patients. We believe this study will help develop new methods to identify depressed people in social networks, providing guidance for future research.
Suicide has become a serious problem, and how to prevent suicide has become a very important research topic. Social media provides an ideal platform for monitoring suicidal ideation. This paper presents an integrated model for multidimensional information fusion. By integrating the best classification models determined by single and multiple features, different feature information is combined to better identify suicidal posts in online social media. This approach was assessed with a dataset formed from 40,222 posts annotated by Weibo. By integrating the best classification model of single features and multidimensional features, the proposed model ((BSC + RFS)-fs, WEC-fs) achieved 80.61% accuracy and a 79.20% F1-score. Other representative text information representation methods and demographic factors related to suicide may also be important predictors of suicide, which were not considered in this study. To the best of our knowledge, this is the good try that feature combination and ensemble algorithms have been fused to detect user-generated content with suicidal ideation. The findings suggest that feature combinations do not always work well, and that an appropriate combination strategy can make classification models work better. There are differences in the information contained in different functional carriers, and a targeted choice classification model may improve the detection rate of suicidal ideation.
Social media platforms provide unique insights into mental health issues, but a large number of related studies have focused on English text information. The purpose of this paper is to identify the posting content and posting behaviors of users with depression on Chinese social media. These clues may suggest signs of depression. We created two data sets consisting of 130 users with diagnosed depression and 320 other users that were randomly selected. By comparing and analyzing the two data sets, we can observe more closely how users reveal their signs of depression on Chinese social platforms. The results show that the distribution of some Chinese speech users with depression is significantly different from that of other users. Emotional sadness, fear and disgust are more common in the depression class. For personal pronouns, negative words and interrogative words, there are also great differences between the two data sets. Using topic modeling, we found that patients mainly discussed seven topics: negative emotion fluctuation, disease treatment and somatic responses, sleep disorders, sense of worthlessness, suicidal extreme behavior, seeking emotional support and interpersonal communication. The depression class post negative polarity posts much more frequently than other users. The frequency and characteristics of posts also reveal certain characteristics, such as sleep problems and reduced self-disclosure. In this study, we used Chinese microblog data to conduct a detailed analysis of the users showing depression signs, which helps to identify more patients with depression. At the same time, the study can provide a further theoretical basis for cross-cultural research of different language groups in the field of psychology.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.