Background
As a common mental disease, depression seriously affects people’s physical and mental health. According to the statistics of the World Health Organization, depression is one of the main reasons for suicide and self-harm events in the world. Therefore, strengthening depression detection can effectively reduce the occurrence of suicide or self-harm events so as to save more people and families. With the development of computer technology, some researchers are trying to apply natural language processing techniques to detect people who are depressed automatically. Many existing feature engineering methods for depression detection are based on emotional characteristics, but these methods do not consider high-level emotional semantic information. The current deep learning methods for depression detection cannot accurately extract effective emotional semantic information.
Objective
In this paper, we propose an emotion-based attention network, including a semantic understanding network and an emotion understanding network, which can capture the high-level emotional semantic information effectively to improve the depression detection task.
Methods
The semantic understanding network module is used to capture the contextual semantic information. The emotion understanding network module is used to capture the emotional semantic information. There are two units in the emotion understanding network module, including a positive emotion understanding unit and a negative emotion understanding unit, which are used to capture the positive emotional information and the negative emotional information, respectively. We further proposed a dynamic fusion strategy in the emotion understanding network module to fuse the positive emotional information and the negative emotional information.
Results
We evaluated our method on the Reddit data set. The experimental results showed that the proposed emotion-based attention network model achieved an accuracy, precision, recall, and F-measure of 91.30%, 91.91%, 96.15%, and 93.98%, respectively, which are comparable results compared with state-of-the-art methods.
Conclusions
The experimental results showed that our model is competitive with the state-of-the-art models. The semantic understanding network module, the emotion understanding network module, and the dynamic fusion strategy are effective modules for depression detection. In addition, the experimental results verified that the emotional semantic information was effective in depression detection.
Short-term metro passenger flow prediction is vital for the operation and management of metro systems. Most studies focus on the higher prediction accuracy with statistical and machine learning methods, but little attention has been paid to the prioritization and selection of feature variables, especially for different metro station types. This study aims to analyze the effect of feature variables on the prediction results, and then select appropriate predictor variables accordingly. A novel three-stage framework is proposed to prioritize feature variables for short-term metro passenger flow prediction, including station clustering, feature extraction, and variable prioritization. A hierarchical clustering algorithm (AHC) is developed for station clustering, the results of which are verified by the K-means and Davies-Bouldin (DB) statistical index. We then extract the temporal, spatial, and external features. Finally, the association between the variables and the prediction results is explored using tree-based models. The proposed framework is demonstrated and validated with data collected from Shanghai Metro Automatic Fare Collection (AFC) system. The results highlight that the importance of feature variables for developing models varies between stations, whereas only a few variables are found to explain most of the variation in the testing dataset; different feature variables lead to distinct differences in prediction accuracy, and simply adding more predictor variables does not necessarily lead to higher prediction accuracy. In addition, the station type and prediction type (i.e., tap-in and tap-out) have little influence on the selection of feature variables.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.