The increasing suicide rates amongst youth and its high correlation with suicidal ideation expression on social media warrants a deeper investigation into models for the detection of suicidal intent in text such as tweets to enable prevention. However, the complexity of the natural language constructs makes this task very challenging. Deep Learning architectures such as LSTMs, CNNs, and RNNs show promise in sentence level classification problems. This work investigates the ability of deep learning architectures to build an accurate and robust model for suicidal ideation detection and compares their performance with standard baselines in text classification problems. The experimental results reveal the merit in C-LSTM based models as compared to other deep learning and machine learning based classification models for suicidal ideation detection.
Technological advancements in the World Wide Web and social networks in particular coupled with an increase in social media usage has led to a positive correlation between the exhibition of Suicidal ideation on websites such as Twitter and cases of suicide. This paper proposes a novel supervised approach for detecting suicidal ideation in content on Twitter. A set of features is proposed for training both linear and ensemble classifiers over a dataset of manually annotated tweets. The performance of the proposed methodology is compared against four baselines that utilize varying approaches to validate its utility. The results are finally summarized by reflecting on the effect of the inclusion of the proposed features one by one for suicidal ideation detection.
Groundwater pollution sources are characterized by spatially and temporally varying source locations, injection rates, and duration of activity. Concentration measurement data at specified observation locations are generally utilized to identify these sources characteristics. Identification of unknown groundwater pollution sources in terms of these source characteristics becomes more difficult in the absence of complete breakthrough curves of concentration history at all the time steps. If concentration observations are missing over a length of time after an unknown source has become active, it is even more difficult to correctly identify the unknown sources. An artificial neural network (ANN) based methodology is developed to identify these source characteristics for such a missing data scenario, when concentration measurement data over an initial length of time is not available. The source characteristics and the corresponding concentration measurements at time steps for which it is not missing, constitute a pattern for training the ANN. A groundwater flow and transport numerical simulation model is utilized to generate the necessary patterns for training the ANN. Performance evaluation results show that the back-propagation based ANN model is essentially capable of extracting hidden relationship between patterns of available concentration measurement values, and the corresponding sources characteristics, resulting in identification of unknown groundwater pollution sources. The performance of the methodology is also evaluated for different levels of noise (or measurement errors) in concentration measurement data at available time steps.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.