More and more people are under high pressure in modern society, leading to growing mental disorders, such as antenatal depression for pregnant women. Antenatal depression can affect pregnant woman’s physical and psychological health and child outcomes, and cause postpartum depression. Therefore, it is essential to detect the antenatal depression of pregnant women early. This study aims to predict pregnant women’s antenatal depression and identify factors that may lead to antenatal depression. First, a questionnaire was designed, based on the daily life of pregnant women. The survey was conducted on pregnant women in a hospital, where 5666 pregnant women participated. As the collected data is unbalanced and has high dimensions, we developed a one-class classifier named Stacked Auto Encoder Support Vector Data Description (SAE-SVDD) to distinguish depressed pregnant women from normal ones. To validate the method, SAE-SVDD was firstly applied on three benchmark datasets. The results showed that SAE-SVDD was effective, with its F-scores better than other popular classifiers. For the antenatal depression problem, the F-score of SAE- SVDD was higher than 0.87, demonstrating that the questionnaire is informative and the classification method is successful. Then, by an improved Term Frequency-Inverse Document Frequency (TF-IDF) analysis, the critical factors of antenatal depression were identified as work stress, marital status, husband support, passive smoking, and alcohol consumption. With its generalizability, SAE-SVDD can be applied to analyze other questionnaires.
Oil drilling is the core process of oil and natural gas resources exploitation. Well overflow is one of the biggest threats to safety drilling. Prediction of the overflow in advance can effectively avoid the occurrence of this kind of accident. However, the drilling history has unbalanced distribution, and labeling data is a time-consuming and laborious job. To address this issue, an overflow-prediction algorithm based on semi-supervised learning is designed in this paper, which can accurately predict overflow 10 min in advance when the labeled data are limited. Firstly, a three-step feature-selection algorithm is conducted to extract 22 features, and the time series samples are constructed through a 500-width sliding window with step size 1. Then, the Mean Teacher model with Jitter noise is employed to train the labeled and unlabeled data at the same time, in which a fused CNN-LSTM network is built for time-series prediction. Compared with supervised learning and other semi-supervised learning frameworks, the results show that the proposed model based on only 200 labeled samples is able to achieve the same effect of supervised learning method using 1000 labeled samples, and the prediction accuracy can reach 87.43% 10 min in advance. With the increase of the proportion of unlabeled samples, the performance of the model can sustain a rise within a certain range.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.