Landslides constitute a severe environmental problem in Jiangxi, China. This research was aimed at conducting landslide hazard assessment to provide technical support for disaster reduction and prevention action in the province. Fourteen geo-environmental factors, e.g., slope, elevation, road, river, fault, lithology, rainfall, and land cover types, were selected for this study. A test was made in two cases: (1) only based on the main linear features, e.g., main rivers and roads, and (2) with detailed complete linear features including all levels of roads and rivers. After buffering of the linear features, an information value (IV) analysis was applied to quantify the distribution of the observed landslides for each subset of the 14 factors. The results were inputted into the binary logistic regression model (LRM) for landslide risk modeling, taking the known landslide points as a training set (70% of the total 9,525 points). The calculated probability of a landslide was further classified into five grades with an interval of 0.2 for hazard mapping: very high (3.70%), high (4.05%), moderate (18.72%), low (27.17%), and stable zones (46.36%). The accuracy was evaluated by AUC [the area under the receiver operating characteristic (ROC) curve] vs. the validation set (30%, the remaining landslides). The final results show that with increasing the completeness of the linear features, the modeling reliability also significantly increased. We hence concluded that the tested methodology is capable of achieving the landslide hazard prediction at regional scale, and the results may provide technical support for geohazard reduction and prevention in the studied province.
Landslides are one of the major geohazards threatening human society. The objective of this study was to conduct a landslide hazard susceptibility assessment for Ruijin, Jiangxi, China, and to provide technical support to the local government for implementing disaster reduction and prevention measures. Machine learning approaches, e.g., random forests (RFs) and support vector machines (SVMs) were employed and multiple geo-environmental factors such as land cover, NDVI, landform, rainfall, lithology, and proximity to faults, roads, and rivers, etc., were utilized to achieve our purposes. For categorical factors, three processing approaches were proposed: simple numerical labeling (SNL), weight assignment (WA)-based and frequency ratio (FR)-based. Then 19 geo-environmental factors were respectively converted into raster to constitute three 19-band datasets, i.e., DS1, DS2, and DS3 from three different processes. Then, 155 observed landslides that occurred in the past decades were vectorized, among which 70% were randomly selected to compose a training set (TS1) and the remaining 30% to form a validation set (VS1). A number of non-landslide (no-risk) samples distributed in the whole study area were identified in low slope (<1–3°) zones such as urban areas and croplands, and also added to the TS1 and VS1 in the same ratio. For comparison, we used the FR approach to identify the no-risk samples in both flat and non-flat areas, and merged them into the field-observed landslides to constitute another pair of training and validation sets (TS2 and VS2) using the same ratio of 7:3. The RF algorithm was applied to model the probability of the landslide occurrence using DS1, DS2, and DS3 as predictive variables and TS1 and TS2 for training to obtain the SNL-based, WA-based, and FR-based RF models, respectively. Verified against VS1 and VS2, the three models have similar overall accuracy (OA) and Kappa coefficient (KC), which are 89.61%, 91.47%, and 94.54%, and 0.7926, 0.8299, and 0.8908, respectively. All of them are much better than the three models obtained by SVM algorithm with OA of 81.79%, 82.86%, and 83%, and KC of 0.6337, 0.655, and 0.660. New case verification with the recent 26 landslide events of 2017–2020 revealed that the landslide susceptibility map from WA-based RF modeling was able to properly identify the high and very high susceptibility zones where 23 new landslides had occurred, and performed better than the SNL-based and FR-based RF modeling, though the latter has a slightly higher OA and KC. Hence, we concluded that all three RF models achieve reasonable risk prediction, but WA-based and FR-based RF modeling deserves a recommendation for application elsewhere. The results of this study may serve as reference for the local authorities in prevention and early warning of landslide hazards.
Motivation Breast cancer is one of the leading causes of cancer deaths among women worldwide. It is necessary to develop new breast cancer drugs because of the shortcomings of existing therapies. The traditional discovery process is time-consuming and expensive. Repositioning of clinically approved drugs has emerged as a novel approach for breast cancer therapy. However, serendipitous or experiential repurposing cannot be used as a routine method. Results In this study, we proposed a graph neural network model GraphRepur based on GraphSAGE for drug repurposing against breast cancer. GraphRepur integrated two major classes of computational methods, drug network-based and drug signature-based. The differentially expressed genes of disease, drug-exposure gene expression data, and the drug-drug links information were collected. By extracting the drug signatures and topological structure information contained in the drug relationships, GraphRepur can predict new drugs for breast cancer, outperforming previous state-of-the-art approaches and some classic machine learning methods. The high-ranked drugs have indeed been reported as new uses for breast cancer treatment recently. Availability The source code of our model and datasets are available at: https://github.com/cckamy/GraphRepur and https://figshare.com/articles/software/GraphRepur_Breast_Cancer_Drug_Repurposing/14220050 Supplementary information Supplementary data are available at Bioinformatics online.
Reliable prediction of landslide occurrence is important for hazard risk reduction and prevention. Taking Guixi in northeast Jiangxi as an example, this research aimed to conduct such a landslide risk assessment using a multiple logistic regression (MLR) algorithm. Field-investigated landslides and non-landslide sites were converted into polygons. We randomly generated 50,000 sampling points to intersect these polygons and the intersected points were divided into two parts, a training set (TS) and a validation set (VT) in a ratio of 7 to 3. Thirteen geo-environmental factors, including elevation, slope, and distance from roads were employed as hazard-causative factors, which were intersected by the TS to create the random point (RP)-based dataset. The next step was to compute the certainty factor (CF) of each factor to constitute a CF-based dataset. MLR was applied to the two datasets for landslide risk modeling. The probability of landslides was then calculated in each pixel, and risk maps were produced. The overall accuracy of these two models versus VS was 91.5% and 90.4% with a Kappa coefficient of 0.814 and 0.782, respectively. The RP-based MLR modeling achieved more reliable predictions and its risk map seems more plausible for providing technical support for implementing disaster prevention measures in Guixi.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.