Landslide prediction is complex and involves many factors, such as geotechnical, geological, topographical, and even meteorological. This work presents a methodology by using a Data Mining approach in order to predict landslide occurrences induced by rainfall in Rio de Janeiro city. Landslide and rain data records from 1998 to 2001 were obtained from field technical reports and 30 automatic rain gauges, respectively. It was also collected data regarding soil parameters, including urban areas, forest, vulnerability, among others, and totalizing 46 soil variables. All the information was inserted into a Geographic Information Systems. Clustering (Dendrogram and k-means) andStatistical (Principal Component Analysis and Correlation) techniques were used to regionalize the rain data and select the rain gauges to be input on Artificial Neural Networks , which were used to replace the missing rain values. The landslide volume variable also presented missing values and it was completed by the k-Nearest Neighbor method. After data preparation, some models were built to predict landslide and rainfall using Data Mining techniques. The obtained model's performance is also analyzed.
There is an ongoing need for scientific analysis to help governments and public health authorities make decisions regarding the COVID-19 pandemic. This article presents a methodology based on data mining that can offer support for coping with epidemic diseases. The methodological approach was applied in São Paulo, Rio de Janeiro and Manaus, the cities in Brazil with the most COVID-19 deaths until the first half of 2021. We aimed to predict the evolution of COVID-19 in metropolises and identify air quality and meteorological variables correlated with confirmed cases and deaths. The statistical analyses indicated the most important explanatory environmental variables, while the cluster analyses showed the potential best input variables for the forecasting models. The forecast models were built by two different algorithms and their results have been compared. The relationship between epidemiological and environmental variables was particular to each of the three cities studied. Low solar radiation periods predicted in Manaus can guide managers to likely increase deaths due to COVID-19. In São Paulo, an increase in the mortality rate can be indicated by drought periods. The developed models can predict new cases and deaths by COVID-19 in studied cities. Furthermore, the methodological approach can be applied in other cities and for other epidemic diseases.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.