In essence, data mining consists of extracting knowledge from data. This paper proposes a co-evolutionary system for discovering fuzzy classification rules. The system uses two evolutionary algorithms: a genetic programming (GP) algorithm evolving a population of fuzzy rule sets and a simple evolutionary algorithm evolving a population of membership function definitions. The two populations co-evolve, so that the final result of the co-evolutionary process is a fuzzy rule set and a set of membership function definitions which are well adapted to each other. In addition, our system also has some innovative ideas with respect to the encoding of GP individuals representing rule sets. The basic idea is that our individual encoding scheme incorporates several syntactical restrictions that facilitate the handling of rule sets in disjunctive normal form. We have also adapted GP operators to better work with the proposed individual encoding scheme.
Abstract. For a given data set, its set of attributes defines its data space representation. The quality of a data space representation is one of the most important factors influencing the performance of a data mining algorithm. The attributes defining the data space can be inadequate, making it difficult to discover highquality knowledge. In order to solve this problem, this paper proposes a Genetic Programming algorithm developed for attribute construction. This algorithm constructs new attributes out of the original attributes of the data set, performing an important preprocessing step for the subsequent application of a data mining algorithm.
Objective: To conduct a geospatial analysis of suicide deaths among young people in the state of Paraná , southern Brazil, and evaluate their association with socioeconomic and spatial determinants. Methods: Data were obtained from the Mortality Information System and the Brazilian Institute of Geography and Statistics. Data on suicide mortality rates (SMR) were extracted for three age groups (15-19, 20-24, and 25-29 years) from two 5-year periods (1998-2002 and 2008-2012). Geospatial data were analyzed through exploratory spatial data analysis. We applied Bayesian networks algorithms to explore the network structure of the socioeconomic predictors of SMR. Results: We observed spatial dependency in SMR in both periods, revealing geospatial clusters of high SMR. Our results show that socioeconomic deprivation at the municipality level was an important determinant of suicide in the youth population in Paraná , and significantly influenced the formation of high-risk SMR clusters. Conclusion: While youth suicide is multifactorial, there are predictable geospatial and sociodemographic factors associated with high SMR among municipalities in Paraná. Suicide among youth aged 15-29 occurs in geographic clusters which are associated with socioeconomic deprivation. Rural settings with poor infrastructure and development also correlate with increased SMR clusters.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.