Data mining is the study of data for relationships that have not previously been discovered. In sociology, discrimination is the hurtful treatment of an individual based on the group, class or category to which that person or things belongs rather than on individual merit: racial and religious intolerance and discrimination. Along with confidentiality, discrimination is a very essential issue when considering the legal and ethical aspects of data mining. It is more than obvious that most people do not want to be discriminated because of their race, gender, religion, nationality, age etc, especially when those attributes are used for making decisions about them like giving them a job, loan, education, insurance etc. Because of this reason, antidiscrimination techniques with discrimination discovery as well as discrimination prevention have been introduced in data mining. Discrimination can be either direct or indirect. Direct discrimination occurs when decisions are taken by considering sensitive attributes. Indirect discrimination occurs when decisions are taken on the basis of nonsensitive attributes which are strongly associated with biased sensitive ones. Here, discrimination prevention in data mining is tackle as well as propose new techniques applicable for direct or indirect discrimination prevention individually or both at the same time. Several decision-making tasks are there which let somebody use themselves to discrimination, such as education, life insurances, loan granting, and staff selection. In many applications, information systems are used for decision-making tasks.
KeywordsData mining, direct and indirect discrimination prevention.
Web usage mining is the method of extracting interesting patterns from Web usage log file. Web usage mining is subfield of data mining uses various data mining techniques to produce association rules. Data mining techniques are used to generate association rules from transaction data. Most of the time transactions are boolean transactions, whereas Web usage data consists of quantitative values. To handle these real world quantitative data we used fuzzy data mining algorithm for extraction of association rules from quantitative Web log file. To generate fuzzy association rules first we designed membership function. This membership function is used to transform quantitative values into fuzzy terms. Experiments are carried out on different support and confidence. Experimental results show the performance of the algorithm with varied supports and confidence.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.