R. Alhajj scite author profile

It is not an easy task to know a priori the most appropriate fuzzy sets that cover the domains of quantitative attributes for fuzzy association rules mining, simply because characteristics of quantitative data are in general unknown. Besides, it is unrealistic that the most appropriate fuzzy sets can always be provided by domain experts. Motivated by this, in this paper we propose an automated method for mining fuzzy association rules. For this purpose, we first present a genetic algorithm (GA) based clustering method that adjusts centroids of the clusters, which are to be handled later as midpoints of triangular membership functions. Next, we give a different method for generating the membership functions by using Clustering Using Representatives (CURE) clustering algorithm, which is known as one of the most efficient clustering algorithms described in the literature. Finally, we compared the proposed GA-based approach with other approaches from the literature. Experiments conducted on 100K transactions from the US census in the year 2000 show that the proposed method exhibits a good performance in terms of execution time and interesting fuzzy association rules.

show abstract

A comprehensive survey of numeric and symbolic outlier mining techniques

Agyemang

Barker

Alhajj

2006

IDA

119

View full text Add to dashboard Cite

Employing Clustering Techniques for Automatic Information Extraction From HTML Documents

Ashraf

Özyer

Alhajj

2008

IEEE Trans. Syst., Man, Cybern. C

View full text Add to dashboard Cite

In the past few years, there has been an exponential increase in the amount of information available on the World Wide Web. This plethora of information can be extremely beneficial for users. However, the amount of human intervention that is currently required for this is inconvenient. Information extraction (IE) systems try to solve this problem by making the task as automatic as possible. Most of the existing approaches, however, require user feedback in one form or another during the extraction. This paper proposes a system that employs clustering techniques for automatic IE from HTML documents containing semistructured data. Using domain-specific information provided by the user, the proposed system parses and tokenizes the data from an HTML document, partitions it into clusters containing similar elements, and estimates an extraction rule based on the pattern of occurrence of data tokens. The extraction rule is then used to refine clusters, and finally, the output is reported. We employed a multiobjective genetic-algorithm-based clustering approach in the process; it is capable of finding the number of clusters and the most natural clustering. The proposed approach is tested by conducting experiments on a number of Web sites from different domains. To demonstrate the effectiveness of this approach, the results of the experiments are tested against those reported in the literature, and prove comparable.

show abstract

Facilitating fuzzy association rules mining by using multi-objective genetic algorithms for automated clustering

Kaya

Alhajj

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

R. Alhajj

Genetic algorithm based framework for mining fuzzy association rules

A comprehensive survey of numeric and symbolic outlier mining techniques

Employing Clustering Techniques for Automatic Information Extraction From HTML Documents

Facilitating fuzzy association rules mining by using multi-objective genetic algorithms for automated clustering

Contact Info

Product

Resources

About