2000
DOI: 10.1007/3-540-46439-5_23
Mining Classification Rules from Datasets with Large Number of Many-Valued Attributes

Abstract: Decision tree induction algorithms scale well to large datasets thanks to their univariate, divide-and-conquer approach. However, they may fail to discover effective knowledge when the input dataset consists of a large number of uncorrelated many-valued attributes. In this paper we present an algorithm, Noah, that tackles this problem by applying a multivariate search. Performing a multivariate search leads to a much larger consumption of computation time and memory, which may be prohibitive for large…
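The abstract's core contrast can be illustrated with a toy experiment. The sketch below is not the Noah algorithm; it is a minimal, hypothetical comparison showing why a univariate split criterion (one attribute at a time, as in standard decision tree induction) can find nothing useful when the class depends on a combination of attributes, while a multivariate search over attribute pairs succeeds. The dataset, scoring function, and exhaustive pair search are all illustrative assumptions.

```python
# Hypothetical sketch (NOT the Noah algorithm): univariate vs. multivariate
# attribute search on a toy dataset where the class is the XOR of two
# attributes, so no single attribute is informative on its own.
from itertools import combinations

# Toy records: (a, b, class). Class = a XOR b.
data = [(0, 0, 0), (0, 1, 1), (1, 0, 1), (1, 1, 0)] * 2

def purity(rows):
    """Fraction of rows carrying the majority class (1.0 = pure)."""
    if not rows:
        return 1.0
    ones = sum(c for *_, c in rows)
    return max(ones, len(rows) - ones) / len(rows)

def univariate_best(rows, n_attrs):
    """Best weighted purity achievable by splitting on ONE attribute."""
    best = 0.0
    for i in range(n_attrs):
        groups = {}
        for row in rows:
            groups.setdefault(row[i], []).append(row)
        score = sum(purity(g) * len(g) for g in groups.values()) / len(rows)
        best = max(best, score)
    return best

def multivariate_best(rows, n_attrs):
    """Best weighted purity using conjunctions of TWO attributes."""
    best = 0.0
    for i, j in combinations(range(n_attrs), 2):
        groups = {}
        for row in rows:
            groups.setdefault((row[i], row[j]), []).append(row)
        score = sum(purity(g) * len(g) for g in groups.values()) / len(rows)
        best = max(best, score)
    return best

print(univariate_best(data, 2))    # 0.5 — no single attribute helps
print(multivariate_best(data, 2))  # 1.0 — the attribute pair separates classes
```

As the abstract notes, the price of this power is cost: enumerating attribute conjunctions grows combinatorially with the number of attributes, which is exactly the time and memory blow-up the paper's pruning strategy is meant to contain.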









Cited by 15 publications (11 citation statements)
References 7 publications
“…In their simulation, the overall mean prediction rate of the logit was 72.7%, whereas the hit rate for SVM was 85.9%. Similarly, Giuffrida, Chu, and Hanssens (2000) reported that a multivariate decision tree induction algorithm outperformed a logit model in identifying the best customer targets for cross-selling purposes.…”
Section: Computer Science Models
Mentioning confidence: 99%
“…We used all 45,222 tuples in the Adult dataset and 12,960 tuples in the Nursery dataset. We considered decision tree [26], Naive Bayes classifier [27], [28], and classification rules [29], [30] as knowledge models. We used the classification accuracy to measure the quality of decision trees and Naive Bayes classifiers, and the number of preserved classification rules to measure classification rules.…”
Section: Methods
Mentioning confidence: 99%
“…Algorithm application to forecasting and direct marketing has been discussed in management and computer science literature (Cooper & Giuffrida, 2000;Giuffrida, Chu, & Hanssens, 2000). The authors and their affiliates conducted extensive benchmark testing of the KDS/Noah algorithm in application to promotion forecasting and demonstrated its superiority to multiple commercially available data mining solutions including SAS EnterpriseMiner, SOMine, CN2, Ripper, Apriory, CBA, and others (Krycha, 1999).…”
Section: Promotion-Event Forecasting System (PromoCast)
Mentioning confidence: 99%