2016
DOI: 10.1016/j.ejor.2015.08.029
|View full text |Cite
|
Sign up to set email alerts
|

Demand forecasting with high dimensional data: The case of SKU retail sales forecasting with intra- and inter-category promotional information

Abstract: In marketing analytics applications in OR, the modeler often faces the problem of selecting key variables from a large number of possibilities. For example, SKU level retail store sales are affected by inter and intra category effects which potentially need to be considered when deciding on promotional strategy and producing operational forecasts, but no research has put this well accepted concept into forecasting practice: an obvious obstacle is the ultra-high dimensionality of the variable space. This paper … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

3
99
1
1

Year Published

2018
2018
2023
2023

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 135 publications
(104 citation statements)
references
References 68 publications
3
99
1
1
Order By: Relevance
“…The extracted principal components and time dummy variables then serve as the independent variables in a linear regression to forecast the retailer's sales and profits. Though this method has been widely adopted to handle high‐dimensional data in demand forecasting, it has some disadvantages that may compromise its predictive ability (Ma et al., ). For example, the principal components are extracted only from the predictors, and thus the valuable information for predicting the dependent variable may be lost.…”
Section: Empirical Analysesmentioning
confidence: 99%
See 3 more Smart Citations
“…The extracted principal components and time dummy variables then serve as the independent variables in a linear regression to forecast the retailer's sales and profits. Though this method has been widely adopted to handle high‐dimensional data in demand forecasting, it has some disadvantages that may compromise its predictive ability (Ma et al., ). For example, the principal components are extracted only from the predictors, and thus the valuable information for predicting the dependent variable may be lost.…”
Section: Empirical Analysesmentioning
confidence: 99%
“…Among the benchmark models, the standard LASSO method has the smallest prediction error for both sales (MSE = 0.0072) and profits (MSE = 0.0122) of the retailer. Ridge regression has relatively large prediction errors (for sales: MSE = 0.0234; for profits: MSE = 0.0188); this is because it cannot perform variable selection, and the inclusion of highly correlated variables attenuates its predictive ability (Ma et al., ). The PCR method has relatively good performance in terms of prediction accuracy, and the random forests method performs even better.…”
Section: Empirical Analysesmentioning
confidence: 99%
See 2 more Smart Citations
“…First, our model controls for the endogeneity of promotion depth in estimating the high-dimensional promotion effects and advances the application of marketing analytics in data-rich environments. Although the standard LASSO method was introduced into the statistics literature more than 20 years ago (Tibshirani, 1996), it has rarely been applied in the research area of marketing (Ma, Fildes, & Huang, 2016) and few prior studies have considered the endogeneity issue when using the method. Compared with traditional methods, our model can estimate the promotion effects of a large number of products on retailer-level performance with high efficiency and less endogeneity bias.…”
Section: Introductionmentioning
confidence: 99%