2019
DOI: 10.1108/ijqss-05-2018-0050
|View full text |Cite
|
Sign up to set email alerts
|

A study on the effect of imbalanced data in tourism recommendation models

Abstract: Purpose This paper aims to study the effect of imbalanced data in tourism quality models. It is demonstrated that this imbalance strongly affects the accuracy of tourism prediction models for hotel recommendation. Design/methodology/approach A questionnaire was used to survey 83,740 clients from hotels between five and two or less stars using a binary logistic model. The data correspond to a sample of 87 hotels from all around the world (120 countries from America, Africa, Asia, Europe and Australia). Find… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(7 citation statements)
references
References 29 publications
0
7
0
Order By: Relevance
“…. + βm Xm, where m is the number of independent variables X1, X2, X3, and X4, which in this study are: customer service, facilities, cleanliness, and price-quality ratio [66].…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…. + βm Xm, where m is the number of independent variables X1, X2, X3, and X4, which in this study are: customer service, facilities, cleanliness, and price-quality ratio [66].…”
Section: Discussionmentioning
confidence: 99%
“…For these reasons, a possible study in the future should include more attributes in order to know what the hotel recommendation variables are and how the cleanliness measures could change the importance of the model's quality-price ratio. Furthermore, these studies in the current pandemic context should include balanced samples with respect to the recommendation variable [66].…”
Section: Discussionmentioning
confidence: 99%
“…Therefore, it is not known how the imbalance data affects the prediction models and this generates inaccuracies in the results in the accommodation field. While it is true that there are some studies that identify the existence of imbalanced data in hotels (Chawla et al, 2002(Chawla et al, , 2004Li and Sun, 2012;Fern andez-Muñoz et al, 2019) none of them go as far as applying it to predictive models.…”
Section: Theoretical Contextmentioning
confidence: 99%
“…The existence of imbalance data is related to a quantitative analysis of the sample or data collected, and its treatment or not will produce competitive advantage differences among the companies and their marketing campaigns. Quality hotels studies assume there is an equilibrium between the number of satisfied customers (who would recommend the hotel) and unsatisfied customers (who would not recommend the hotel), but this is usually far from true (Fern andez-Muñoz et al, 2019). The existence of imbalanced samples in which the number of satisfied customers is larger than the number of unsatisfied customers directly affects the predictive capacity of the models and has implications for hotel management, development of marketing campaigns, construction of a brand image and the development of competitive advantage.…”
Section: The Imbalance Phenomenon In Tourism Datamentioning
confidence: 99%
See 1 more Smart Citation