Hotel Guest Length of Stay Prediction Using Random Forest Regressor

Keywords: Length of Stay; Random Forest Regression; Predictive Accuracy; Operational Optimization; Machine Learning in Hospitality; Feature Importance; Data-Driven Decision Making

Abstract

This research investigates the application of a Random Forest Regression model to predict the Length of Stay (LoS) of hotel guests from four features: country, guest type, room type, and rating. The study addresses the need for precise forecasting to optimize resource allocation, improve operational efficiency, and support data-driven decision-making in the hospitality sector. The methodology comprises data collection from a structured dataset of guest reviews, followed by preprocessing: encoding categorical variables, converting target values into numeric form, and standardizing features for consistency. The dataset is split into training (80%) and testing (20%) subsets, and the model is trained with n_estimators=100 and random_state=42 to ensure stability and reproducibility. The Random Forest Regression model demonstrated strong predictive performance, achieving an R-squared value of 0.85 and a Mean Absolute Error (MAE) of 1.06. Feature importance analysis identified "country" as the most significant variable (importance score: 0.5), followed by guest type (0.2), room type (0.15), and rating (0.15). The predicted-versus-actual plot and the error distribution show that most errors cluster near zero, indicating high accuracy with minor deviations in extreme cases. These findings emphasize the model’s potential to enhance marketing strategies, optimize resource allocation, and improve guest satisfaction. This research offers a robust framework for integrating predictive analytics into hospitality operations, contributing to sustainable growth and competitive advantage in the industry.
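The workflow summarized in the abstract can be sketched with scikit-learn. This is a minimal illustration under stated assumptions, not the study's actual code: the column names, the synthetic data generator, the "N nights" target format, and the choice of ordinal encoding are all hypothetical stand-ins, since the abstract does not specify them. Only the 80/20 split, n_estimators=100, and random_state=42 come from the text, and the scores produced on synthetic data are unrelated to the reported R-squared and MAE.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_absolute_error, r2_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OrdinalEncoder, StandardScaler

# Hypothetical column names; the abstract only names the four features.
CATEGORICAL = ["country", "guest_type", "room_type"]
FEATURES = CATEGORICAL + ["rating"]


def make_synthetic_reviews(n_rows=500, seed=0):
    """Build a made-up review table (an assumption, not the study's data)."""
    rng = np.random.default_rng(seed)
    country = rng.choice(["ID", "SG", "AU", "NL"], size=n_rows)
    guest_type = rng.choice(["solo", "couple", "family", "group"], size=n_rows)
    room_type = rng.choice(["standard", "deluxe", "suite"], size=n_rows)
    rating = rng.integers(1, 11, size=n_rows).astype(float)
    # Arbitrary rule so that the target depends mostly on country.
    nights_by_country = {"ID": 1, "SG": 2, "AU": 4, "NL": 6}
    nights = (
        pd.Series(country).map(nights_by_country).to_numpy()
        + (guest_type == "family").astype(int)
        + rng.normal(0.0, 0.5, size=n_rows)
    )
    nights = np.clip(np.round(nights), 1, None).astype(int)
    return pd.DataFrame({
        "country": country,
        "guest_type": guest_type,
        "room_type": room_type,
        "rating": rating,
        "stay": [f"{n} nights" for n in nights],  # target stored as text
    })


def fit_and_evaluate(df):
    """Encode, standardize, split 80/20, fit the forest, and score it."""
    # Convert the textual target into a numeric value.
    y = df["stay"].str.extract(r"(\d+)", expand=False).astype(float)
    X = df[FEATURES]
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.2, random_state=42
    )
    encoder = ColumnTransformer(
        [("cat", OrdinalEncoder(handle_unknown="use_encoded_value",
                                unknown_value=-1), CATEGORICAL)],
        remainder="passthrough",  # rating passes through, placed last
    )
    model = Pipeline([
        ("encode", encoder),
        ("scale", StandardScaler()),
        ("forest", RandomForestRegressor(n_estimators=100, random_state=42)),
    ])
    model.fit(X_train, y_train)
    predictions = model.predict(X_test)
    # Encoded columns come out in FEATURES order: categoricals, then rating.
    importances = dict(
        zip(FEATURES, model.named_steps["forest"].feature_importances_)
    )
    return {
        "r2": r2_score(y_test, predictions),
        "mae": mean_absolute_error(y_test, predictions),
        "importances": importances,
        "n_test": len(X_test),
    }


if __name__ == "__main__":
    results = fit_and_evaluate(make_synthetic_reviews())
    print(f"R-squared: {results['r2']:.2f}  MAE: {results['mae']:.2f}")
    for name, score in sorted(results["importances"].items(),
                              key=lambda item: -item[1]):
        print(f"{name}: {score:.2f}")
```

Wrapping the preprocessing and the regressor in one pipeline means the encoder and scaler are fitted on the training subset only, which keeps the test subset out of the preprocessing statistics. Impurity-based importances from a random forest always sum to 1, which is consistent with the scores reported above (0.5 + 0.2 + 0.15 + 0.15).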

Published
2024-12-31
How to Cite
Singgalen, Y. (2024). Hotel Guest Length of Stay Prediction Using Random Forest Regressor. Journal of Information Systems and Informatics, 6(4), 3016-3034. https://doi.org/10.51519/journalisi.v6i4.959
