Hotel Guest Length of Stay Prediction Using Random Forest Regressor
Abstract
This research offers a robust framework for integrating predictive analytics into hospitality operations, contributing to sustainable growth and competitive advantage in the industry. This research investigates the application of the Random Forest Regression model to predict the Length of Stay (LoS) of hotel guests, leveraging key features such as country, guest type, room type, and rating. The study addresses the need for precise forecasting to optimize resource allocation, improve operational efficiency, and support data-driven decision-making in the hospitality sector. The methodology involves data collection from a structured dataset of guest reviews, preprocessing through encoding categorical variables, converting target values into numeric forms, and standardizing features to ensure consistency and uniformity. The dataset is split into training (80%) and testing (20%) subsets, with hyperparameters such as n_estimators=100 and random_state=42 set to ensure stability and reproducibility during model training. The Random Forest Regression model demonstrated strong predictive performance, achieving an R-squared value of 0.85 and a Mean Absolute Error (MAE) of 1.06. Feature importance analysis identified "country" as the most significant variable (importance score: 0.5), followed by guest type (0.2), room type (0.15), and rating (0.15). The Predicted vs. Actual Plot and Error Distribution evaluation reveals that most errors cluster near zero, indicating high accuracy with minor deviations in extreme cases. These findings emphasize the model’s potential to enhance marketing strategies, optimize resource allocation, and improve guest satisfaction. This research offers a robust framework for integrating predictive analytics into hospitality operations, contributing to sustainable growth and competitive advantage in the industry.
Downloads
References
A. Dursun-Cengizci and M. Caber, “Using machine learning methods to predict future churners: an analysis of repeat hotel customers,” Int. J. Contemp. Hosp. Manag., 2024, doi: 10.1108/IJCHM-06-2023-0844.
M. Kumar, C. Kumar, N. Kumar, and S. Kavitha, “Efficient Hotel Rating Prediction from Reviews Using Ensemble Learning Technique,” Wirel. Pers. Commun., vol. 137, no. 2, pp. 1161–1187, 2024, doi: 10.1007/s11277-024-11457-w.
S. Ahmed, S. Chowdhury, and R. M. Rahman, “Hotel Booking Cancellation with Visual Analytics,” International IEEE Conference proceedings, IS, no. 2024. 2024. doi: 10.1109/IS61756.2024.10705220.
K. P. Rajesh, M. Prabu Nallasivam, C. Sakthi Gokul Rajan, P. S. Sherlin Paul, S. Hari Kumar, and V. S. Dharun, “Detection of Fake Hotel Reviews Using ANFIS and Natural Language Processing Techniques,” Proceedings of International Conference on Circuit Power and Computing Technologies, ICCPCT 2024. pp. 265–269, 2024. doi: 10.1109/ICCPCT61902.2024.10672838.
M. S. Shallan, I. F. Moawad, R. El Naggar, and H. Montasser, “Using Machine Learning Techniques to Maximize Profitability in the Hospitality Industry,” 6th International Conference on Computing and Informatics, ICCI 2024. pp. 182–188, 2024. doi: 10.1109/ICCI61671.2024.10485148.
C. YU, L. J. LIANG, and H. C. CHOI, “Examining Customer Value Cocreation Behavior in Boutique Hotels: Hospitableness, Perceived Value, Satisfaction, and Citizenship Behavior,” Tour. Anal., vol. 29, no. 2, pp. 221–237, 2024, doi: 10.3727/108354224X17091476372167.
M. Bordian, M. Fuentes-Blasco, I. Gil-Saura, and B. Moliner-Velázquez, “Technology and Innovation: Analyzing the Heterogeneity of the Hotel Guests’ Behavior,” J. Theor. Appl. Electron. Commer. Res. , vol. 19, no. 2, pp. 1599–1615, 2024, doi: 10.3390/jtaer19020078.
M. Darvishmotevali, H. E. Arici, and M. A. Koseoglu, “Customer satisfaction antecedents in uncertain hospitality conditions: an exploratory data mining approach,” J. Hosp. Tour. Insights, 2024, doi: 10.1108/JHTI-11-2023-0845.
X. Wang, J. Zheng, and M. Luo, “More than words: the role of personality in shaping the timeliness of online reviews,” J. Hosp. Tour. Technol., 2024, doi: 10.1108/JHTT-03-2024-0192.
M. Landa-Zárate, E. Fernández-Echeverría, L. E. García-Santamaría, G. Fernández-Lambert, and E. Martínez-Mendoza, “An Approach to Define Service Strategies: The Case of an Ecotourism Hotel in Mexico,” J. Ind. Eng. Manag., vol. 17, no. 1, pp. 182–195, 2024, doi: 10.3926/jiem.6099.
H. Han, S. I. Kim, J. S. Lee, and I. Jung, “Understanding the drivers of consumers’ acceptance and use of service robots in the hotel industry,” Int. J. Contemp. Hosp. Manag., 2024, doi: 10.1108/IJCHM-02-2024-0163.
A. Bhardwaj, T. Yadav, and R. Chaudhary, “Predicting Hotel Booking Cancellations using Machine Learning Techniques,” 2024 15th International Conference on Computing Communication and Networking Technologies, ICCCNT 2024. 2024. doi: 10.1109/ICCCNT61001.2024.10725148.
K. Sharma, Y. K. Dwivedi, and B. Metri, “Incorporating causality in energy consumption forecasting using deep neural networks,” Ann. Oper. Res., vol. 339, no. 1–2, pp. 537–572, 2024, doi: 10.1007/s10479-022-04857-3.
S. Birim, I. Kazancoglu, S. K. Mangla, A. Kahraman, and Y. Kazancoglu, “The derived demand for advertising expenses and implications on sustainability: a comparative study using deep learning and traditional machine learning methods,” Ann. Oper. Res., vol. 339, no. 1–2, pp. 131–161, 2024, doi: 10.1007/s10479-021-04429-x.
K. Ito, S. Kanemitsu, R. Kimura, and R. Omori, “Time changes of customer behavior on accommodation reservation: a case study of Japan,” Jpn. J. Ind. Appl. Math., vol. 41, no. 2, pp. 881–902, 2024, doi: 10.1007/s13160-023-00623-5.
N. Satish, J. Anmala, K. Rajitha, and M. R. R. Varma, “A stacking ANN ensemble model of ML models for stream water quality prediction of Godavari River Basin, India,” Ecol. Inform., vol. 80, 2024, doi: 10.1016/j.ecoinf.2024.102500.
A. Lotfipoor, S. Patidar, and D. P. Jenkins, “Deep neural network with empirical mode decomposition and Bayesian optimisation for residential load forecasting,” Expert Syst. Appl., vol. 237, 2024, doi: 10.1016/j.eswa.2023.121355.
S. Chalupa and M. Petricek, “Understanding customer’s online booking intentions using hotel big data analysis,” J. Vacat. Mark., vol. 30, no. 1, pp. 110–122, 2024, doi: 10.1177/13567667221122107.
S. M. Fazal-e-Hasan, G. Mortimer, H. Ahmadi, M. Adil, and M. Sadiq, “Examining the impact of tourists’ hope, knowledge and perceived value on online hotel booking intentions,” Asia Pacific J. Tour. Res., vol. 29, no. 6, pp. 719–735, 2024, doi: 10.1080/10941665.2024.2343058.
S. Khan and S. U. Khan, “Tourist motivation to adopt smart hospitality: the impact of smartness and technology readiness,” J. Hosp. Tour. Insights, 2024, doi: 10.1108/JHTI-04-2024-0335.
J. Castanha, S. K. B. Pillai, and K. G. Sankaranarayanan, “What Influences Consumer Satisfaction and Behaviour Intention in Hotel Industry? A Case Study of Goa, India,” Int. J. Hosp. Tour. Syst., vol. 17, no. 2, pp. 61–69, 2024, [Online]. Available: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85197922431&origin=inward
A. Pal, K. S. Ahmed, and S. Mangalathu, “Data-driven machine learning approaches for predicting slump of fiber-reinforced concrete containing waste rubber and recycled aggregate,” Constr. Build. Mater., vol. 417, 2024, doi: 10.1016/j.conbuildmat.2024.135369.
P. Jain, M. T. Islam, and A. S. Alshammari, “Comparative analysis of machine learning techniques for metamaterial absorber performance in terahertz applications,” Alexandria Eng. J., vol. 103, pp. 51–59, 2024, doi: 10.1016/j.aej.2024.05.111.
J. de S. Brogni, L. T. Tricárico, P. F. Limberger, and T. F. Fiuza, “The relationship between visitors’ motivations and satisfaction about a Brazilian sacred complex,” Int. J. Tour. Cities, vol. 10, no. 2, pp. 682–700, Jan. 2024, doi: 10.1108/IJTC-03-2022-0060.
Z. Chen, C. Ye, H. Yang, P. Ye, Y. Xie, and Z. Ding, “Exploring the impact of seasonal forest landscapes on tourist emotions using Machine learning,” Ecol. Indic., vol. 163, 2024, doi: 10.1016/j.ecolind.2024.112115.
T. D. Dang and M. T. Nguyen, “Understanding Customer Perception and Brand Equity in the Hospitality Sector: Integrating Sentiment Analysis and Topic Modeling,” Springer Proceedings in Business and Economics. pp. 413–425, 2024. doi: 10.1007/978-3-031-49105-4_24.
A. S. Abuhammad and M. A. Ahmed, “Automatic Negation Detection for Semantic Analysis in Arabic Hotel Reviews Through Lexical and Structural Features: A Supervised Classification,” J. Inf. Commun. Technol., vol. 23, no. 4, pp. 709–744, 2024, doi: 10.32890/jict2024.23.4.5.
Garima et al., “Fake Review Detection and Removal: A Comparative Analysis using ML and DL Models,” 15th International Conference on Advances in Computing, Control, and Telecommunication Technologies, ACT 2024, vol. 1. pp. 200–208, 2024. [Online]. Available: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85208790888&origin=inward
M. M. Khan and M. Alkhathami, “Anomaly detection in IoT-based healthcare: machine learning for enhanced security,” Sci. Rep., vol. 14, no. 1, 2024, doi: 10.1038/s41598-024-56126-x.
S. Bhadra and C. J. Kumar, “Enhancing the efficacy of depression detection system using optimal feature selection from EHR,” Comput. Methods Biomech. Biomed. Engin., vol. 27, no. 2, pp. 222–236, 2024, doi: 10.1080/10255842.2023.2181660.
R. A. Rasul, P. Saha, D. Bala, S. M. R. U. Karim, M. I. Abdullah, and B. Saha, “An evaluation of machine learning approaches for early diagnosis of autism spectrum disorder,” Healthc. Anal., vol. 5, 2024, doi: 10.1016/j.health.2023.100293.
L. A. Pereira, R. S. Frio, M. A. Pereira, and T. O. Dos Santos, “Does guest perception of sustainability affect consumer advocacy in hospitality?,” Brazilian J. Tour. Res., vol. 18, 2024, doi: 10.7784/rbtur.v18.2969.
K. M. Selem, M. H. Shoukat, R. Khalid, and M. Raza, “Guest interaction with hotel booking website information: scale development and validation of antecedents and consequences,” J. Hosp. Mark. Manag., vol. 33, no. 5, pp. 626–648, 2024, doi: 10.1080/19368623.2023.2279174.
A. K. Zinn, D. Greene, and S. Dolnicar, “Communicating default changes to hotel room cleaning without reducing guest satisfaction,” J. Clean. Prod., vol. 483, 2024, doi: 10.1016/j.jclepro.2024.144266.
F. Fadhlurrachman and N. Sofyan, “Eastparc Hotel Marketing Communication Strategy for Increasing Occupancy During the Pandemic in 2021,” Studies in Systems, Decision and Control, vol. 489. pp. 501–510, 2024. doi: 10.1007/978-3-031-36895-0_40.
O. Martorell Cunill, L. Otero, P. Durán Santomil, and J. Gil Lafuente, “Analysis of the effect of growth strategies and hotel attributes on performance,” Manag. Decis., vol. 62, no. 7, pp. 2233–2264, 2024, doi: 10.1108/MD-06-2023-0974.
D. Contessi, L. Viverit, L. N. Pereira, and C. Y. Heo, “Decoding the future: Proposing an interpretable machine learning model for hotel occupancy forecasting using principal component analysis,” Int. J. Hosp. Manag., vol. 121, 2024, doi: 10.1016/j.ijhm.2024.103802.
J. L. Nicolau, Z. Xiang, and D. Wang, “Daily online review sentiment and hotel performance,” Int. J. Contemp. Hosp. Manag., vol. 36, no. 3, pp. 790–811, Jan. 2024, doi: 10.1108/IJCHM-05-2022-0594.


Copyright (c) 2024 Journal of Information Systems and Informatics

This work is licensed under a Creative Commons Attribution 4.0 International License.
- I certify that I have read, understand and agreed to the Journal of Information Systems and Informatics (Journal-ISI) submission guidelines, policies and submission declaration. Submission already using the provided template.
- I certify that all authors have approved the publication of this and there is no conflict of interest.
- I confirm that the manuscript is the authors' original work and the manuscript has not received prior publication and is not under consideration for publication elsewhere and has not been previously published.
- I confirm that all authors listed on the title page have contributed significantly to the work, have read the manuscript, attest to the validity and legitimacy of the data and its interpretation, and agree to its submission.
- I confirm that the paper now submitted is not copied or plagiarized version of some other published work.
- I declare that I shall not submit the paper for publication in any other Journal or Magazine till the decision is made by journal editors.
- If the paper is finally accepted by the journal for publication, I confirm that I will either publish the paper immediately or withdraw it according to withdrawal policies
- I Agree that the paper published by this journal, I transfer copyright or assign exclusive rights to the publisher (including commercial rights)