Predictive Analytics on Shopee for Optimizing Product Demand Prediction through K-Means Clustering and KNN Algorithm Fusion

  • Mesi Febima, Catur Insan Cendekia University, Indonesia
  • Lena Magdalena, Catur Insan Cendekia University, Indonesia

Keywords: K-Means, KNN, Product Demand, Sales Prediction, Shopee.

Abstract

This study applies predictive analytics to the Shopee marketplace, aiming to improve product demand forecasting by combining K-Means clustering with the K-Nearest Neighbor (KNN) algorithm. As e-commerce platforms such as Shopee grow rapidly, accurate demand prediction is increasingly important for inventory management and marketing strategy. We propose a novel approach that combines the strengths of both algorithms to improve demand prediction accuracy. K-Means first groups similar products into two clusters, “Low Interest” (64 data points) and “High Interest” (25 data points); KNN is then applied within each cluster to assign every product to one of two classes, Low Sales or High Sales. In tests with k values of 3, 5, and 7, KNN predicted that the product “Soraya Bedsheet Cotton Gold Motif Dallas Ask Grey Tua” falls under “High Sales.” The reported sales prediction accuracy for Shopee marketplace products is 96%. These findings suggest that combining K-Means and KNN can improve the accuracy of product demand predictions and help optimize inventory and marketing strategies.
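
The two-stage idea described in the abstract (cluster first, then classify inside the matching cluster) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration: the two features (views, favorites), the twelve toy records, the query product, and the helper names `kmeans`, `nearest_centroid`, and `knn_predict` do not come from the paper, whose actual features and preprocessing are not stated in the abstract. Only the two cluster names, the two sales classes, and the k values 3, 5, and 7 follow the text.

```python
import math
from collections import Counter

# Hypothetical toy records: ((views, favorites), known sales class).
# These are NOT the paper's data; they only illustrate the workflow.
PRODUCTS = [
    ((10, 1), "Low Sales"), ((12, 2), "Low Sales"), ((15, 1), "Low Sales"),
    ((14, 4), "High Sales"), ((18, 5), "High Sales"), ((20, 6), "High Sales"),
    ((80, 20), "Low Sales"), ((85, 22), "Low Sales"), ((90, 30), "High Sales"),
    ((95, 32), "High Sales"), ((100, 35), "High Sales"), ((98, 33), "High Sales"),
]


def nearest_centroid(point, centroids):
    """Index of the centroid closest to `point` (Euclidean distance)."""
    return min(range(len(centroids)), key=lambda c: math.dist(point, centroids[c]))


def kmeans(points, k=2, max_iter=100):
    """Plain K-Means with a deterministic start: evenly spaced sorted points."""
    ordered = sorted(points)
    step = (len(ordered) - 1) / (k - 1) if k > 1 else 0
    centroids = [ordered[round(i * step)] for i in range(k)]
    for _ in range(max_iter):
        labels = [nearest_centroid(p, centroids) for p in points]
        updated = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # move the centroid to the mean of its members
                updated.append(tuple(sum(axis) / len(members) for axis in zip(*members)))
            else:        # keep an empty cluster's centroid where it was
                updated.append(centroids[c])
        if updated == centroids:  # assignments have stopped changing
            break
        centroids = updated
    return [nearest_centroid(p, centroids) for p in points], centroids


def knn_predict(labelled, query, k):
    """Majority class among the k labelled records nearest to `query`."""
    nearest = sorted(labelled, key=lambda rec: math.dist(rec[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


# Stage 1: split the products into two interest clusters.
points = [features for features, _ in PRODUCTS]
labels, centroids = kmeans(points, k=2)
low_id = min(range(2), key=lambda c: sum(centroids[c]))
cluster_names = {low_id: "Low Interest", 1 - low_id: "High Interest"}

# Stage 2: classify a new (hypothetical) product inside its own cluster.
query = (96, 31)
query_cluster = nearest_centroid(query, centroids)
members = [rec for rec, lab in zip(PRODUCTS, labels) if lab == query_cluster]
predictions = {k: knn_predict(members, query, k) for k in (3, 5, 7)}

print(cluster_names[query_cluster], predictions)
```

On this toy data the query lands in the “High Interest” cluster and is classified as “High Sales” for all three k values. Restricting KNN to the query's own cluster means the neighbors are drawn only from products with a similar interest profile, which is one plausible reading of the fusion the abstract describes. Note that a cluster may hold fewer than k records (here six, with k = 7), in which case this sketch simply votes over all of them.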

Published
2024-06-13
How to Cite
Febima, M., & Magdalena, L. (2024). Predictive Analytics on Shopee for Optimizing Product Demand Prediction through K-Means Clustering and KNN Algorithm Fusion. Journal of Information Systems and Informatics, 6(2), 751-765. https://doi.org/10.51519/journalisi.v6i2.720