Manhattan Metric Technique in K-Means Clustering for Data Grouping

  • Mila Sari State Islamic University of North Sumatra, Indonesia
  • Armansyah Armansyah State Islamic University of North Sumatra, Indonesia https://orcid.org/0000-0003-3411-0112
Keywords: K-Means, Manhattan Distance, Elbow Method, Davies Bouldin Index

Abstract

Clustering can be defined as a method commonly applied in data mining to group objects into clusters. Clusters consist of data objects that are similar to each other in a group but different from objects in other clusters. In this study, the data used is the data of KIP scholarship recipients for the 2016-2023 period. Various clustering metric measurement techniques have been frequently used by researchers, especially those focusing on distance and similarity metrics, such as Euclidean Distance, Manhattan, and Minkowski. In general, K-Means is an unsupervised learning method used in the clustering process to group data based on similarity. The elbow method is used to determine the optimal number of clusters, so that the clustering results obtained can be maximized to achieve better results. This study aims to analyze the use of Manhattan technique in K-Means clustering for data grouping. The research problem is how to analyze the Manhattan metric technique in K-Means clustering for effective data grouping. Applying the K-Means method shows that the existing data is successfully divided into four specified clusters. After determining the correct number of clusters, the K-Means method is used to sort the data in the dataset. From 3172 data, the final results obtained cluster 0 as many as 774 data, cluster 1 as many as 417 data, cluster 2 as many as 1244 data, and cluster 3 as many as 737 data. The results of the clustering process obtained a davies-bouldin index value of 1.4568.

Downloads

Download data is not yet available.

References

M.J. Nurul Rohmawati and S. Defiyanti, "Implementation of K-Means Algorithm in Clustering Scholarship Applicants," Jitter, vol. 1, no. 2, pp. 62–68, 2015.

Y. Heryadi and E. Irwansyah, Deep Learning: Its Application in Geospatial Field, PT. Artifisia Wahana Informa Teknologi, 2020.

M.A. Hakim and I. N. Alam, "A Comparative Study of Clustering Techniques in Data Mining: K-Means, DBSCAN, and Hierarchical Clustering," Int. J. Adv. Comput. Sci. Appl., vol. 9, no. 12, pp. 275–283, 2018, doi: 10.14569/IJACSA.2018.091236.

S.A. Rahmah and J. Antares, "Clusterization of Student Selection Candidates for Scholarship Recipients from the Foundation Using K-Means Clustering," INFORM., vol. 13, no. 2, p. 25, 2022, doi: 10.36723/juri.v13i2.282.

M.S. Pangestu and M.A. Fitriani, "Comparison of Euclidean Distance, Manhattan Distance, and Cosine Similarity Calculations in Rice Seed Data Clustering Using the K-Means Algorithm," Sainteks, vol. 19, no. 2, p. 141, 2022, doi: 10.30595/sainteks.v19i2.14495.

M. Nishom, "Comparison of Accuracy of Euclidean Distance, Minkowski Distance, and Manhattan Distance on Chi-Square Based K-Means Clustering Algorithm," J. Inform.: J. Pengemb. IT, vol. 4, no. 1, pp. 20–24, 2019, doi: 10.30591/jpit.v4i1.1253.

N. Syahfitri, E. Budianita, A. Nazir, and I. Afrianty, "Product Grouping Based on Inventory Data Using the Elbow and K-Medoid Methods," KLIK Kajian Ilm. Inform. Komput., vol. 4, no. 3, pp. 1668–1675, 2023, doi: 10.30865/klik.v4i3.1525.

R.F.N. Alifah and A.C. Fauzan, "Implementation of K-Means Clustering Algorithm Based on Manhattan Distance for Clustering of Student Field Concentration," Ilk. J. Comput. Sci. Appl. Inform., vol. 5, no. 1, pp. 31–41, 2023, doi: 10.28926/ilkomnika.v5i1.542.

W. Wahyu Pribadi, A. Yunus, and A.S. Wiguna, "Comparison of K-Means Euclidean Distance and Manhattan Distance Methods in Determining Covid-19 Zoning in Malang Regency," JATI, vol. 6, no. 2, pp. 493–500, 2022, doi: 10.36040/jati.v6i2.4808.

N. Puspitasari, H. Haviluddin, and F.U.J. Helmi Puadi, "Clusterization of Pepper Plant Producing Areas Using the K-Means Algorithm," Indones. J. Comput. Sci., vol. 11, no. 3, pp. 1001–1013, 2022, doi: 10.33022/ijcs.v11i3.3104.

R. Syahputra and E. Bu'ulolo, "Determination of Students Receiving Single Tuition Assistance with K-NN," J. Eksplora Inform., vol. 13, no. 1, pp. 46–54, 2023, doi: 10.30864/eksplora.v13i1.797.

Published
2024-09-24
Abstract views: 98 times
Download PDF: 55 times
How to Cite
Sari, M., & Armansyah, A. (2024). Manhattan Metric Technique in K-Means Clustering for Data Grouping. Journal of Information Systems and Informatics, 6(3), 1945-1961. https://doi.org/10.51519/journalisi.v6i3.841