Modeling Student Learning Profiles from LMS Behavioral Traces Using Big Data Analytics

Authors

  • Arief Hidayat Universitas Wahid Hasyim, Indonesia
  • Kusworo Adi Diponegoro University, Indonesia
  • Bayu Surarso Diponegoro University, Indonesia
Pages Icon

DOI:

https://doi.org/10.63158/journalisi.v8i3.1588

Keywords:

Learning Analytics, Educational Data Mining, LMS Behavioral Traces, K-Means Clustering, Student Behavioral Modeling, Learning Profiles

Abstract

Digital learning environments and Learning Management Systems (LMSs) generate large volumes of time-stamped behavioral traces that can be used to examine how students access resources, navigate course structures, communicate, and approach assessments. Traditional learning-style models often depend on static self-report categories and may not reflect how students actually study in digital courses. This study develops a learning analytics framework for modeling student learning profiles from authentic LMS behavioral traces. The study used a quantitative, non-experimental, longitudinal design based on Canvas LMS interaction data from 15,342 undergraduate students enrolled in 150 large-enrollment courses during the 2023–2024 academic year. More than 500 million raw interaction logs were processed into 24 engineered behavioral features representing temporal engagement, resource access, navigation behavior, interaction activity, and assessment timing. After feature normalization, K-Means clustering was applied, and the optimal cluster solution was selected using the elbow method and average silhouette score. Cluster distinctiveness was examined using one-way analysis of variance, and the association between cluster membership and academic performance category was evaluated using a Chi-squared test. The analysis supported a four-cluster solution. Assessment procrastination and navigation sequentially were the strongest differentiating features.

Downloads

Download data is not yet available.

References

[1] G. Siemens, “Learning analytics: The emergence of a discipline,” American Behavioral Scientist, vol. 57, no. 10, pp. 1380–1400, 2013, doi: 10.1177/0002764213498851.

[2] R. Ferguson, “Learning analytics: drivers, developments and challenges,” International Journal of Technology Enhanced Learning, vol. 4, no. 5/6, pp. 304–317, 2012, doi: 10.1504/IJTEL.2012.051816.

[3] C. Romero and S. Ventura, “Educational data mining and learning analytics: An updated survey,” WIREs Data Mining and Knowledge Discovery, vol. 10, no. 3, e1355, 2020, doi: 10.1002/widm.1355.

[4] R. S. J. d. Baker and G. Siemens, “Educational data mining and learning analytics,” in The Cambridge Handbook of the Learning Sciences, 2nd ed., R. K. Sawyer, Ed. Cambridge: Cambridge University Press, 2014, pp. 253–272.

[5] D. Gašević, S. Dawson, and G. Siemens, “Let’s not forget: Learning analytics are about learning,” TechTrends, vol. 59, no. 1, pp. 64–71, 2015, doi: 10.1007/s11528-014-0822-x.

[6] O. Viberg, M. Hatakka, O. Bälter, and A. Mavroudi, “The current landscape of learning analytics in higher education,” Computers in Human Behavior, vol. 89, pp. 98–110, 2018, doi: 10.1016/j.chb.2018.07.027.

[7] R. Cerezo, M. Sánchez-Santillán, M. P. Paule-Ruiz, and J. C. Núñez, “Students’ LMS interaction patterns and their relationship with achievement: A case study in higher education,” Computers & Education, vol. 96, pp. 42–54, 2016, doi: 10.1016/j.compedu.2016.02.006.

[8] J. W. You, “Identifying significant indicators using LMS data to predict course achievement in online learning,” The Internet and Higher Education, vol. 29, pp. 23–30, 2016, doi: 10.1016/j.iheduc.2015.11.003.

[9] J. Jovanović, D. Gašević, S. Dawson, A. Pardo, and N. Mirriahi, “Learning analytics to unveil learning strategies in a flipped classroom,” The Internet and Higher Education, vol. 33, pp. 74–85, 2017, doi: 10.1016/j.iheduc.2017.02.001.

[10] H. Pashler, M. McDaniel, D. Rohrer, and R. Bjork, “Learning styles: Concepts and evidence,” Psychological Science in the Public Interest, vol. 9, no. 3, pp. 105–119, 2008, doi: 10.1111/j.1539-6053.2009.01038.x.

[11] P. A. Kirschner, “Stop propagating the learning styles myth,” Computers & Education, vol. 106, pp. 166–171, 2017, doi: 10.1016/j.compedu.2016.12.006.

[12] P. M. Newton and M. Miah, “Evidence-based higher education: Is the learning styles myth important?” Frontiers in Psychology, vol. 8, article 444, 2017, doi: 10.3389/fpsyg.2017.00444.

[13] P. H. Winne and A. F. Hadwin, “Studying as self-regulated learning,” in Metacognition in Educational Theory and Practice, D. J. Hacker, J. Dunlosky, and A. C. Graesser, Eds. Mahwah, NJ: Lawrence Erlbaum, 1998, pp. 277–304.

[14] E. Panadero, “A review of self-regulated learning: Six models and four directions for research,” Frontiers in Psychology, vol. 8, article 422, 2017, doi: 10.3389/fpsyg.2017.00422.

[15] J. Broadbent and W. L. Poon, “Self-regulated learning strategies and academic achievement in online higher education learning environments: A systematic review,” The Internet and Higher Education, vol. 27, pp. 1–13, 2015, doi: 10.1016/j.iheduc.2015.04.007.

[16] C. C. Gray and D. Perkins, “Utilizing early engagement and machine learning to predict student outcomes,” Computers & Education, vol. 131, pp. 22–32, 2019, doi: 10.1016/j.compedu.2018.12.006.

[17] L. A. Macfadyen and S. Dawson, “Mining LMS data to develop an early warning system for educators: A proof of concept,” Computers & Education, vol. 54, no. 2, pp. 588–599, 2010, doi: 10.1016/j.compedu.2009.09.008.

[18] J. MacQueen, “Some methods for classification and analysis of multivariate observations,” in Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, Berkeley, CA: University of California Press, 1967, pp. 281–297.

[19] D. Arthur and S. Vassilvitskii, “k-means++: The advantages of careful seeding,” in Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms, 2007, pp. 1027–1035.

[20] P. J. Rousseeuw, “Silhouettes: A graphical aid to the interpretation and validation of cluster analysis,” Journal of Computational and Applied Mathematics, vol. 20, pp. 53–65, 1987, doi: 10.1016/0377-0427(87)90125-7.

[21] S. Slade and P. Prinsloo, “Learning analytics: Ethical issues and dilemmas,” American Behavioral Scientist, vol. 57, no. 10, pp. 1510–1529, 2013, doi: 10.1177/0002764213479366.

[22] D. Ifenthaler and C. Schumacher, “Student perceptions of privacy principles for learning analytics,” Educational Technology Research and Development, vol. 64, pp. 923–938, 2016, doi: 10.1007/s11423-016-9477-y.

[23] T. Susnjak, G. S. Ramaswami, and A. Mathrani, “Learning analytics dashboard: A tool for providing actionable insights to learners,” International Journal of Educational Technology in Higher Education, vol. 19, article 12, 2022, doi: 10.1186/s41239-021-00313-7.

[24] M. Blumenstein, “Synergies of learning analytics and learning design: A systematic review of student outcomes,” Journal of Learning Analytics, vol. 7, no. 3, pp. 13–32, 2020, doi: 10.18608/jla.2020.73.3.

[25] L. Paulsen and E. Lindsay, “Learning analytics dashboards are increasingly becoming about learning and not just analytics: A systematic review,” Education and Information Technologies, vol. 29, no. 11, pp. 14279–14308, 2024, doi: 10.1007/s10639-023-12401-4.

[26] I. Molenaar, “Towards hybrid human-AI learning technologies,” European Journal of Education, vol. 57, no. 4, pp. 632–645, 2022, doi: 10.1111/ejed.12527.

[27] S. H. E. Dijkstra, M. Hinne, E. Segers, and I. Molenaar, “Clustering children’s learning behaviour to identify self-regulated learning support needs,” Computers in Human Behavior, vol. 145, article 107754, 2023, doi: 10.1016/j.chb.2023.107754.

Downloads

Published

2026-06-22

Issue

Section

Articles

Most read articles by the same author(s)