Data-Driven Student Group Formation for Group Investigation: A K-Medoids Clustering Approach in Cooperative Learning

Authors

  • Salma Alyasyifa Information System, Telkom University, Indonesia
  • Oktariani Nurul Pratiwi Information System, Telkom University, Indonesia
  • Irfan Darmawan Information System, Telkom University, Indonesia

DOI:

https://doi.org/10.52436/1.jutif.2025.6.5.4765

Keywords:

Cooperative Learning, Group Investigation, K-Medoids, Student Group Formation

Abstract

Group Investigation (GI) is a widely used cooperative learning strategy in higher education, but challenges such as large class sizes and diverse student profiles complicate manual group formation. Previous studies have applied clustering algorithms like K-Means, yet K-Medoids, which is robust to noise, remain underexplored for group formation, especially GI. This study proposed a data-driven approach using the K-Medoids clustering algorithm to create student groups that are both interest-aligned and heterogeneous in profile, which enhancing the effectiveness of GI activities. Employing the Knowledge Discovery in Databases (KDD) framework, the process included data selection, preprocessing, transformation, three grouping processes, and evaluation were performed. In grouping process students were initially grouped by interest, clustered using K-Medoids with various distance measures tested, and finally, groups were adjusted to balance homogeneity and diversity. In grouping stage 2, clustering with Euclidean distance and PCA achieved the highest Silhouette Score, indicating superior grouping quality. The result of heterogeneity group of students evaluated with Gower dissimilarity shows that the method produces internally diverse yet cohesive interest groups, supporting GI goals.

Downloads

Download data is not yet available.

References

M. Malan, “The Effectiveness of Cooperative Learning in an Online Learning Environment Through a Comparison of Group and Individual Marks,” Electronic Journal of e-Learning, vol. 19, no. 6, pp. 588–600, Dec. 2021, doi: 10.34190/ejel.19.6.2238.

A. Abramczyk and S. Jurkowski, “Cooperative learning as an evidence-based teaching strategy: what teachers know, believe, and how they use it,” Journal of Education for Teaching, vol. 46, no. 3, pp. 296–308, 2020, doi: 10.1080/02607476.2020.1733402.

Sugiharto, “Geographical students’ learning outcomes on basic political science by using cooperative learning model with Group Investigation (GI) type in State University of Medan, Indonesia,” J Hum Behav Soc Environ, vol. 30, no. 4, pp. 447–456, May 2020, doi: 10.1080/10911359.2019.1696261.

J. Kilpeläinen-Pettersson, P. Koskinen, A. Lehtinen, and T. Mäntylä, “Cooperative Learning in Higher Education Physics – A Systematic Literature Review,” Int J Sci Math Educ, Jan. 2025, doi: 10.1007/s10763-024-10538-3.

M. Susanti, S. Suyanto, J. Jailani, and H. Retnawati, “Problem-based learning for improving problem-solving and critical thinking skills: A case on probability theory course,” Journal of Education and Learning (EduLearn), vol. 17, no. 4, pp. 507–525, Nov. 2023, doi: 10.11591/edulearn.v17i4.20866.

M. Saqr and S. López-Pernas, “The temporal dynamics of online problem-based learning: Why and when sequence matters,” Int J Comput Support Collab Learn, vol. 18, no. 1, pp. 11–37, 2023, doi: 10.1007/s11412-023-09385-1.

D. A. Donovan and G. L. Connell, “Evolution of a Student-Centered Biology Class: How Systematically Testing Aspects of Class Structure Has Informed Our Teaching,” in Active Learning in College Science, Cham: Springer International Publishing, 2020, pp. 307–323. doi: 10.1007/978-3-030-33600-4_20.

M. Fischer, R. M. Rilke, and B. B. Yurtoglu, “When, and why, do teams benefit from self-selection?,” Exp Econ, vol. 26, no. 4, pp. 749–774, Sep. 2023, doi: 10.1007/s10683-023-09800-2.

Y. Sharan and S. Sharan, “Design for Change: A Teacher Education Project for Cooperative Learning and Group Investigation in Israel,” in Pioneering Perspectives in Cooperative Learning, Routledge, 2021, pp. 165–182. doi: 10.4324/9781003106760-8.

A. C. Seherrie and A. S. Mawela, “Life Orientation teachers’ pedagogical content knowledge and skills in using a group investigation cooperative teaching approach,” Journal of Education, no. 89, pp. 1–20, Jan. 2023, doi: 10.17159/2520-9868/i89a03.

H. Silva, J. Lopes, E. Morais, and C. Dominguez, “Fostering Critical and Creative Thinking through the Cooperative Learning Jigsaw and Group Investigation,” International Journal of Instruction, vol. 16, no. 3, pp. 261–282, Jul. 2023, doi: 10.29333/iji.2023.16315a.

T. Vallès-Català and R. Palau, “Minimum entropy collaborative groupings: A tool for an automatic heterogeneous learning group formation,” PLoS One, vol. 18, no. 3 March, 2023, doi: 10.1371/journal.pone.0280604.

S. Lorente, M. Arnal-Palacián, and M. Paredes-Velasco, “Effectiveness of cooperative, collaborative, and interdisciplinary learning guided by software development in Spanish universities,” European Journal of Psychology of Education, vol. 39, no. 4, pp. 4467–4491, Dec. 2024, doi: 10.1007/s10212-024-00881-y.

R. C.-Y. Loh and C.-S. Ang, “Unravelling Cooperative Learning in Higher Education,” Research in Social Sciences and Technology, vol. 5, no. 2, pp. 22–39, May 2020, doi: 10.46303/ressat.05.02.2.

Kanika, S. Chakraverty, P. Chakraborty, and M. Madan, “Effect of different grouping arrangements on students’ achievement and experience in collaborative learning environment,” Interactive Learning Environments, vol. 31, no. 10, pp. 6366–6378, 2023, doi: 10.1080/10494820.2022.2036764.

G. Nalli, D. Amendola, A. Perali, and L. Mostarda, “Comparative analysis of clustering algorithms and moodle plugin for creation of student heterogeneous groups in online university courses,” Applied Sciences (Switzerland), vol. 11, no. 13, 2021, doi: 10.3390/app11135800.

G. Nalli, D. Amendola, and S. Smith, “Artificial Intelligence to Improve Learning Outcomes Through Online Collaborative Activities,” in Proceedings of the European Conference on e-Learning, ECEL, Oct. 2022, pp. 475–479. doi: 10.34190/ecel.21.1.661.

G. Nalli and S. Smith, “Comparison of the Effectiveness and Performance of Student Workgroups in Online Wiki Activities with and without AI †,” Engineering Proceedings, vol. 56, no. 1, 2023, doi: 10.3390/ASEC2023-16273.

Z. Bousalem, A. Qazdar, and I. El Guabassi, “Cooperative Learning Groups: A New Approach Based on Students’ Performance Prediction,” International Journal of Online and Biomedical Engineering (iJOE), vol. 19, no. 12, pp. 34–48, Aug. 2023, doi: 10.3991/ijoe.v19i12.41181.

N. Gavrilovic, T. Sibalija, and D. Domazet, “Design and implementation of discrete Jaya and discrete PSO algorithms for automatic collaborative learning group composition in an e-learning system,” Appl Soft Comput, vol. 129, p. 109611, Nov. 2022, doi: 10.1016/J.ASOC.2022.109611.

O. S. Revelo, C. A. Collazos Ordonez, M. A. Redondo, and I. Ibert Bittencourt Santana Pinto, “Homogeneous Group Formation in Collaborative Learning Scenarios: An Approach Based on Personality Traits and Genetic Algorithms,” IEEE Transactions on Learning Technologies, vol. 14, no. 4, pp. 486–499, 2021, doi: 10.1109/TLT.2021.3105008.

P. S. Pawar, J. R. Saini, P. J. Pawar, and S. Vaidya, Genetic Algorithm Application for Improving the Performance of Teaching Learning Process Through Collaborative Learning, vol. 1086 LNNS. 2024. doi: 10.1007/978-981-97-6036-7_27.

C.-C. Peng, C.-J. Tsai, T.-Y. Chang, J.-Y. Yeh, and M.-C. Lee, “Novel heterogeneous grouping method based on magic square,” Inf Sci (N Y), vol. 517, pp. 340–360, 2020, doi: 10.1016/j.ins.2019.12.088.

E. Schubert and P. J. Rousseeuw, “Fast and eager K-Medoids clustering: O(k) runtime improvement of the PAM, CLARA, and CLARANS algorithms,” Inf Syst, vol. 101, p. 101804, Nov. 2021, doi: 10.1016/J.IS.2021.101804.

J. Wang, X. W. Chen, K. R. Cheng, Y. L. Cao, and B. Pan, A Study on the Characteristics of College Students’ Consumption Behavior Based on Clustering and Association Rules, vol. 1423. 2021. doi: 10.1007/978-3-030-78618-2_30.

T. Hamim, F. Benabbou, and N. Sael, “Toward a Generic Student Profile Model,” in Innovations in Smart Cities Applications, Edition 3., Springer, 2020, pp. 200–214. doi: 10.1007/978-3-030-37629-1_16.

T. Emmanuel, T. Maupong, D. Mpoeleng, T. Semong, B. Mphago, and O. Tabona, “A survey on missing data in machine learning,” J Big Data, vol. 8, no. 1, 2021, doi: 10.1186/s40537-021-00516-9.

W.-C. Lin and C.-F. Tsai, “Missing value imputation: a review and analysis of the literature (2006–2017),” Artif Intell Rev, vol. 53, no. 2, pp. 1487–1509, 2020, doi: 10.1007/s10462-019-09709-4.

V. N. G. Raju, K. P. Lakshmi, V. M. Jain, A. Kalidindi, and V. Padma, “Study the Influence of Normalization/Transformation process on the Accuracy of Supervised Classification,” in Proceedings of the 3rd International Conference on Smart Systems and Inventive Technology, ICSSIT 2020, 2020, pp. 729–735. doi: 10.1109/ICSSIT48917.2020.9214160.

A. Tran, C. Zuniga-Navarrete, L. J. Segura, A. Dourado, X. Wang, and C. R. Bego, “Categorical Variable Coding for Machine Learning in Engineering Education,” in Proceedings - Frontiers in Education Conference, FIE, 2024. doi: 10.1109/FIE61694.2024.10893080.

M. M. Ahsan, M. A. P. Mahmud, P. K. Saha, K. D. Gupta, and Z. Siddique, “Effect of Data Scaling Methods on Machine Learning Algorithms and Model Performance,” Technologies (Basel), vol. 9, no. 3, 2021, doi: 10.3390/technologies9030052.

M. Prince and P. M. Joe Prathap, “A Novel Approach to Design Distribution Preserving Framework for Big Data,” Intelligent Automation and Soft Computing, vol. 35, no. 3, pp. 2789–2803, 2023, doi: 10.32604/iasc.2023.029533.

U. Mishra and S. S. Parikh, “Clustering method for forming student groups based on the cognitive performance measured using eye responses,” in Proceedings of 2024 3rd International Conference on Artificial Intelligence and Intelligent Information Processing, AIIIP 2024, 2025, pp. 303–310. doi: 10.1145/3707292.3707381.

K. Murphy, S. López-Pernas, and M. Saqr, Dissimilarity-Based Cluster Analysis of Educational Data: A Comparative Tutorial Using R. 2024. doi: 10.1007/978-3-031-54464-4_8.

T. J. Fontalvo-Herrera, E. J. Delahoz, and A. A. Mendoza-Mendoza, “Application of data mining for the classification of university programs of industrial engineering accredited in high quality in Colombia | Aplicación de minería de datos para la clasificación de programas universitarios de ingeniería industrial acreditad,” Informacion Tecnologica, vol. 29, no. 3, pp. 89–96, 2018, doi: 10.4067/S0718-07642018000300089.

Q. Hu and H. Rangwala, “Towards Fair Educational Data Mining: A Case Study on Detecting At-risk Students,” in Proceedings of the 13th International Conference on Educational Data Mining, EDM 2020, 2020, pp. 431–437.

D. Ebbert and S. Dutke, “Patterns in students’ usage of lecture recordings: A cluster analysis of self-report data,” Research in Learning Technology, vol. 28, 2020, doi: 10.25304/rlt.v28.2258.

P. Liu, H. Yuan, Y. Ning, B. Chakraborty, N. Liu, and M. A. Peres, “A modified and weighted Gower distance-based clustering analysis for mixed type data: a simulation and empirical analyses,” BMC Med Res Methodol, vol. 24, no. 1, 2024, doi: 10.1186/s12874-024-02427-8.

J. De La Cruz Hernandez, K. J. Tobin, J. C. Kilburn, and M. E. Bennett, “Using a Modified Gower Distance Measure to Assess Supplemental Learning Supporting an Online Social Science Graduate Course,” Educ Sci (Basel), vol. 15, no. 3, 2025, doi: 10.3390/educsci15030371.

C. Alalouch, “Cognitive styles, gender, and student academic performance in engineering education,” Educ Sci (Basel), vol. 11, no. 9, 2021, doi: 10.3390/educsci11090502.

S. Jiang, J. McClure, C. Tatar, F. Bickel, C. P. Rosé, and J. Chao, “Towards inclusivity in AI: A comparative study of cognitive engagement between marginalized female students and peers,” British Journal of Educational Technology, vol. 55, no. 6, pp. 2557–2573, 2024, doi: 10.1111/bjet.13467.

Additional Files

Published

2025-10-23

How to Cite

[1]
S. . Alyasyifa, O. N. . Pratiwi, and I. . Darmawan, “Data-Driven Student Group Formation for Group Investigation: A K-Medoids Clustering Approach in Cooperative Learning”, J. Tek. Inform. (JUTIF), vol. 6, no. 5, pp. 3886–3898, Oct. 2025.