ZENODO
Article . 2025
License: CC BY
Data sources: ZENODO
Data sources: Datacite

COMPARATIVE STUDY OF CLUSTERING ALGORITHMS FOR STUDENT PERFORMANCE EVALUATION

Authors: Karim Md Razaul; Cheng Zekai

Abstract

Predicting student performance is essential for enhancing educational outcomes, enabling educators to identify students who may need additional support or intervention. Clustering algorithms, as unsupervised data mining techniques, are particularly effective at uncovering patterns in student performance data. These algorithms can group students based on their exam scores, providing insights that allow for more tailored and targeted educational strategies. This study compares four unsupervised methods, namely K-Means, DBSCAN, Hierarchical Clustering (Ward linkage), and Gaussian Mixture Models (GMM), on a dataset of 200 students’ scores across five exam questions. After standardizing the data, we project it into two dimensions via Principal Component Analysis (PCA) for visualization. We then evaluate each model using three validation metrics: Silhouette Score, Davies-Bouldin Index, and Calinski-Harabasz Index. K-Means with k = 5 achieves the highest Silhouette (0.387) and Calinski-Harabasz (90.156) scores and the lowest Davies-Bouldin Index (0.883), outperforming the alternatives in both visual separation and quantitative metrics. DBSCAN identifies noise but yields overlapping clusters; Hierarchical clustering shows moderate cohesion; GMM produces softer boundaries. Our results demonstrate that K-Means offers the most interpretable and robust grouping for this educational dataset, providing a practical tool for segmenting students into performance tiers. Future work may explore dynamic k-selection methods, incorporation of additional student features, and deployment in intelligent tutoringsystems.
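
The workflow described in the abstract (standardize, project with PCA, fit four clustering models, score each with three validation metrics) could be sketched roughly as follows with scikit-learn. This is an illustration under stated assumptions, not the authors' code: the score matrix is synthetic data from `make_blobs` standing in for the 200 x 5 exam scores, the DBSCAN `eps` and `min_samples` values are arbitrary, the number of components for Ward and GMM is assumed to be 5 to match the K-Means setting, clustering is assumed to run on the standardized features rather than on the 2-D projection, and DBSCAN noise points are excluded before scoring.

```python
import numpy as np
from sklearn.cluster import DBSCAN, AgglomerativeClustering, KMeans
from sklearn.datasets import make_blobs
from sklearn.decomposition import PCA
from sklearn.metrics import (
    calinski_harabasz_score,
    davies_bouldin_score,
    silhouette_score,
)
from sklearn.mixture import GaussianMixture
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for 200 students x 5 exam questions (NOT the paper's data).
scores, _ = make_blobs(
    n_samples=200, n_features=5, centers=5, cluster_std=1.5, random_state=0
)

# Standardize each question to zero mean and unit variance.
X = StandardScaler().fit_transform(scores)

# 2-D projection, used here only for plotting (assumption).
X_2d = PCA(n_components=2, random_state=0).fit_transform(X)

# Hyperparameters below are illustrative assumptions, not values from the paper
# (only k = 5 for K-Means is stated in the abstract).
models = {
    "K-Means": KMeans(n_clusters=5, n_init=10, random_state=0),
    "DBSCAN": DBSCAN(eps=0.8, min_samples=5),
    "Hierarchical (Ward)": AgglomerativeClustering(n_clusters=5, linkage="ward"),
    "GMM": GaussianMixture(n_components=5, random_state=0),
}


def evaluate(features, labels):
    """Return the three validation metrics, or None if they are undefined."""
    keep = labels != -1  # drop DBSCAN noise points (assumption)
    n_clusters = len(set(labels[keep]))
    # All three metrics need at least 2 clusters and fewer clusters than points.
    if n_clusters < 2 or n_clusters >= keep.sum():
        return None
    return {
        "silhouette": silhouette_score(features[keep], labels[keep]),
        "davies_bouldin": davies_bouldin_score(features[keep], labels[keep]),
        "calinski_harabasz": calinski_harabasz_score(features[keep], labels[keep]),
    }


results = {}
for name, model in models.items():
    labels = np.asarray(model.fit_predict(X))
    results[name] = evaluate(X, labels)

for name, metrics in results.items():
    print(name, metrics)
```

When reading the output, higher Silhouette and Calinski-Harabasz values and lower Davies-Bouldin values indicate better-separated clusters, which is the sense in which the abstract ranks the models. The numbers printed for this synthetic data are not comparable to the values reported in the abstract.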
