Improving Student Graduation Timeliness Prediction Using SMOTE and Ensemble Learning with Stacking and GridSearch CV Optimization

Efendi, Akmar and Fitri, Iskandar and Nurcahyo, Gunadi Widi (2025) Improving Student Graduation Timeliness Prediction Using SMOTE and Ensemble Learning with Stacking and GridSearch CV Optimization. Data and Metadata, 4. pp. 1-10. ISSN 2953-4917

[thumbnail of 1. Improving Student Graduation Timeliness Prediction Using SMOTE.pdf]
Preview
Text
1. Improving Student Graduation Timeliness Prediction Using SMOTE.pdf - Published Version

Download (351kB) | Preview

Abstract

Introduction: timely graduation is a key performance indicator in higher education. This study aims to improve the accuracy of predicting student graduation timeliness using ensemble machine learning techniques combined with SMOTE and hyperparameter optimization. Method: this is a quantitative predictive study. The population includes students and alumni of Universitas Islam Riau. A sample of 160 respondents was obtained via purposive sampling. Data were collected using structured questionnaires encompassing academic variables (e.g., GPA, credits, attendance) and nonacademic variables (e.g., stress, social support, extracurricular activity). After preprocessing and label encoding, SMOTE was applied to balance class distribution. Several classifiers (Naïve Bayes, SVM, Decision Tree, KNN) were tested, with ensemble learning (voting and stacking) implemented and optimized using GridSearchCV. Results: the stacking ensemble model optimized with GridSearchCV achieved the highest performance with an accuracy of 99,37 %, precision and recall above 0,99, and minimal misclassification. This outperformed individual models and previous approaches in the literature. Conclusions: the integration of SMOTE, ensemble methods, and GridSearchCV significantly enhances predictive accuracy for student graduation timeliness. The resulting model provides a robust framework for academic risk detection and early intervention

Item Type: Article
Uncontrolled Keywords: Student Graduation; Ensemble Learning; SMOTE; Stacking; GridSearchCV; Machine Learning
Subjects: Q Science > QA Mathematics > QA76 Computer software
Divisions: > Teknik Informatika
Depositing User: Mia
Date Deposited: 07 Oct 2025 02:01
Last Modified: 07 Oct 2025 02:01
URI: https://repository.uir.ac.id/id/eprint/30874

Actions (login required)

View Item View Item