Enhancing Student Performance Prediction with Limited Data in Distance Learning Environments

Mulyanto Mulyanto; Evizal Abdul Kadir

doi:10.24014/sitekin.v22i2.37515

Enhancing Student Performance Prediction with Limited Data in Distance Learning Environments

Mulyanto Mulyanto, Evizal Abdul Kadir

Abstract

Providing early predictions of student performance assessments is an essential task in the educational system. Previous studies on predicting student performance assessments have traditionally relied on academic scores and test indicators. The utilization of assignments, grades, and exams has been an extensive and successful method for evaluating student performance. However, with the increasing popularity of distance learning, a new perspective has emerged. The Online Learning Management System (OLMS) provides a wide array of features that can be leveraged in various ways to predict student performance. This study aims to propose an alternative approach to predicting student performance assessments by utilizing student engagement in an online learning management system. The study strives to investigate and analyze prospective features based on student activity. Bagging ensemble learning methods are proposed to predict student performance assessments through oversampling datasets. The effectiveness of these prediction models is then compared with various machine-learning models, with the results indicating that the proposed model outperforms others at all comparison levels. Furthermore, the proposed model demonstrates the ability to discriminate and predict student performance assessments based on OLMS-related features.

Keywords: Student Performance Assessment, Ensemble Learning, Machine Learning, Student Performance Prediction

Full Text:

PDF

References

F. V. Ferraro, F. I. Ambra, L. Aruta, and M. L. Iavarone, “Distance learning in the covid-19 era: Perceptions in Southern Italy,” Educ. Sci., vol. 10, no. 12, pp. 1–10, 2020, doi: 10.3390/educsci10120355.

A. Bozkurt, “From Distance Education to Open and Distance Learning,” in Handbook of Research on Learning in the Age of Transhumanism, no. April, 2019, pp. 252–273. doi: 10.4018/978-1-5225-8431-5.ch016.

A. Moubayed, M. Injadat, A. B. Nassif, H. Lutfiyya, and A. Shami, “E-Learning: Challenges and Research Opportunities Using Machine Learning Data Analytics,” IEEE Access, vol. 6, pp. 39117–39138, 2018, doi: 10.1109/ACCESS.2018.2851790.

R. Conijn, C. Snijders, A. Kleingeld, and U. Matzat, “Predicting student performance from LMS data: A comparison of 17 blended courses using moodle LMS,” IEEE Trans. Learn. Technol., vol. 10, no. 1, pp. 17–29, 2017, doi: 10.1109/TLT.2016.2616312.

S. Abuhammad, “Barriers to distance learning during the COVID-19 outbreak: A qualitative review from parents’ perspective,” Heliyon, vol. 6, no. 11, p. e05482, 2020, doi: 10.1016/j.heliyon.2020.e05482.

K. T. Chui, R. W. Liu, M. Zhao, and P. Ordóñez de Pablos, “Predicting Students’ Performance with School and Family Tutoring Using Generative Adversarial Network-Based Deep Support Vector Machine,” IEEE Access, vol. 8, pp. 86745–86752, 2020, doi: 10.1109/ACCESS.2020.2992869.

N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, “SMOTE: Synthetic Minority Over-sampling Technique,” J. Artif. Intell. Res., vol. 16, no. January, pp. 321–357, 2002, doi: 10.1613/jair.953.

I. Alazzam, I. Alsmadi, and M. Akour, “Software fault proneness prediction: a comparative study between bagging, boosting, and stacking ensemble and base learner methods,” Int. J. Data Anal. Tech. Strateg., vol. 9, no. 1, p. 1, 2017, doi: 10.1504/ijdats.2017.10003991.

A. Amin et al., “Comparing Oversampling Techniques to Handle the Class Imbalance Problem: A Customer Churn Prediction Case Study,” IEEE Access, vol. 4, no. Ml, pp. 7940–7957, 2016, doi: 10.1109/ACCESS.2016.2619719.

H. Zhang, L. Huang, C. Q. Wu, and Z. Li, “An effective convolutional neural network based on SMOTE and Gaussian mixture model for intrusion detection in imbalanced dataset,” Comput. Networks, vol. 177, no. May, 2020, doi: 10.1016/j.comnet.2020.107315.

E. Fernandes, M. Holanda, M. Victorino, V. Borges, R. Carvalho, and G. Van Erven, “Educational data mining: Predictive analysis of academic performance of public school students in the capital of Brazil,” J. Bus. Res., vol. 94, no. August 2017, pp. 335–343, 2019, doi: 10.1016/j.jbusres.2018.02.012.

A. Polyzou and G. Karypis, “Feature Extraction for Next-Term Prediction of Poor Student Performance,” IEEE Trans. Learn. Technol., vol. 12, no. 2, pp. 237–248, 2019, doi: 10.1109/TLT.2019.2913358.

A. Kumar and M. Jain, Ensemble Learning for AI Developers. 2020. doi: 10.1007/978-1-4842-5940-5.

Zhi-Hua Zhou, Ensemble Methods, Foundations and Algorithms. 2012.

J. Beemer, K. Spoon, L. He, J. Fan, and R. A. Levine, “Ensemble Learning for Estimating Individualized Treatment Effects in Student Success Studies,” Int. J. Artif. Intell. Educ., vol. 28, no. 3, pp. 315–335, 2018, doi: 10.1007/s40593-017-0148-x.

G. Fumera, F. Roli, and A. Serrau, “A theoretical analysis of bagging as a linear combination of classifiers,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 30, no. 7, pp. 1293–1299, 2008, doi: 10.1109/TPAMI.2008.30.

M. Mulyanto, S. W. Prakosa, M. Faisal, and J.-S. Leu, “Using Optimized Focal Loss for Imbalanced Dataset on Network Intrusion Detection System,” in IEEE Vehicular Technology Conference, 2022. doi: 10.1109/VTC2022-Spring54318.2022.9861034.

J. Davis and M. Goadrich, “The Relationship between Precision-Recall and ROC Curves,” in Proceedings of the 23rd International Conference on Machine Learning, in ICML ’06. New York, NY, USA: Association for Computing Machinery, 2006, pp. 233–240. doi: 10.1145/1143844.1143874.

A. Moubayed, M. Injadat, A. Shami, and H. Lutfiyya, “Relationship between student engagement and performance in e-learning environment using association rules,” EDUNINE 2018 - 2nd IEEE World Eng. Educ. Conf. Role Prof. Assoc. Contemp. Eng. Careers, Proc., 2018, doi: 10.1109/EDUNINE.2018.8451005.

M. N. Injadat, A. Moubayed, A. B. Nassif, and A. Shami, “Multi-split optimized bagging ensemble model selection for multi-class educational data mining,” Appl. Intell., vol. 50, pp. 4506–4528, 2020.

X. Xu, J. Wang, H. Peng, and R. Wu, “Prediction of academic performance associated with internet usage behaviors using machine learning algorithms,” Comput. Human Behav., vol. 98, no. April, pp. 166–173, 2019, doi: 10.1016/j.chb.2019.04.015.

I. A. A. Amra and A. Maghari, “Students Performance Prediction Using KNN and Naïve Bayesian,” Int. Conf. Inf. Technol., pp. 909–913, 2017.

M. Masud, J. Gao, L. Khan, J. Han, and B. M. Thuraisingham, “Classification and novel class detection in concept-drifting data streams under time constraints,” IEEE Trans. Knowl. Data Eng., vol. 23, no. 6, pp. 859–874, 2011, doi: 10.1109/TKDE.2010.61.

DOI: http://dx.doi.org/10.24014/sitekin.v22i2.37515

Refbacks

There are currently no refbacks.

Editorial Address:

FAKULTAS SAINS DAN TEKNOLOGI
UIN SULTAN SYARIF KASIM RIAU
Kampus Raja Ali Haji
Gedung Fakultas Sains & Teknologi UIN Suska Riau
Jl.H.R.Soebrantas No.155 KM 18 Simpang Baru Panam, Pekanbaru 28293

Whatsapp (Direct Chat)

SITEKIN Journal Indexing:

Google Scholar | Garuda | Moraref | IndexCopernicus | SINTA

SITEKIN by http://ejournal.uin-suska.ac.id/index.php

Username
Password
Remember me