Predicting Student Success with Heterogeneous Graph Deep Learning and Machine Learning Models

๐Ÿ“… 2026-01-11
๐Ÿ›๏ธ arXiv.org
๐Ÿ“ˆ Citations: 1
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This study addresses the challenge of enabling early and continuous prediction of academic success by leveraging multi-source, heterogeneous student data that includes dynamic assessment features and diverse entity relationships. To this end, the authors propose a novel framework that integrates heterogeneous graph neural networks with traditional machine learning methods, uniquely incorporating dynamic assessment features into heterogeneous graph modeling. Complex dependencies among students, courses, and assessments are captured through carefully designed meta-paths. Evaluated on the OULA dataset, the approach achieves a validation F1-score of 68.6% using only 7% of semester data and reaches 89.5% by the semesterโ€™s end, outperforming the best conventional model by 4.7% in early prediction performance. These results demonstrate the methodโ€™s effectiveness and innovation in modeling temporal and relational student data for academic outcome forecasting.

Technology Category

Application Category

๐Ÿ“ Abstract
Early identification of student success is crucial for enabling timely interventions, reducing dropout rates, and promoting on time graduation. In educational settings, AI powered systems have become essential for predicting student performance due to their advanced analytical capabilities. However, effectively leveraging diverse student data to uncover latent and complex patterns remains a key challenge. While prior studies have explored this area, the potential of dynamic data features and multi category entities has been largely overlooked. To address this gap, we propose a framework that integrates heterogeneous graph deep learning models to enhance early and continuous student performance prediction, using traditional machine learning algorithms for comparison. Our approach employs a graph metapath structure and incorporates dynamic assessment features, which progressively influence the student success prediction task. Experiments on the Open University Learning Analytics (OULA) dataset demonstrate promising results, achieving a 68.6% validation F1 score with only 7% of the semester completed, and reaching up to 89.5% near the semester's end. Our approach outperforms top machine learning models by 4.7% in validation F1 score during the critical early 7% of the semester, underscoring the value of dynamic features and heterogeneous graph representations in student success prediction.
Problem

Research questions and friction points this paper is trying to address.

student success prediction
heterogeneous graph
dynamic features
early intervention
educational data mining
Innovation

Methods, ideas, or system contributions that make the work stand out.

heterogeneous graph
deep learning
dynamic features
student success prediction
metapath
๐Ÿ”Ž Similar Papers
No similar papers found.