Beyond classical and contemporary models: a transformative ai framework for student dropout prediction in distance learning using rag, prompt engineering, and cross-modal fusion

๐Ÿ“… 2025-07-04
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Student attrition in online education remains a critical challenge, and conventional machine learning models struggle to capture affective and contextual factors embedded in unstructured interactions. This paper proposes a multimodal dropout risk prediction framework that jointly models textual sentiment (leveraging RAG-augmented domain-specific BERT with prompt engineering to detect academic stress), temporal behavioral patterns, and sociodemographic attributes. A cross-modal attention mechanism enables dynamic, fine-grained integration across modalities. The framework ensures strong model interpretability and supports generation of actionable, personalized intervention recommendations. Evaluated on a dataset of 4,423 students, the model achieves 89% accuracy and an F1-score of 0.88โ€”outperforming baseline methods by 7% in F1 and reducing false-negative rate by 21%. This work establishes a novel paradigm for intelligent academic support in remote education.

Technology Category

Application Category

๐Ÿ“ Abstract
Student dropout in distance learning remains a critical challenge, with profound societal and economic consequences. While classical machine learning models leverage structured socio-demographic and behavioral data, they often fail to capture the nuanced emotional and contextual factors embedded in unstructured student interactions. This paper introduces a transformative AI framework that redefines dropout prediction through three synergistic innovations: Retrieval-Augmented Generation (RAG) for domain-specific sentiment analysis, prompt engineering to decode academic stressors, and cross-modal attention fusion to dynamically align textual, behavioral, and socio-demographic insights. By grounding sentiment analysis in a curated knowledge base of pedagogical content, our RAG-enhanced BERT model interprets student comments with unprecedented contextual relevance, while optimized prompts isolate indicators of academic distress (e.g., "isolation," "workload anxiety"). A cross-modal attention layer then fuses these insights with temporal engagement patterns, creating holistic risk profiles. Evaluated on a longitudinal dataset of 4 423 students, the framework achieves 89% accuracy and an F1-score of 0.88, outperforming conventional models by 7% and reducing false negatives by 21%. Beyond prediction, the system generates interpretable interventions by retrieving contextually aligned strategies (e.g., mentorship programs for isolated learners). This work bridges the gap between predictive analytics and actionable pedagogy, offering a scalable solution to mitigate dropout risks in global education systems
Problem

Research questions and friction points this paper is trying to address.

Predict student dropout in distance learning using AI
Analyze unstructured emotional and contextual student data
Improve prediction accuracy with cross-modal data fusion
Innovation

Methods, ideas, or system contributions that make the work stand out.

RAG-enhanced BERT for contextual sentiment analysis
Prompt engineering to detect academic distress
Cross-modal fusion for holistic risk profiling
๐Ÿ”Ž Similar Papers
No similar papers found.
M
Miloud Mihoubi
Institute of Applied Artificial Intelligence (I2A Institute), University of TELUQ
M
Meriem Zerkouk
Institute of Applied Artificial Intelligence (I2A Institute), University of TELUQ
Belkacem Chikhaoui
Belkacem Chikhaoui
Full professor, TELUQ University, Montreal
Data minigmachine learningQuantum machine learningArtificial intelligence