Chat-Based Support Alone May Not Be Enough: Comparing Conversational and Embedded LLM Feedback for Mathematical Proof Learning

📅 2026-02-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study investigates whether tutoring solely through conversational large language models (LLMs) is sufficient to effectively support learning in mathematical proof construction, and compares its efficacy against embedded, structured feedback in fostering knowledge transfer. Using the GPTutor system, the research presents the first empirical comparison in a discrete mathematics course between students who received LLM assistance via conversational question-answering and those who interacted with LLM-generated annotations embedded directly within their proof-writing workspace. Integrating manual behavioral coding with automated classification of interaction logs, the findings reveal that frequent use of the chatbot—particularly when coupled with a tendency to seek direct answers—is significantly negatively associated with subsequent exam performance. In contrast, embedded feedback showed no such detrimental effect, suggesting it better supports sustainable learning outcomes.

Technology Category

Application Category

📝 Abstract
We evaluate GPTutor, an LLM-powered tutoring system for an undergraduate discrete mathematics course. It integrates two LLM-supported tools: a structured proof-review tool that provides embedded feedback on students' written proof attempts, and a chatbot for math questions. In a staggered-access study with 148 students, earlier access was associated with higher homework performance during the interval when only the experimental group could use the system, while we did not observe this performance increase transfer to exam scores. Usage logs show that students with lower self-efficacy and prior exam performance used both components more frequently. Session-level behavioral labels, produced by human coding and scaled using an automated classifier, characterize how students engaged with the chatbot (e.g., answer-seeking or help-seeking). In models controlling for prior performance and self-efficacy, higher chatbot usage and answer-seeking behavior were negatively associated with subsequent midterm performance, whereas proof-review usage showed no detectable independent association. Together, the findings suggest that chatbot-based support alone may not reliably support transfer to independent assessment of math proof-learning outcomes, whereas work-anchored, structured feedback appears less associated with reduced learning.
Problem

Research questions and friction points this paper is trying to address.

mathematical proof learning
chatbot feedback
embedded feedback
learning transfer
LLM tutoring
Innovation

Methods, ideas, or system contributions that make the work stand out.

structured feedback
conversational AI
mathematical proof learning
LLM tutoring system
behavioral coding
🔎 Similar Papers
No similar papers found.
Eason Chen
Eason Chen
Human-Computer Interaction Institute, Carnegie Mellon University
Learning SciencesEducation TechnologiesLearning AnalyticsBlockchain
S
Sophia Judicke
Carnegie Mellon University, Pittsburgh, PA, USA
K
Kayla Beigh
Carnegie Mellon University, Pittsburgh, PA, USA
X
Xinyi Tang
Carnegie Mellon University, Pittsburgh, PA, USA
I
Isabel Wang
Carnegie Mellon University, Pittsburgh, PA, USA
N
Nina Yuan
Carnegie Mellon University, Pittsburgh, PA, USA
Z
Zimo Xiao
Carnegie Mellon University, Pittsburgh, PA, USA
C
Chuangji Li
Carnegie Mellon University, Pittsburgh, PA, USA
S
Shizhuo Li
Carnegie Mellon University, Pittsburgh, PA, USA
R
Reed Luttmer
Carnegie Mellon University, Pittsburgh, PA, USA
Shreya Singh
Shreya Singh
IIT Jammu
Cyber Security
M
Maria Yampolsky
Carnegie Mellon University, Pittsburgh, PA, USA
N
Naman Parikh
Carnegie Mellon University, Pittsburgh, PA, USA
Y
Yvonne Zhao
Carnegie Mellon University, Pittsburgh, PA, USA
M
Meiyi Chen
Carnegie Mellon University, Pittsburgh, PA, USA
S
Scarlett Huang
Carnegie Mellon University, Pittsburgh, PA, USA
A
Anishka Mohanty
Carnegie Mellon University, Pittsburgh, PA, USA
G
Gregory Johnson
Carnegie Mellon University, Pittsburgh, PA, USA
John Mackey
John Mackey
Professor of Oncology, University of Alberta
cancerclinical trialsdrug developmentphoto acoustic remote sensing
Jionghao Lin
Jionghao Lin
University of Hong Kong | Carnegie Mellon University | Monash University
Artificial Intelligence in EducationLearning AnalyticsHuman-Centered AIFeedbackDiscourse
Ken Koedinger
Ken Koedinger
HCII, Carnegie Mellon University
Educational Data MiningArtificial Intelligence in EducationLearning EngineeringIntelligent Tutoring Systems