Learning to Solve Complex Problems via Dataset Decomposition

📅 2026-02-23
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
This work proposes a reverse curriculum learning framework tailored for structurally complex and conceptually deep tasks such as advanced mathematical problem solving and code generation. The approach introduces a difficulty scoring mechanism based on structural complexity and conceptual depth, and employs a teacher–student architecture to recursively decompose challenging problems. The teacher model generates progressively simplified examples through step-by-step reasoning, thereby constructing an easy-to-hard curriculum that guides the student model in incremental learning. Experimental results on benchmarks including MATH and AIME demonstrate that this method significantly outperforms standard training strategies, effectively enhancing the model’s capacity to solve complex problems.

Technology Category

Application Category

📝 Abstract
Curriculum learning is a class of training strategies that organizes the data being exposed to a model by difficulty, gradually from simpler to more complex examples. This research explores a reverse curriculum generation approach that recursively decomposes complex datasets into simpler, more learnable components. We propose a teacher-student framework where the teacher is equipped with the ability to reason step-by-step, which is used to recursively generate easier versions of examples, enabling the student model to progressively master difficult tasks. We propose a novel scoring system to measure data difficulty based on its structural complexity and conceptual depth, allowing curriculum construction over decomposed data. Experiments on math datasets (MATH and AIME) and code generation datasets demonstrate that models trained with curricula generated by our approach exhibit superior performance compared to standard training on original datasets.
Problem

Research questions and friction points this paper is trying to address.

curriculum learning
dataset decomposition
complex problem solving
data difficulty
structured reasoning
Innovation

Methods, ideas, or system contributions that make the work stand out.

curriculum learning
dataset decomposition
teacher-student framework
difficulty scoring
recursive simplification
🔎 Similar Papers
No similar papers found.