🤖 AI Summary
Traditional example-driven in-context learning (ICL) lacks explicit mechanisms for retrieving and transferring knowledge at the level of abstraction, and large language models do not reliably construct or leverage schema-based reasoning representations on their own. To address this, we propose Schema-Activated ICL (SA-ICL), an ICL framework that integrates schema theory from cognitive science. SA-ICL extracts the building blocks of the reasoning process from prior demonstrations into a lightweight, structured template of key inferential steps and their relationships, then activates this schema to scaffold the model's reasoning on novel questions, bridging strategies ranging from pattern priming to Chain-of-Thought prompting. On chemistry and physics questions from the GPQA dataset, SA-ICL improves accuracy by up to 36.19 percent when a single high-quality demonstration is provided, while reducing reliance on demonstration count and enhancing both interpretability and human-like reasoning.
📝 Abstract
In-Context Learning (ICL) enables transformer-based language models to adapt to new tasks by conditioning on demonstration examples. However, traditional example-driven ICL lacks explicit modules for knowledge retrieval and transfer at the level of abstraction. Inspired by schema theory from cognitive science, which holds that humans interpret new information by activating pre-existing mental frameworks (schemas) that structure understanding, we introduce Schema-Activated In-Context Learning (SA-ICL). The framework extracts from prior examples a representation of the building blocks of cognition underlying the reasoning process and distills it into an abstracted schema: a lightweight, structured template of key inferential steps and their relationships. This schema is then used to augment the model's reasoning when it is presented with a novel question. We demonstrate that a broad range of large language models (LLMs) lack the capacity to form and utilize such schema-based learning representations implicitly, but benefit significantly from explicit schema-based scaffolding. On chemistry and physics questions from the GPQA dataset, our experiments show that SA-ICL consistently boosts performance, by up to 36.19 percent when the single demonstration example is of high quality, while simultaneously reducing reliance on the number of demonstrations and enhancing interpretability. Schema-Activated In-Context Learning not only bridges disparate ICL strategies, ranging from pattern priming to Chain-of-Thought prompting, but also paves a new path for enhancing human-like reasoning in LLMs.
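The abstract describes SA-ICL only at a high level, so the following is a minimal sketch of one way the two-stage idea could be wired up: first distill an abstracted schema (key inferential steps and their relationships) from a single high-quality demonstration, then activate that schema as scaffolding when answering a new question. The prompt wording, the `complete` callable, and the function names here are illustrative assumptions, not the paper's implementation.

```python
from typing import Callable

# `complete` stands in for any LLM text-completion call (assumption: the caller
# supplies one, e.g. a thin wrapper around their provider's API).
CompleteFn = Callable[[str], str]

SCHEMA_PROMPT = """\
You are given a worked example (question + reasoning + answer).
Distill an abstracted schema from it: a short, structured template listing the
key inferential steps and how they relate, with no surface details carried over
from the example itself.

Worked example:
{demonstration}

Abstracted schema:"""

ANSWER_PROMPT = """\
Use the following reasoning schema as scaffolding. Instantiate each step with
the specifics of the new question, then give a final answer.

Schema:
{schema}

Question:
{question}

Reasoning and answer:"""


def extract_schema(complete: CompleteFn, demonstration: str) -> str:
    """Stage 1: distill a lightweight reasoning template from one demonstration."""
    return complete(SCHEMA_PROMPT.format(demonstration=demonstration)).strip()


def schema_activated_answer(complete: CompleteFn, demonstration: str, question: str) -> str:
    """Stage 2: activate the extracted schema when answering a novel question."""
    schema = extract_schema(complete, demonstration)
    return complete(ANSWER_PROMPT.format(schema=schema, question=question)).strip()
```

In this sketch a single worked example is enough to populate the schema, which mirrors the paper's observation that SA-ICL reduces reliance on the number of demonstrations when the one demonstration is of high quality.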