Interaction Locality in Hierarchical Recursive Reasoning

📅 2026-05-20
📈 Citations: 0
Influential: 0
📄 PDF

career value

211K/year
🤖 AI Summary
This study investigates the interplay between local information flow and global structure in spatial reasoning, particularly within recursive and hierarchical models. To this end, it introduces the “interaction locality” framework, which formalizes the intuitive notion of “local execution with global planning” into a reproducible, quantitative methodology. By integrating task geometry, the framework assesses whether information propagates within neighboring units or semantic segments versus across structural boundaries. Validation on HRM/TRM and MTU3D systems—through sparse autoencoder feature ablation, noisy activation patching, structured Jacobian analysis, and attention mechanisms—reveals that recursive models exhibit strong local write patterns, while causal locality in 3D embodied models concentrates at module interfaces. These findings elucidate how localized operations cumulatively give rise to global architectural coherence.
📝 Abstract
Spatial reasoning requires both location-bound computation and location-invariant structure: agents must make local moves while preserving route, object, or constraint-level plans. We propose interaction locality, a task-geometry-aware framework for measuring whether information flow stays within nearby cells or semantic segments, or crosses them. We instantiate the framework with sparse-autoencoder feature ablations and finite-noise activation patching, with structural Jacobian and attention checks reported in the appendix, and apply it to HRM and TRM, two compact hierarchical and recursive reasoning models, on Maze-Hard, Sudoku Extreme, and ARC-AGI. Across these models, activation patching gives the clearest architectural fingerprint: high-level recurrent states tend to write information within nearby cells or same-segment units, while repeated recursive updates accumulate these local writes into broader solution structure. This pattern holds across maze paths, Sudoku constraints, and ARC-AGI object neighborhoods, with the strongest concentration in TRM. To test whether interaction locality extends beyond toy-yet-challenging grid benchmarks, we also apply it to MTU3D, a large-scale embodied 3D scene-grounding model. In this MTU3D setting, causal spatial locality appears primarily at the transition where visual scene features are handed to the downstream grounding module, rather than uniformly throughout the visual encoder. This contrast suggests that the local-to-global handoff observed in HRM and TRM is tied to explicit recursive reasoning dynamics, while embodied 3D models may concentrate causal spatial structure at module boundaries. Interaction locality turns the intuitive local-execution/global-planning story into a reproducible measurement framework for recursive and embodied spatial reasoning.
Problem

Research questions and friction points this paper is trying to address.

interaction locality
hierarchical recursive reasoning
spatial reasoning
information flow
locality
Innovation

Methods, ideas, or system contributions that make the work stand out.

interaction locality
hierarchical recursive reasoning
activation patching
spatial reasoning
causal spatial structure
🔎 Similar Papers
2024-02-26Annual Meeting of the Association for Computational LinguisticsCitations: 97
2024-09-12arXiv.orgCitations: 2