LACE: Lattice Attention for Cross-thread Exploration

๐Ÿ“… 2026-04-16
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF

career value

205K/year
๐Ÿค– AI Summary
This work addresses the limitation of existing large language models in parallel reasoning, where independent inference paths often lead to redundant errors. To overcome this, the authors propose LACE, a novel framework that enables real-time interaction and collaborative error correction among multiple reasoning threads for the first time. LACE introduces a lattice attention mechanism and a cross-thread attention architecture, allowing paths to dynamically share intermediate insights during inference. Additionally, the framework incorporates a synthetic data generation pipeline specifically designed to train models for effective collaboration. Experimental results demonstrate that LACE improves reasoning accuracy by over 7 percentage points compared to standard parallel search methods.

Technology Category

Application Category

๐Ÿ“ Abstract
Current large language models reason in isolation. Although it is common to sample multiple reasoning paths in parallel, these trajectories do not interact, and often fail in the same redundant ways. We introduce LACE, a framework that transforms reasoning from a collection of independent trials into a coordinated, parallel process. By repurposing the model architecture to enable cross-thread attention, LACE allows concurrent reasoning paths to share intermediate insights and correct one another during inference. A central challenge is the absence of natural training data that exhibits such collaborative behavior. We address this gap with a synthetic data pipeline that explicitly teaches models to communicate and error-correct across threads. Experiments show that this unified exploration substantially outperforms standard parallel search, improving reasoning accuracy by over 7 points. Our results suggest that large language models can be more effective when parallel reasoning paths are allowed to interact.
Problem

Research questions and friction points this paper is trying to address.

parallel reasoning
cross-thread interaction
reasoning collaboration
redundant failure
independent inference
Innovation

Methods, ideas, or system contributions that make the work stand out.

cross-thread attention
parallel reasoning
collaborative inference
synthetic training data
lattice attention
๐Ÿ”Ž Similar Papers