X-SYNTH: Beyond Retrieval -- Enterprise Context Synthesis from Observed Human Attention

📅 2026-05-15
📈 Citations: 0
Influential: 0
📄 PDF

career value

211K/year
🤖 AI Summary
This study addresses the challenge that critical contextual information for enterprise AI tasks is scattered across multiple systems, rendering traditional content-matching retrieval methods ineffective at identifying context causally relevant to complex tasks—resulting in low true lead rates and high false lead rates. To overcome this limitation, the work proposes an unsupervised context synthesis framework that leverages employees’ observable attention behaviors as a core signal of contextual relevance. The framework constructs individual digital twin signatures and dynamically combines seven types of attention filters—proportional, inverse, differential, recurrent, comparative, sequential, and collective—within a four-stage behavior-pattern-driven assembly pipeline to extract task-relevant context from raw behavioral traces. Evaluated on a sales lead identification task, the method increases the true lead rate from 9.5% to 61.9% (a 6.5× improvement) while reducing the false lead rate from 90.5% to 18.8%, substantially outperforming existing retrieval paradigms.
📝 Abstract
In enterprise operations, the context required for an AI agent task is scattered across systems of record, static information stores, and communication channels. What is stored is system state, a lossy representation of the work that actually happened [2, 52]. The prevailing approach [17, 31, 34, 36] retrieves by matching request content to what is stored; for narrow requests this works well. But synthesis quality depends on knowing what to surface and how to interpret it: knowledge specific to each organization, team, and individual [5, 57, 61], present in behavioral patterns, absent from any retrieval index. For complex agentic tasks it breaks down: True Lead Rate is low, False Lead Rate is high, and the model has no mechanism to improve. We present X-SYNTH, a framework for enterprise context synthesis grounded in human attention, the digitally observable interaction signatures of each worker, encoding not just what they did but the sequence in which they did it, along with implicit reward signals. Behavioral traces preceding positive outcomes are distinguishable from those that did not, without external labeling. X-SYNTH models each individual's behavioral baseline as a Digital Twin Signature (DTS) and selects among seven qualitatively distinct attention filters: Proportional, Inverse, Differential, Recurrent, Comparative, Sequential, and Collective, per individual and per query, to identify causally relevant activity signatures. A four-stage pipeline assembles ranked context grounded in behavioral patterns rather than query embeddings. On a sales lead identification task, a frontier model unaided achieves 9.5% True Lead Rate (TLR) with 90.5% False Lead Rate (FLR). Augmented with X-SYNTH, TLR rises to 61.9% (6.5x) while FLR falls to 18.8%. Enterprise context synthesis is not a retrieval problem. It is a relevance problem, and human attention is its most reliable ground truth.
Problem

Research questions and friction points this paper is trying to address.

enterprise context synthesis
human attention
relevance
behavioral traces
AI agent
Innovation

Methods, ideas, or system contributions that make the work stand out.

context synthesis
human attention
digital twin signature
behavioral traces
attention filters