LAMMI-Pathology: A Tool-Centric Bottom-Up LVLM-Agent Framework for Molecularly Informed Medical Intelligence in Pathology

📅 2026-02-21

📈 Citations: 0

✨ Influential: 0

career value

210K/year

🤖 AI Summary

This work addresses the limitations of traditional pathology image analysis, which relies on coarse-grained visual-textual diagnosis and lacks molecular-level evidence, thereby hindering precise and interpretable AI-driven diagnostics. To overcome this, the authors propose a tool-centric, bottom-up large vision-language model agent framework that integrates domain-adaptive tools, a hierarchical planner, and atomic execution nodes (AENs) to enable molecular-informed pathological reasoning. The study introduces a novel AEN-based reasoning trajectory construction mechanism coupled with a trajectory-aware fine-tuning strategy, effectively mitigating task drift caused by long-context inputs. This approach substantially enhances tool invocation accuracy and reasoning robustness in complex pathological tasks, ultimately establishing a scalable and highly trustworthy intelligent diagnostic system.

Technology Category

Application Category

📝 Abstract

The emergence of tool-calling-based agent systems introduces a more evidence-driven paradigm for pathology image analysis in contrast to the coarse-grained text-image diagnostic approaches. With the recent large-scale experimental adoption of spatial transcriptomics technologies, molecularly validated pathological diagnosis is becoming increasingly open and accessible. In this work, we propose LAMMI-Pathology (LVLM-Agent System for Molecularly Informed Medical Intelligence in Pathology), a scalable agent framework for domain-specific agent tool-calling. LAMMI-Pathology adopts a tool-centric, bottom-up architecture in which customized domain-adaptive tools serve as the foundation. These tools are clustered by domain style to form component agents, which are then coordinated through a top-level planner hierarchically, avoiding excessively long context lengths that could induce task drift. Based on that, we introduce a novel trajectory construction mechanism based on Atomic Execution Nodes (AENs), which serve as reliable and composable units for building semi-simulated reasoning trajectories that capture credible agent-tool interactions. Building on this foundation, we develop a trajectory-aware fine-tuning strategy that aligns the planner's decision-making process with these multi-step reasoning trajectories, thereby enhancing inference robustness in pathology understanding and its adaptive use of the customized toolset.

Problem

Research questions and friction points this paper is trying to address.

molecularly informed pathology

tool-calling agent

evidence-driven diagnosis

spatial transcriptomics

pathology image analysis

Innovation

Methods, ideas, or system contributions that make the work stand out.

tool-centric agent

bottom-up architecture

Atomic Execution Nodes