🤖 AI Summary
Existing LLM interpretability tools are fragmented and require substantial coding expertise, which hinders natural-language interaction and visualization-supported explanation. This paper proposes a multi-agent conversational interpretability platform that integrates heterogeneous explanation tools into a unified dialogue workflow via a modular agent architecture comprising query reformulation, intelligent routing, and context integration. An orchestrating LLM coordinates agent routers and domain-specific analytical modules, enabling model upload, natural-language question answering, and interactive, visualization-augmented explanation generation. The key contribution is “zero-code” deep model introspection: users can perform comprehensive interpretability analyses without programming, significantly lowering technical barriers. This improves the usability, extensibility, and accessibility of interpretability tooling and establishes a new paradigm for building transparent and trustworthy AI systems.
📝 Abstract
We develop KnowThyself, an agentic assistant that advances large language model (LLM) interpretability. Existing tools provide useful insights but remain fragmented and code-intensive. KnowThyself consolidates these capabilities into a chat-based interface where users can upload models, pose natural-language questions, and obtain interactive visualizations with guided explanations. At its core, an orchestrator LLM first reformulates user queries, an agent router then directs them to specialized modules, and the outputs are finally contextualized into coherent explanations. This design lowers technical barriers and provides an extensible platform for LLM inspection. By embedding the whole process in a conversational workflow, KnowThyself offers a robust foundation for accessible LLM interpretability.
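The three-stage pipeline the abstract describes (query reformulation, agent routing, context integration) can be sketched as follows. This is a minimal illustration, not the paper's implementation: every function and module name here is an assumption, and simple keyword matching stands in for the LLM-driven reformulation and routing stages.

```python
# Hypothetical sketch of a reformulate -> route -> contextualize pipeline,
# loosely mirroring the orchestrator/router/module design described above.
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class Query:
    raw: str                 # the user's original question
    reformulated: str = ""   # filled in by the orchestrator step


def reformulate(q: Query) -> Query:
    """Stand-in for the orchestrator LLM's query-reformulation step."""
    q.reformulated = q.raw.strip().lower()
    return q


# Placeholder analysis modules the router can dispatch to.
def attention_module(q: Query) -> str:
    return f"attention analysis for: {q.reformulated}"


def probing_module(q: Query) -> str:
    return f"probing analysis for: {q.reformulated}"


MODULES: Dict[str, Callable[[Query], str]] = {
    "attention": attention_module,
    "probing": probing_module,
}


def route(q: Query) -> Callable[[Query], str]:
    """Keyword routing stands in for the LLM-based agent router."""
    for keyword, module in MODULES.items():
        if keyword in q.reformulated:
            return module
    return probing_module  # arbitrary default module


def contextualize(q: Query, result: str) -> str:
    """Final step: wrap module output into a user-facing explanation."""
    return f"Q: {q.raw}\nA: {result}"


def answer(raw_query: str) -> str:
    """Run one query through all three stages of the pipeline."""
    q = reformulate(Query(raw_query))
    module = route(q)
    return contextualize(q, module(q))
```

In the real system each stage would be backed by an LLM call and the modules would produce visualizations rather than strings, but the control flow, reformulate once, route to exactly one specialist, then merge the result back into the conversation, is the same.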