XAM: Interactive Explainability for Authorship Attribution Models

📅 2025-12-07

📈 Citations: 0

✨ Influential: 0

career value

214K/year

🤖 AI Summary

To address the weak interpretability and reliance on predefined stylistic labels in embedded author attribution models, this paper proposes IXAM, an interactive explanation framework. IXAM enables multi-granularity exploration—lexical, sentential, and document-level—within the embedding space, allowing users to dynamically construct prediction explanations grounded in actual writing features, thereby overcoming limitations of static rules or fixed stylistic dimensions. By integrating explainable AI (XAI) techniques with interactive visualization, IXAM supports user-driven explanation generation and validation. A user study demonstrates that IXAM significantly improves explanation credibility, depth of understanding, and interaction satisfaction (p < 0.01). This work establishes a novel explanation paradigm for black-box text classification models, characterized by flexibility, transparency, and practical usability.

Technology Category

Application Category

📝 Abstract

We present IXAM, an Interactive eXplainability framework for Authorship Attribution Models. Given an authorship attribution (AA) task and an embedding-based AA model, our tool enables users to interactively explore the model's embedding space and construct an explanation of the model's prediction as a set of writing style features at different levels of granularity. Through a user evaluation, we demonstrate the value of our framework compared to predefined stylistic explanations.

Problem

Research questions and friction points this paper is trying to address.

Develops an interactive framework for explaining authorship attribution models

Enables exploration of embedding spaces and style feature explanations

Compares interactive explanations with predefined stylistic analysis methods

Innovation

Methods, ideas, or system contributions that make the work stand out.

Interactive exploration of embedding space for authorship attribution

Constructs explanations as multi-granularity writing style features

User evaluation shows value over predefined stylistic explanations

🔎 Similar Papers

Authorship Attribution in the Era of LLMs: Problems, Methodologies, and Challenges