🤖 AI Summary
To address the challenge of interpreting sensor time-series discrepancies in IoT applications—particularly for non-expert users—this paper introduces the first structured explanation framework specifically designed for temporal differences, automatically generating parseable JSON-formatted explanations. Methodologically, it employs a lightweight time-series encoder to extract discriminative features, while leveraging a frozen large language model (LLM) as a decoder to produce semantically coherent, structured outputs—thereby avoiding the computational and data overhead of LLM fine-tuning. Trained and optimized on the TORI dataset, our approach outperforms visual question-answering baselines and pretrained-encoder-based retrieval methods across three key metrics: explanation accuracy, structural compliance, and user interpretability. Key contributions include: (1) establishing and implementing a standardized, structured explanation paradigm for time-series discrepancies; and (2) empirically validating the efficacy of the frozen-LLM-plus-specialized-temporal-encoder architecture for low-resource explanatory tasks.
📝 Abstract
In many IoT applications, the central interest lies not in individual sensor signals but in their differences, yet interpreting such differences requires expert knowledge. We propose DiffNator, a framework for structured explanations of differences between two time series. We first design a JSON schema that captures the essential properties of such differences. Using the Time-series Observations of Real-world IoT (TORI) dataset, we generate paired sequences and train a model that combine a time-series encoder with a frozen LLM to output JSON-formatted explanations. Experimental results show that DiffNator generates accurate difference explanations and substantially outperforms both a visual question answering (VQA) baseline and a retrieval method using a pre-trained time-series encoder.