Domain-Independent Automatic Generation of Descriptive Texts for Time-Series Data

📅 2024-09-25
🏛️ IEEE International Conference on Acoustics, Speech, and Signal Processing
📈 Citations: 3
Influential: 0
📄 PDF
🤖 AI Summary
Time-series data often lack descriptive textual annotations, hindering the training of text generation models. Method: This paper introduces a novel forward-and-backward dual-path pairing construction framework, pioneering a backward generation paradigm: starting from human-crafted rule-based textual descriptions, it synthesizes corresponding time-series data in reverse. Based on this paradigm, we construct TACO—the first cross-domain time-series–text paired dataset. We further propose a contrastive learning–based text generation framework that jointly models abstract time-series features and incorporates backward rule guidance. Results: Experiments demonstrate zero-shot textual generation on unseen domains, significantly improving cross-domain generalization and generated text quality. Moreover, the approach enhances interpretability of time-series understanding and boosts human–machine interaction efficiency.

Technology Category

Application Category

📝 Abstract
Due to scarcity of time-series data annotated with descriptive texts, training a model to generate descriptive texts for time-series data is challenging. In this study, we propose a method to systematically generate domain-independent descriptive texts from time-series data. We identify two distinct approaches for creating pairs of time-series data and descriptive texts: the forward approach and the backward approach. By implementing the novel backward approach, we create the Temporal Automated Captions for Observations (TACO) dataset. Experimental results demonstrate that a contrastive learning based model trained using the TACO dataset is capable of generating descriptive texts for time-series data in novel domains.
Problem

Research questions and friction points this paper is trying to address.

Generates descriptive texts for time-series data automatically
Overcomes scarcity of annotated time-series text data
Proposes domain-independent method using contrastive learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Domain-independent text generation from time-series data
Novel backward approach for dataset creation
Contrastive learning model trained on TACO dataset
🔎 Similar Papers
No similar papers found.
K
Kota Dohi
R&D Group, Hitachi Ltd.
A
Aoi Ito
R&D Group, Hitachi Ltd. and Hosei University
H
Harsh Purohit
R&D Group, Hitachi Ltd.
T
Tomoya Nishida
R&D Group, Hitachi Ltd.
T
Takashi Endo
R&D Group, Hitachi Ltd.
Yohei Kawaguchi
Yohei Kawaguchi
Hitachi, Ltd.
Acoustic Signal ProcessingSignal ProcessingMachine LearningSpeech ProcessingAI