AI Summary
This study addresses implicit discourse relation classification, which requires inferring semantic connections from context, a task hindered by the limitations of text-only approaches in capturing cross-lingual and cross-modal cues. To this end, the authors present the first multilingual multimodal dataset for implicit discourse relations, covering English, French, and Spanish, and propose a multimodal method that integrates textual and acoustic features. Leveraging the Qwen2-Audio model for joint audio-text modeling, the approach enables effective cross-lingual transfer. Experimental results show that fusing both modalities improves over unimodal text-only and audio-only baselines, and that cross-lingual transfer yields particularly pronounced gains for low-resource languages.
Abstract
Implicit discourse relation classification is a challenging task, as it requires inferring meaning from context. Contextual cues can be distributed across modalities and vary across languages, yet they are not always captured by text alone. To address this, we introduce an automatic method, applicable to distantly related and unrelated language pairs, for constructing a multilingual and multimodal dataset of implicit discourse relations in English, French, and Spanish. For classification, we propose a multimodal approach that integrates textual and acoustic information through Qwen2-Audio, enabling joint modeling of text and audio across languages. We find that while text-based models outperform audio-based models, integrating both modalities can enhance performance, and that cross-lingual transfer can provide substantial improvements for low-resource languages.
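To make the described setup concrete, below is a minimal sketch of how a paired audio clip and transcript could be fed to Qwen2-Audio for this task, using the publicly released Hugging Face checkpoint. The prompt wording, the PDTB-style label set, and the file path are illustrative assumptions, not the authors' exact configuration or training procedure.

```python
# Sketch: prompting Qwen2-Audio with audio + text to label an implicit
# discourse relation. Assumes the public Qwen/Qwen2-Audio-7B-Instruct
# checkpoint; labels, prompt, and paths are hypothetical.
import librosa
from transformers import AutoProcessor, Qwen2AudioForConditionalGeneration

MODEL_ID = "Qwen/Qwen2-Audio-7B-Instruct"
# PDTB-style top-level senses, a common label set for this task (assumption).
LABELS = ["Comparison", "Contingency", "Expansion", "Temporal"]

processor = AutoProcessor.from_pretrained(MODEL_ID)
model = Qwen2AudioForConditionalGeneration.from_pretrained(
    MODEL_ID, device_map="auto"
)

def classify(audio_path: str, arg1: str, arg2: str) -> str:
    """Ask the model which implicit relation holds between two arguments."""
    prompt = (
        f"Argument 1: {arg1}\nArgument 2: {arg2}\n"
        f"Which implicit discourse relation holds between them? "
        f"Answer with one of: {', '.join(LABELS)}."
    )
    conversation = [
        {"role": "user", "content": [
            {"type": "audio", "audio_url": audio_path},  # acoustic evidence
            {"type": "text", "text": prompt},            # textual evidence
        ]},
    ]
    text = processor.apply_chat_template(
        conversation, add_generation_prompt=True, tokenize=False
    )
    # Load the waveform at the sampling rate the feature extractor expects.
    waveform, _ = librosa.load(
        audio_path, sr=processor.feature_extractor.sampling_rate
    )
    inputs = processor(
        text=text, audios=[waveform], return_tensors="pt", padding=True
    ).to(model.device)
    generated = model.generate(**inputs, max_new_tokens=8)
    generated = generated[:, inputs.input_ids.shape[1]:]  # keep new tokens only
    return processor.batch_decode(generated, skip_special_tokens=True)[0].strip()

# Hypothetical usage with a clip and its two discourse arguments:
# classify("clip_0001.wav", "It was raining hard.", "The match was cancelled.")
```

Because the same checkpoint accepts audio in any of the covered languages, a prompt of this shape also supports the cross-lingual transfer experiments the abstract mentions, e.g. training or prompting in English and evaluating on French or Spanish clips.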