DanmuA11y: Making Time-Synced On-Screen Video Comments (Danmu) Accessible to Blind and Low Vision Users via Multi-Viewer Audio Discussions

📅 2025-01-27

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

Blind and low-vision (BLV) users face accessibility challenges in comprehending, locating, and engaging with Danmu (time-synced comments overlaid on video). To address them, this paper introduces the first multi-voice audio discussion paradigm for vocalizing Danmu. The approach comprises three core technical components: context-enhanced semantic parsing, non-intrusive audio-video fusion, and socially informed multi-track audio organization. The system integrates text-to-speech synthesis, time-synchronized summarization, spatial audio rendering, and preference-driven scheduling to deliver real-time, spatialized, and personalized Danmu vocalization. Evaluated with 12 BLV participants, the system improved Danmu comprehension accuracy by 68%, achieved a viewing fluency rating of 4.6/5, and led 92% of users to report a significantly stronger sense of co-presence and community belonging. The summary presents this as the first faithful reproduction of Danmu's social interaction at the auditory level.

📝 Abstract

By overlaying time-synced user comments on videos, Danmu creates a co-watching experience for online viewers. However, its visual-centric design poses significant challenges for blind and low vision (BLV) viewers. Our formative study identified three primary challenges that hinder BLV viewers' engagement with Danmu: the lack of visual context, the speech interference between comments and videos, and the disorganization of comments. To address these challenges, we present DanmuA11y, a system that makes Danmu accessible by transforming it into multi-viewer audio discussions. DanmuA11y incorporates three core features: (1) Augmenting Danmu with visual context, (2) Seamlessly integrating Danmu into videos, and (3) Presenting Danmu via multi-viewer discussions. Evaluation with twelve BLV viewers demonstrated that DanmuA11y significantly improved Danmu comprehension, provided smooth viewing experiences, and fostered social connections among viewers. We further highlight implications for enhancing commentary accessibility in video-based social media and live-streaming platforms.
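
As an illustration of feature (2), the sketch below shows one way spoken comments could be slotted into pauses in a video's own speech so that the two do not overlap. It is a minimal sketch under stated assumptions, not DanmuA11y's actual method: the `Comment` type, the `find_gaps` and `schedule` helpers, the `max_delay` parameter, and the premise that the video's speech segments and each comment's spoken duration are already known are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Comment:
    time: float      # video timestamp (seconds) at which the comment was posted
    text: str
    duration: float  # assumed spoken length of the comment (seconds)


def find_gaps(speech_segments, video_length):
    """Return (start, end) intervals in which the video has no speech."""
    gaps, cursor = [], 0.0
    for start, end in sorted(speech_segments):
        if start > cursor:
            gaps.append((cursor, start))
        cursor = max(cursor, end)
    if cursor < video_length:
        gaps.append((cursor, video_length))
    return gaps


def schedule(comments, speech_segments, video_length, max_delay=5.0):
    """Greedily place each comment in the earliest speech-free gap that starts
    at or after its timestamp and fits it; comments that would be delayed by
    more than max_delay seconds are dropped (a hypothetical policy)."""
    gaps = find_gaps(speech_segments, video_length)
    free_from = [g_start for g_start, _ in gaps]  # next free time in each gap
    placed = []
    for c in sorted(comments, key=lambda c: c.time):
        for i, (_, g_end) in enumerate(gaps):
            start = max(free_from[i], c.time)
            fits = start + c.duration <= g_end
            if fits and start - c.time <= max_delay:
                placed.append((start, c.text))
                free_from[i] = start + c.duration
                break
    return placed
```

For example, with video speech at 0-4 s and 6-10 s in a 15 s clip, a 1.5 s comment posted at 1.0 s would be spoken at 4.0 s, in the first pause. A real system would also need to weigh comment relevance and listener preferences, which this sketch ignores.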

Problem

Research questions and friction points this paper is trying to address.

Accessibility

Synchronized Barrage

Visual Impairment

Innovation

Methods, ideas, or system contributions that make the work stand out.

DanmuA11y System

Multi-voice Discussion

Accessibility Enhancement

🔎 Similar Papers

Motion Design Principles for Accessible Video-based Learning: Addressing Cognitive Challenges for Deaf and Hard of Hearing Learners

2024-09-30 | arXiv.org | Citations: 1
