IMPACT: A Dataset for Multi-Granularity Human Procedural Action Understanding in Industrial Assembly

📅 2026-04-11

🤖 AI Summary
This work addresses the lack of benchmark datasets for fine-grained procedural action understanding under realistic deployment conditions in industrial assembly. The authors introduce IMPACT, a synchronized five-view RGB-D dataset built around real assembly and disassembly of a commercial angle grinder. According to the authors, it is the first real industrial assembly benchmark to jointly provide synchronized egocentric-exocentric recordings, decoupled bimanual annotation, compliance-aware state tracking, and explicit anomaly-recovery supervision within a single workflow. It comprises 112 trials (39.5 hours) from 13 participants and features hierarchical action-state-compliance annotations, a partial-order prerequisite graph that permits multiple execution routes, NASA-TLX cognitive load assessments, and null spans synchronized across views. Baseline experiments reveal fundamental limitations of current methods in handling incomplete observations, flexible execution paths, and corrective behavior, establishing a new benchmark for future research.

📝 Abstract
We introduce IMPACT, a synchronized five-view RGB-D dataset for deployment-oriented industrial procedural understanding, built around real assembly and disassembly of a commercial angle grinder with professional-grade tools. To our knowledge, IMPACT is the first real industrial assembly benchmark that jointly provides synchronized ego-exo RGB-D capture, decoupled bimanual annotation, compliance-aware state tracking, and explicit anomaly-recovery supervision within a single real industrial workflow. It comprises 112 trials from 13 participants totaling 39.5 hours, with multi-route execution governed by a partial-order prerequisite graph, a six-category anomaly taxonomy, and operator cognitive load measured via NASA-TLX. The annotation hierarchy links hand-specific atomic actions to coarse procedural steps, component assembly states, and per-hand compliance phases, with synchronized null spans across views to decouple perceptual limitations from algorithmic failure. Systematic baselines reveal fundamental limitations that remain invisible to single-task benchmarks, particularly under realistic deployment conditions that involve incomplete observations, flexible execution paths, and corrective behavior. The full dataset, annotations, and evaluation code are available at https://github.com/Kratos-Wen/IMPACT.
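
To illustrate what "multi-route execution governed by a partial-order prerequisite graph" can mean in practice, the following is a minimal sketch, not the dataset's actual tooling. The step names and the `PREREQS` graph are hypothetical and are not taken from the IMPACT annotations; the function only checks whether an observed step sequence respects a given set of prerequisites, so several different orderings can all be valid.

```python
from typing import Dict, List, Optional, Set, Tuple

# Hypothetical prerequisite graph (NOT from the IMPACT annotations):
# each step maps to the set of steps that must be completed before it.
PREREQS: Dict[str, Set[str]] = {
    "remove_guard": set(),
    "remove_flange_nut": set(),
    "remove_disc": {"remove_flange_nut"},
    "open_housing": {"remove_guard", "remove_disc"},
}


def first_violation(
    sequence: List[str], prereqs: Dict[str, Set[str]]
) -> Optional[Tuple[int, str, List[str]]]:
    """Return (index, step, missing prerequisites) for the first step that is
    executed before all of its prerequisites, or None if the sequence is a
    valid route through the partial order.

    Steps absent from `prereqs` are treated as having no prerequisites.
    """
    done: Set[str] = set()
    for index, step in enumerate(sequence):
        missing = prereqs.get(step, set()) - done
        if missing:
            return index, step, sorted(missing)
        done.add(step)
    return None


# Two different orderings are both valid routes under the same graph.
route_a = ["remove_guard", "remove_flange_nut", "remove_disc", "open_housing"]
route_b = ["remove_flange_nut", "remove_disc", "remove_guard", "open_housing"]
# This ordering skips a prerequisite and is flagged.
bad_route = ["remove_guard", "remove_disc", "open_housing"]

print(first_violation(route_a, PREREQS))
print(first_violation(route_b, PREREQS))
print(first_violation(bad_route, PREREQS))
```

Under this sketch, a flagged step is only a candidate for an ordering anomaly; how the dataset itself defines and labels anomalies is described by its six-category taxonomy, which this example does not reproduce.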

Problem

Research questions and friction points this paper is trying to address.

procedural action understanding
industrial assembly
multi-granularity
anomaly recovery
RGB-D dataset

Innovation

Methods, ideas, or system contributions that make the work stand out.

multi-granularity procedural understanding
synchronized ego-exo RGB-D
decoupled bimanual annotation
compliance-aware state tracking
anomaly-recovery supervision

Di Wen
Karlsruhe Institute of Technology
Fine-grained Action Understanding, Anomaly Detection, Robustness, Uncertainty

Zeyun Zhong
Karlsruhe Institute of Technology

David Schneider
PhD Student, Karlsruhe Institute of Technology

Manuel Zaremski
Karlsruhe Institute of Technology

Linus Kunzmann
Karlsruhe Institute of Technology

Yitian Shi
PhD student at KIT
Grasping, Robotic grasping, Robotic Manipulation

Ruiping Liu
Karlsruhe Institute of Technology

Yufan Chen
Karlsruhe Institute of Technology
Document Analysis, Computer Vision, Robust Deep Learning

Junwei Zheng
CV:HCI, KIT; CVG, ETH Zurich
Visual Localization, Scene Understanding, Assistive Technology

Jiahang Li
Karlsruhe Institute of Technology

Jonas Hemmerich
Karlsruhe Institute of Technology

Qiyi Tong
Italian Institute of Technology

Patric Grauberger
Karlsruhe Institute of Technology

Arash Ajoudani
Tenured Senior Scientist, Istituto Italiano di Tecnologia
Collaborative Robotics, Physical Human-Robot interaction, Human-Robot Collaboration, Assistive Robotics, Telerobotics

Danda Pani Paudel
INSAIT Sofia University
Computer Vision, Robotics, Earth Observation

Sven Matthiesen
Professor, IPEK-Institute of Product Engineering Karlsruhe, Karlsruhe Institute of Technology (KIT)
engineering design, usability, Design Education, power-tool, human-machine-symbiosis

Barbara Deml
Karlsruhe Institute of Technology

Jürgen Beyerer
Karlsruhe Institute of Technology

Luc Van Gool
professor computer vision INSAIT Sofia University, em. KU Leuven, em. ETHZ, Toyota Lab TRACE
computer vision, machine learning, AI, autonomous cars, cultural heritage

Rainer Stiefelhagen
Karlsruhe Institute of Technology, Karlsruhe, Germany
Computer vision, Multimodal interaction, Accessibility

Kunyu Peng
Karlsruhe Institute of Technology
video understanding, open set recognition, generalizable deep learning