đ¤ AI Summary
In 3D human pose analysis, existing multi-task approachesâsuch as pose estimation, completion, and denoisingâare typically modeled independently, leading to poor generalization across tasks and scenarios. To address this, we propose PADS, a unified framework based on diffusion models. PADS formulates diverse pose analysis tasks as regularized inverse problems incorporating kinematic constraints, and employs a task-agnostic pose prior to guide conditional denoisingâenabling multi-task inference within a single model without fine-tuning. This work is the first to systematically integrate diffusion models into general-purpose 3D pose analysis, establishing an end-to-end âprior learningâinverse problem solvingâ paradigm. Extensive experiments on multiple benchmarks demonstrate that PADS significantly improves accuracy and robustness under challenging conditionsâincluding occlusion, sensor noise, and missing jointsâvalidating its strong generalization capability and practical applicability.
đ Abstract
Diffusion models have demonstrated remarkable success in generative modeling. In this paper, we propose PADS (Pose Analysis by Diffusion Synthesis), a novel framework designed to address various challenges in 3D human pose analysis through a unified pipeline. Central to PADS are two distinctive strategies: i) learning a task-agnostic pose prior using a diffusion synthesis process to effectively capture the kinematic constraints in human pose data, and ii) unifying multiple pose analysis tasks like estimation, completion, denoising, etc, as instances of inverse problems. The learned pose prior will be treated as a regularization imposing on task-specific constraints, guiding the optimization process through a series of conditional denoising steps. PADS represents the first diffusion-based framework for tackling general 3D human pose analysis within the inverse problem framework. Its performance has been validated on different benchmarks, signaling the adaptability and robustness of this pipeline.