DiLO: Disentangled Latent Optimization for Learning Shape and Deformation in Grouped Deforming 3D Objects

📅 2025-11-08
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses unsupervised disentanglement of shape and deformation factors for groups of deformable 3D objects (e.g., humans, animals, faces). We propose a decoupled latent-variable optimization framework that jointly trains a generative network with two permutation-invariant PointNet encoders. Leveraging a two-stage training strategy and custom regularization, our method achieves shape–deformation separation without paired annotations. The core innovation lies in enforcing structural disentanglement directly in the latent space, enabling deformation transfer, fine-grained classification, and interpretable analysis. Extensive evaluation on multiple benchmark datasets demonstrates effectiveness: downstream task performance matches or surpasses that of complex supervised or strongly assumption-driven approaches. Our method significantly advances the practicality and generalizability of unsupervised 3D deformation modeling.

Technology Category

Application Category

📝 Abstract
In this work, we propose a disentangled latent optimization-based method for parameterizing grouped deforming 3D objects into shape and deformation factors in an unsupervised manner. Our approach involves the joint optimization of a generator network along with the shape and deformation factors, supported by specific regularization techniques. For efficient amortized inference of disentangled shape and deformation codes, we train two order-invariant PoinNet-based encoder networks in the second stage of our method. We demonstrate several significant downstream applications of our method, including unsupervised deformation transfer, deformation classification, and explainability analysis. Extensive experiments conducted on 3D human, animal, and facial expression datasets demonstrate that our simple approach is highly effective in these downstream tasks, comparable or superior to existing methods with much higher complexity.
Problem

Research questions and friction points this paper is trying to address.

Unsupervised parameterization of 3D objects into shape and deformation factors
Joint optimization of generator network with regularization techniques
Amortized inference of disentangled codes for deformation transfer and classification
Innovation

Methods, ideas, or system contributions that make the work stand out.

Disentangled latent optimization for shape and deformation
Joint optimization with regularization for unsupervised learning
PoinNet encoders for amortized inference of codes
Mostofa Rafid Uddin
Mostofa Rafid Uddin
CBD, School of Computer Science, Carnegie Mellon University
Computational BiologyMachine LearningComputer Vision
J
Jana Armouti
Carnegie Mellon University, Pittsburgh, PA 15213, USA
U
Umong Sain
Bangladesh University of Engineering and Technology, Dhaka 1000, Bangladesh
M
Md Asib Rahman
Bangladesh University of Engineering and Technology, Dhaka 1000, Bangladesh
X
Xingjian Li
Carnegie Mellon University, Pittsburgh, PA 15213, USA
M
Min Xu
Carnegie Mellon University, Pittsburgh, PA 15213, USA