DiLO: Disentangled Latent Optimization for Learning Shape and Deformation in Grouped Deforming 3D Objects

📅 2025-11-08

📈 Citations: 0

✨ Influential: 0

career value

202K/year

🤖 AI Summary

This work addresses unsupervised disentanglement of shape and deformation factors for groups of deformable 3D objects (e.g., humans, animals, faces). We propose a decoupled latent-variable optimization framework that jointly trains a generative network with two permutation-invariant PointNet encoders. Leveraging a two-stage training strategy and custom regularization, our method achieves shape–deformation separation without paired annotations. The core innovation lies in enforcing structural disentanglement directly in the latent space, enabling deformation transfer, fine-grained classification, and interpretable analysis. Extensive evaluation on multiple benchmark datasets demonstrates effectiveness: downstream task performance matches or surpasses that of complex supervised or strongly assumption-driven approaches. Our method significantly advances the practicality and generalizability of unsupervised 3D deformation modeling.

Technology Category

Application Category

📝 Abstract

In this work, we propose a disentangled latent optimization-based method for parameterizing grouped deforming 3D objects into shape and deformation factors in an unsupervised manner. Our approach involves the joint optimization of a generator network along with the shape and deformation factors, supported by specific regularization techniques. For efficient amortized inference of disentangled shape and deformation codes, we train two order-invariant PoinNet-based encoder networks in the second stage of our method. We demonstrate several significant downstream applications of our method, including unsupervised deformation transfer, deformation classification, and explainability analysis. Extensive experiments conducted on 3D human, animal, and facial expression datasets demonstrate that our simple approach is highly effective in these downstream tasks, comparable or superior to existing methods with much higher complexity.

Problem

Research questions and friction points this paper is trying to address.

Unsupervised parameterization of 3D objects into shape and deformation factors

Joint optimization of generator network with regularization techniques

Amortized inference of disentangled codes for deformation transfer and classification

Innovation

Methods, ideas, or system contributions that make the work stand out.

Disentangled latent optimization for shape and deformation

Joint optimization with regularization for unsupervised learning

PoinNet encoders for amortized inference of codes

🔎 Similar Papers

DualContrast: Unsupervised Disentangling of Content and Transformations with Implicit Parameterization

2024-05-27arXiv.orgCitations: 0

World Labs

$250,000-$350,000 base salary (good-faith estimate for San Francisco Bay Area upon hire; actual offer based on experience, skills, and qualifications)

San Francisco / San Francisco Office, San Francisco, California, United States

Research Scientist Intern, Multimodal Generative AI and Robotics (PhD)