ForgeDreamer: Industrial Text-to-3D Generation with Multi-Expert LoRA and Cross-View Hypergraph

📅 2026-03-10
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing text-to-3D generation methods are limited in industrial settings by cross-category semantic interference and inadequate modeling of high-order geometric structures. To address these challenges, this work proposes a multi-expert LoRA ensemble mechanism to disentangle semantic knowledge and suppress interference, alongside a cross-view hypergraph modeling framework to capture high-order geometric dependencies among multiple viewpoints. Through joint semantic-geometric optimization, the proposed approach significantly outperforms current state-of-the-art methods on a newly curated industrial dataset, achieving manufacturing-grade geometric consistency while preserving strong semantic generalization capabilities.

Technology Category

Application Category

📝 Abstract
Current text-to-3D generation methods excel in natural scenes but struggle with industrial applications due to two critical limitations: domain adaptation challenges where conventional LoRA fusion causes knowledge interference across categories, and geometric reasoning deficiencies where pairwise consistency constraints fail to capture higher-order structural dependencies essential for precision manufacturing. We propose a novel framework named ForgeDreamer addressing both challenges through two key innovations. First, we introduce a Multi-Expert LoRA Ensemble mechanism that consolidates multiple category-specific LoRA models into a unified representation, achieving superior cross-category generalization while eliminating knowledge interference. Second, building on enhanced semantic understanding, we develop a Cross-View Hypergraph Geometric Enhancement approach that captures structural dependencies spanning multiple viewpoints simultaneously. These components work synergistically improved semantic understanding, enables more effective geometric reasoning, while hypergraph modeling ensures manufacturing-level consistency. Extensive experiments on a custom industrial dataset demonstrate superior semantic generalization and enhanced geometric fidelity compared to state-of-the-art approaches. Our code and data are provided in the supplementary material attached in the appendix for review purposes.
Problem

Research questions and friction points this paper is trying to address.

text-to-3D generation
domain adaptation
geometric reasoning
knowledge interference
structural dependencies
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-Expert LoRA
Cross-View Hypergraph
Text-to-3D Generation
Geometric Reasoning
Industrial 3D Modeling
🔎 Similar Papers
No similar papers found.
Junhao Cai
Junhao Cai
Shanghai AI Lab, HKUST
RoboticsComputer Vision
Deyu Zeng
Deyu Zeng
Shenzhen University
machine learning
J
Junhao Pang
Shenzhen University, Shenzhen, Guangdong 518060, China
L
Lini Li
Shenzhen University, Shenzhen, Guangdong 518060, China
Zongze Wu
Zongze Wu
Shenzhen University
Machine LearningDeep LearningIndustrial ManufacturingInternet of Things
X
Xiaopin Zhong
Shenzhen University, Shenzhen, Guangdong 518060, China