OmniFuser: Adaptive Multimodal Fusion for Service-Oriented Predictive Maintenance

📅 2025-11-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address untimely tool condition prediction—leading to quality degradation and unplanned downtime—in intelligent manufacturing, this paper proposes a vision-force multimodal fusion framework tailored for milling processes. Methodologically, we design a contamination-free cross-modal disentangled fusion mechanism that separates shared representations from modality-specific features; further, we introduce a recursive refinement pathway and a residual information anchoring strategy to enhance fusion robustness, and employ a parallel feature extraction network for efficient dynamic integration of heterogeneous signals. Evaluated on a real-world milling dataset, our approach significantly outperforms state-of-the-art methods, achieving high-accuracy three-class classification (sharp/used/blunt) of tool states and accurate multi-step cutting force prediction. The framework delivers a reliable, reusable technical foundation for industrial intelligent maintenance systems.

Technology Category

Application Category

📝 Abstract
Accurate and timely prediction of tool conditions is critical for intelligent manufacturing systems, where unplanned tool failures can lead to quality degradation and production downtime. In modern industrial environments, predictive maintenance is increasingly implemented as an intelligent service that integrates sensing, analysis, and decision support across production processes. To meet the demand for reliable and service-oriented operation, we present OmniFuser, a multimodal learning framework for predictive maintenance of milling tools that leverages both visual and sensor data. It performs parallel feature extraction from high-resolution tool images and cutting-force signals, capturing complementary spatiotemporal patterns across modalities. To effectively integrate heterogeneous features, OmniFuser employs a contamination-free cross-modal fusion mechanism that disentangles shared and modality-specific components, allowing for efficient cross-modal interaction. Furthermore, a recursive refinement pathway functions as an anchor mechanism, consistently retaining residual information to stabilize fusion dynamics. The learned representations can be encapsulated as reusable maintenance service modules, supporting both tool-state classification (e.g., Sharp, Used, Dulled) and multi-step force signal forecasting. Experiments on real-world milling datasets demonstrate that OmniFuser consistently outperforms state-of-the-art baselines, providing a dependable foundation for building intelligent industrial maintenance services.
Problem

Research questions and friction points this paper is trying to address.

Predicts milling tool conditions using multimodal visual and sensor data
Integrates heterogeneous features through contamination-free cross-modal fusion
Enables reliable tool-state classification and force signal forecasting
Innovation

Methods, ideas, or system contributions that make the work stand out.

Fuses visual and sensor data for predictive maintenance
Employs contamination-free cross-modal fusion mechanism
Uses recursive refinement to stabilize fusion dynamics
Z
Ziqi Wang
School of Software Technology, Zhejiang University
Hailiang Zhao
Hailiang Zhao
ZJU 100 Young Professor, Zhejiang University
Service ComputingEdge ComputingLearning-Augmented Algorithms
Yuhao Yang
Yuhao Yang
University of Hong Kong
Large Language ModelsAgentic ModelsFoundation ModelsGraph Learning
D
Daojiang Hu
School of Software Technology, Zhejiang University
C
Cheng Bao
School of Computer Science and Technology, East China Normal University, Shanghai, China
M
Mingyi Liu
Faculty of Computing, Harbin Institute of Technology
K
Kai Di
Hangzhou School of Automation, Zhejiang Normal University
Schahram Dustdar
Schahram Dustdar
Professor of Computer Science, Member of Academia Europaea, IEEE|EAI|AAIA Fellow, TU Wien, Austria
Distributed SystemsInternet of ThingsEdge ComputingEdge IntelligenceEdge AI
Z
Zhongjie Wang
Faculty of Computing, Harbin Institute of Technology
S
Shuiguang Deng
College of Computer Science and Technology, Zhejiang University