Intelligent Collaborative Optimization for Rubber Tyre Film Production Based on Multi-path Differentiated Clipping Proximal Policy Optimization

📅 2025-11-15
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenges of collaborative optimization in rubber tire thin-film production—stemming from strong inter-subsystem coupling, highly nonlinear interactions, and significant dynamic response delays—this paper proposes an intelligent collaborative optimization framework based on deep reinforcement learning. We innovatively design a multi-path differential clipping Proximal Policy Optimization (PPO) algorithm, integrating multi-branch policy networks with gradient-constrained optimization to simultaneously ensure policy convergence and substantially enhance dynamic adaptability in high-dimensional, complex systems. The method enables real-time, multi-objective coordinated control and stable regulation of critical process parameters, such as film width and thickness. Experimental and industrial deployment results demonstrate a 23.6% improvement in regulation accuracy and a 41.2% reduction in steady-state fluctuations compared to conventional approaches, while maintaining excellent real-time performance and robustness. This work establishes a practical, autonomous decision-making paradigm for intelligent tire manufacturing.

Technology Category

Application Category

📝 Abstract
The advent of smart manufacturing is addressing the limitations of traditional centralized scheduling and inflexible production line configurations in the rubber tyre industry, especially in terms of coping with dynamic production demands. Contemporary tyre manufacturing systems form complex networks of tightly coupled subsystems pronounced nonlinear interactions and emergent dynamics. This complexity renders the effective coordination of multiple subsystems, posing an essential yet formidable task. For high-dimensional, multi-objective optimization problems in this domain, we introduce a deep reinforcement learning algorithm: Multi-path Differentiated Clipping Proximal Policy Optimization (MPD-PPO). This algorithm employs a multi-branch policy architecture with differentiated gradient clipping constraints to ensure stable and efficient high-dimensional policy updates. Validated through experiments on width and thickness control in rubber tyre film production, MPD-PPO demonstrates substantial improvements in both tuning accuracy and operational efficiency. The framework successfully tackles key challenges, including high dimensionality, multi-objective trade-offs, and dynamic adaptation, thus delivering enhanced performance and production stability for real-time industrial deployment in tyre manufacturing.
Problem

Research questions and friction points this paper is trying to address.

Optimizing high-dimensional multi-objective production in tyre manufacturing
Addressing dynamic adaptation challenges in complex industrial subsystems
Improving coordination of nonlinear interactions in tyre film production
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-branch policy architecture for stable policy updates
Differentiated gradient clipping for high-dimensional optimization
Deep reinforcement learning for dynamic industrial production control
🔎 Similar Papers
No similar papers found.
Y
Yinghao Ruan
State Key Laboratory of Physical Oceanography and the Faculty of Information Science and Engineering, Ocean University of China, Qingdao, 266100
W
Wei Pang
S
Shuaihao Liu
H
Huili Yang
Master’s degree, Senior Engineer (Principal), Research areas: R&D and intelligent manufacturing of rubber equipment, system reliability engineering, innovative configurable development for complex machinery
L
Leyi Han
Master’s degree, Associate Senior Engineer, Research areas: electrical automation for rubber machinery, big data, and intelligent operation and maintenance
Xinghui Dong
Xinghui Dong
Ocean University of China
Computer VisionVisual PerceptionTexture AnalysisNon-destructive Testing