🤖 AI Summary
Existing control algorithm research predominantly evaluates performance under default configurations, neglecting the critical yet underexplored dimension of “tunability potential”—i.e., the extent to which expert-driven parameter tuning can improve real-world deployment performance.
Method: This paper formally defines tunability potential and introduces an LLM-driven, multi-level expert simulation framework. Leveraging LLM agents to emulate domain experts’ parameter adjustment strategies, the framework quantifies the practical tunability space of control algorithms across diverse deployment scenarios. It integrates control theory, automated hyperparameter optimization, and empirical closed-loop validation, enabling cross-task and cross-system transferable assessment.
Contribution/Results: Experimental evaluation demonstrates that the framework reliably identifies algorithm-specific tunability ceilings and guides effective parameter optimization, yielding significant improvements in closed-loop control performance—thereby bridging the gap between theoretical design and operational efficacy.
📝 Abstract
Control algorithms in production environments typically require domain experts to tune their parameters and logic for specific scenarios. However, existing research predominantly focuses on algorithmic performance under ideal or default configurations, overlooking the critical aspect of Tuning Potential. To bridge this gap, we introduce Crucible, an agent that employs an LLM-driven, multi-level expert simulation to turn algorithms and defines a formalized metric to quantitatively evaluate their Tuning Potential. We demonstrate Crucible's effectiveness across a wide spectrum of case studies, from classic control tasks to complex computer systems, and validate its findings in a real-world deployment. Our experimental results reveal that Crucible systematically quantifies the tunable space across different algorithms. Furthermore, Crucible provides a new dimension for algorithm analysis and design, which ultimately leads to performance improvements. Our code is available at https://github.com/thu-media/Crucible.