MetaKube: An Experience-Aware LLM Framework for Kubernetes Failure Diagnosis

📅 2026-03-24
📈 Citations: 0
Influential: 0
📄 PDF
📝 Abstract
Existing LLM-based Kubernetes diagnostic systems cannot learn from operational experience, operating on static knowledge bases without improving from past resolutions. We present MetaKube, an experience-aware LLM framework through three synergistic innovations: (1) an Episodic Pattern Memory Network (EPMN) that abstracts diagnostic patterns from historical resolutions and provides confidence-calibrated retrieval for both rapid pattern matching and guided causal exploration, (2) a meta-cognitive controller that dynamically routes between intuitive and analytical pathways based on problem familiarity, optimizing the trade-off between speed and depth, and (3) KubeLLM, a locally-deployable 8B model enhanced through domain-specific post-training on our 7,000-sample Kubernetes Fault Resolution Dataset. Evaluation on 1,873 real-world scenarios demonstrates MetaKube transforms Qwen3-8B from 50.9 to 90.5 points, approaching GPT-4.1 performance while ensuring complete data privacy. EPMN contributes 15.3% improvement through experiential learning, with continuous learning experiments showing progressive gains as the system accumulates operational knowledge. The source code and related resources are available at https://github.com/MetaKube-LLM-for-Kubernetes-Diagnosis/MetaKube.
Problem

Research questions and friction points this paper is trying to address.

Kubernetes failure diagnosis
experience-aware learning
operational experience
static knowledge base
continuous learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

experience-aware LLM
Episodic Pattern Memory Network
meta-cognitive controller
Kubernetes failure diagnosis
continual learning
🔎 Similar Papers
No similar papers found.
Wei Sun
Wei Sun
Shenzhen International Quantum Academy
Quantum physicsIntegrated photonicsAMO
T
Ting Wang
School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen; Shenzhen Future Network of Intelligence Institute, Shenzhen, China
X
Xinran Tian
Shenzhen Future Network of Intelligence Institute, Shenzhen, China
W
Wanshun Lan
China Mobile Communications Group Guangdong Co., Ltd., Guangzhou, China
X
Xuhan Feng
China Mobile Communications Group Guangdong Co., Ltd., Guangzhou, China
H
Haoyue Li
China Mobile Communications Group Guangdong Co., Ltd., Guangzhou, China
Fangxin Wang
Fangxin Wang
The Chinese University of Hong Kong, Shenzhen
Multimedia StreamingCloud & Edge ComputingDeep Learning