From Craft to Constitution: A Governance-First Paradigm for Principled Agent Engineering

📅 2025-10-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Current large language model (LLM) agents face a “craftsmanship crisis” when transitioning from prototypes to production deployment: rigidly imposing deterministic software engineering paradigms results in fragile, unpredictable, and untrustworthy systems. To address the inherent probabilistic nature of LLMs, this paper introduces a *Governance-First* paradigm. Its core innovation is ArbiterOS—a formal runtime governance system integrating principle-driven control, probabilistic reasoning modeling, and verifiable coordination mechanisms. Moving beyond imperative development, ArbiterOS ensures agent behavior that is consistent, interpretable, and controllable. Experimental evaluation demonstrates substantial improvements in stability and trustworthiness on complex tasks. ArbiterOS thus provides the first verifiable and scalable governance infrastructure for engineering LLM agents in high-stakes operational environments.

Technology Category

Application Category

📝 Abstract
The advent of powerful Large Language Models (LLMs) has ushered in an ``Age of the Agent,'' enabling autonomous systems to tackle complex goals. However, the transition from prototype to production is hindered by a pervasive ``crisis of craft,'' resulting in agents that are brittle, unpredictable, and ultimately untrustworthy in mission-critical applications. This paper argues this crisis stems from a fundamental paradigm mismatch -- attempting to command inherently probabilistic processors with the deterministic mental models of traditional software engineering. To solve this crisis, we introduce a governance-first paradigm for principled agent engineering, embodied in a formal architecture we call ArbiterOS.
Problem

Research questions and friction points this paper is trying to address.

Addressing brittle unpredictable agents in mission-critical applications
Solving paradigm mismatch between probabilistic AI and deterministic engineering
Introducing governance-first architecture for trustworthy autonomous systems
Innovation

Methods, ideas, or system contributions that make the work stand out.

Governance-first paradigm for principled agent engineering
Formal architecture called ArbiterOS implementation
Addresses probabilistic processors with deterministic models
🔎 Similar Papers
No similar papers found.
Q
Qiang Xu
CURE Lab., Dept. of CSE, The Chinese University of Hong Kong, Hong Kong S.A.R.
Xiangyu Wen
Xiangyu Wen
Computer Science and Engineering, CUHK
AI SecurityLLM Reasoning
C
Changran Xu
CURE Lab., Dept. of CSE, The Chinese University of Hong Kong, Hong Kong S.A.R.
Z
Zeju Li
CURE Lab., Dept. of CSE, The Chinese University of Hong Kong, Hong Kong S.A.R.
Jianyuan Zhong
Jianyuan Zhong
The Chinese University of Hong Kong
Machine Learning