Quantized Online LQR

📅 2026-04-13

📈 Citations: 0

✨ Influential: 0

career value

207K/year

🤖 AI Summary

This work addresses the challenge of online linear quadratic regulator (LQR) control for unknown dynamical systems under communication constraints. Conventional approaches suffer from persistent noise due to per-step state quantization and high communication overhead. To overcome these limitations, the authors propose a novel paradigm wherein the local agent estimates system dynamics from observations and transmits only this dynamic estimate over a low-rate communication link to a remote controller. The remote controller then computes the optimal policy and sends it back for execution using the local agent’s exact state. The study establishes, for the first time, an Ω(log T) fundamental lower bound on communication complexity required to achieve sublinear regret and introduces the QCE-LQR algorithm that matches this bound. As the quantization resolution increases, QCE-LQR’s regret smoothly converges to that of the unquantized certainty-equivalent benchmark. Empirical evaluations on four standard systems, including a Boeing 747 model, demonstrate that QCE-LQR attains performance comparable to the unquantized controller within T = 10,000 steps.

Technology Category

Application Category

📝 Abstract

We study online linear-quadratic regulation (LQR) with unknown dynamics under communication rate constraints. Classical networked control quantizes the plant state at every time step, requiring $O(T)$ total bits while injecting persistent quantization noise that limits control performance. We consider a setting where the plant observes its state locally and can estimate system dynamics via ordinary least squares, while a remote controller possesses knowledge of the control cost. Rather than quantizing the raw state, the plant transmits learned dynamics estimates over a rate-limited uplink, and the controller returns the optimal control policy so that the plant can compute actions locally using its superior state knowledge. We first prove a fundamental information-theoretic lower bound: any scheme achieving $O(T^α)$ regret for $α\in [1/2,1)$ compared to the optimal infinite horizon LQR controller that knows the true system dynamics must transmit at least $Ω(\log T)$ bits. We then design the \textbf{Quantized Certainty Equivalent (QCE-LQR)} algorithm, which matches this bound. The resulting regret bound contains inflation factors $Q_{\mathrm{slow}}(\varrho)$ and $Q_{\mathrm{fast}}(\varrho)$ that vanish as the codebook resolution increases, smoothly recovering the unquantized baseline regret. Numerical experiments on four benchmark systems -- from a scalar unstable plant to a 24-parameter Boeing 747 lateral model -- confirm that a variant of QCE-LQR achieves regret comparable to an unquantized certainty equivalent controller over a horizon of $T=10{,}000$ steps.

Problem

Research questions and friction points this paper is trying to address.

quantized control

online LQR

communication constraints

unknown dynamics

regret minimization

Innovation

Methods, ideas, or system contributions that make the work stand out.

Quantized LQR

Online Control

Communication Constraints