Bespoke OLAP: Synthesizing Workload-Specific One-size-fits-one Database Engines

πŸ“… 2026-03-02
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work proposes a fully automated, end-to-end approach to synthesizing specialized OLAP engines tailored to specific analytical workloads. Unlike general-purpose OLAP systems, which incur structural overhead that impedes peak performance, the proposed method leverages large language model–driven code generation, iterative performance feedback, automated correctness verification, and co-optimization of storage and query execution architectures to generate complete, custom engines without human intervention. For the first time, this framework enables the automatic synthesis of high-performance OLAP engines within minutes to hours, achieving order-of-magnitude speedups over state-of-the-art general-purpose systems such as DuckDB. By eliminating the traditional trade-off between engineering effort and performance inherent in manual specialization, the approach breaks a key bottleneck in analytical data system design.

Technology Category

Application Category

πŸ“ Abstract
Modern OLAP engines are designed to support arbitrary analytical workloads, but this generality incurs structural overhead, including runtime schema interpretation, indirection layers, and abstraction boundaries, even in highly optimized systems. An engine specialized to a fixed workload can eliminate these costs and exploit workload-specific data structures and execution algorithms for substantially higher performance. Historically, constructing such bespoke engines has been economically impractical due to the high manual engineering effort. Recent advances in LLM-based code synthesis challenge this tradeoff by enabling automated system generation. However, naively prompting an LLM to produce a database engine does not yield a correct or efficient design, as effective synthesis requires systematic performance feedback, structured refinement, and careful management of deep architectural interdependencies. We present Bespoke OLAP, a fully autonomous synthesis pipeline for constructing high-performance database engines tightly tailored to a given workload. Our approach integrates iterative performance evaluation and automated validation to guide synthesis from storage to query execution. We demonstrate that Bespoke OLAP can generate a workload-specific engine from scratch within minutes to hours, achieving order-of-magnitude speedups over modern general-purpose systems such as DuckDB.
Problem

Research questions and friction points this paper is trying to address.

OLAP
bespoke database
workload-specific
code synthesis
performance overhead
Innovation

Methods, ideas, or system contributions that make the work stand out.

Bespoke OLAP
LLM-based code synthesis
workload-specific database
autonomous system generation
OLAP engine optimization
πŸ”Ž Similar Papers
No similar papers found.