BUILDA: A Thermal Building Data Generation Framework for Transfer Learning

📅 2025-08-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
High-quality, large-scale labeled data for transfer learning in building thermodynamics are severely scarce. Method: This paper introduces the first automated synthetic data generation framework specifically designed for transfer learning. It leverages a lightweight single-zone Modelica model exported as a Functional Mock-up Unit (FMU), enabling efficient simulation in Python. Through systematic parameter randomization—covering building typologies, climate zones, and operational conditions—it generates diverse, large-scale time-series thermal environment data, substantially reducing reliance on domain-specific building simulation expertise. Contribution/Results: The synthesized dataset is empirically validated to effectively support both pretraining and fine-tuning, significantly enhancing model generalization and cross-scenario transfer performance. This framework establishes a critical data infrastructure for scalable AI research in the built environment.

Technology Category

Application Category

📝 Abstract
Transfer learning (TL) can improve data-driven modeling of building thermal dynamics. Therefore, many new TL research areas emerge in the field, such as selecting the right source model for TL. However, these research directions require massive amounts of thermal building data which is lacking presently. Neither public datasets nor existing data generators meet the needs of TL research in terms of data quality and quantity. Moreover, existing data generation approaches typically require expert knowledge in building simulation. We present BuilDa, a thermal building data generation framework for producing synthetic data of adequate quality and quantity for TL research. The framework does not require profound building simulation knowledge to generate large volumes of data. BuilDa uses a single-zone Modelica model that is exported as a Functional Mock-up Unit (FMU) and simulated in Python. We demonstrate BuilDa by generating data and utilizing it for pretraining and fine-tuning TL models.
Problem

Research questions and friction points this paper is trying to address.

Lack of thermal building data for transfer learning research
Existing data generators fail in quality and quantity for TL
Need for expert knowledge in current simulation methods
Innovation

Methods, ideas, or system contributions that make the work stand out.

Generates synthetic thermal building data for transfer learning
Uses Modelica model exported as FMU for simulation
Requires minimal building simulation expertise
🔎 Similar Papers
No similar papers found.
T
Thomas Krug
Technical University of Applied Sciences Rosenheim, Rosenheim, Germany
F
Fabian Raisch
Technical University of Munich, Munich, Germany
D
Dominik Aimer
Technical University of Applied Sciences Rosenheim, Rosenheim, Germany
M
Markus Wirnsberger
Technical University of Applied Sciences Rosenheim, Rosenheim, Germany
F
Ferdinand Sigg
Technical University of Applied Sciences Rosenheim, Rosenheim, Germany
Benjamin Schäfer
Benjamin Schäfer
Karlsruhe Institute of Technology
Data-driven ModellingMachine LearningEnergy SystemsExplainable AIStochastic Systems
B
Benjamin Tischler
Technical University of Applied Sciences Rosenheim, Rosenheim, Germany