A Virtual Laboratory for Managing Computational Experiments

πŸ“… 2025-04-01
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Computational experiments face declining reproducibility and metadata management challenges as scale increases. To address this, we propose a metadata-driven, full-lifecycle management approach for computational experiments and implement it in SCHEMA Labβ€”a virtual laboratory. Our method introduces an ontology-based metadata model explicitly designed for experimental lifecycles, enabling structured capture of configurations, execution logs, and performance metrics. We further incorporate an experiment lineage graph and semantic grouping mechanisms to support cross-instance traceability and multi-experiment relational analysis. Architecturally, SCHEMA Lab adopts a web-based microservices design, RESTful APIs, and visual workflow orchestration. Empirical evaluation demonstrates a 92% experiment reproduction success rate and over 60% reduction in configuration and audit time. Deployed across multiple HPC and AI research teams, the system significantly enhances scientific reproducibility and collaborative efficiency.

Technology Category

Application Category

πŸ“ Abstract
Computational experiments have become essential for scientific discovery, allowing researchers to test hypotheses, analyze complex datasets, and validate findings. However, as computational experiments grow in scale and complexity, ensuring reproducibility and managing detailed metadata becomes increasingly challenging, especially when orchestrating complex sequence of computational tasks. To address these challenges we have developed a virtual laboratory called SCHEMA lab, focusing on capturing rich metadata such as experiment configurations and performance metrics, to support computational reproducibility. SCHEMA lab enables researchers to create experiments by grouping together multiple executions and manage them throughout their life cycle. In this demonstration paper, we present the SCHEMA lab architecture, core functionalities, and implementation, emphasizing its potential to significantly enhance reproducibility and efficiency in computational research.
Problem

Research questions and friction points this paper is trying to address.

Managing reproducibility in large-scale computational experiments
Capturing detailed metadata for experiment configurations
Orchestrating complex sequences of computational tasks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Virtual laboratory for managing computational experiments
Captures rich metadata for reproducibility
Groups multiple executions for lifecycle management
E
Eleni Adamidi
Information Management Systems Institute, Athena Research Center, Athens, Greece
P
Panayiotis Deligiannis
Information Management Systems Institute, Athena Research Center, Athens, Greece
N
Nikos Foutris
Information Management Systems Institute, Athena Research Center, Athens, Greece
Thanasis Vergoulis
Thanasis Vergoulis
IMSI, "Athena" RC, Greece
Scientific databasesScientometricsMachine LearningData management