🤖 AI Summary
This study addresses the critical gap in generating and evolving high-fidelity, privacy-preserving synthetic test data for sensitive domains such as e-government, where effective methods that do not require access to real original data remain scarce. It presents the first systematic literature review on synthetic test data generation and evolution techniques that operate without reliance on real data, following the Kitchenham protocol. From an initial pool of 1,013 publications retrieved from IEEE Xplore, ACM Digital Library, and Scopus, 75 studies were rigorously selected, revealing 37 approaches that partially meet the requirements, with nine emerging as the most promising candidates. The analysis further uncovers a pervasive limitation across existing methods: insufficient support for the continuous evolution of synthetic data, thereby identifying and filling a significant research void in this field.
📝 Abstract
High‐level system testing of applications that use data from e‐Government services as input requires test data that is real‐life‐like but where the privacy of personal information is guaranteed. Applications with such strong requirement include information exchange between countries, medicine, banking, and so on. This review aims to synthesise the current state‐of‐the‐practice in this domain.