AI-based feedback-driven framework for quality assurance of synthetic data generated for use in mobile advertising software testing

Erol, Ekrem; Senan, SİBEL

doi:10.1007/s11219-026-09761-8

AI-based feedback-driven framework for quality assurance of synthetic data generated for use in mobile advertising software testing

Erol E., Senan S.

SOFTWARE QUALITY JOURNAL, cilt.34, sa.2, 2026 (SCI-Expanded, Scopus)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 34 Sayı: 2
Basım Tarihi: 2026
Doi Numarası: 10.1007/s11219-026-09761-8
Dergi Adı: SOFTWARE QUALITY JOURNAL
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, ABI/INFORM, Compendex, INSPEC
İstanbul Üniversitesi-Cerrahpaşa Adresli: Evet

Özet

Software testing processes heavily rely on the quality and realism of test data. Traditional approaches often face limitations due to static or insufficient data, failing to capture complex real-world user behaviors, especially in dynamic domains like mobile advertising. To address this challenge, this paper presents a novel AI-based framework designed to autonomously generate and assure the quality of synthetic data intended for use in testing processes. The system integrates a GAN-inspired feedback-driven generative framework based on MLP regression models for generating realistic tabular data mimicking mobile advertising logs. The self testing mechanism of the proposed AI-based framework evaluates the functional realism of synthetic data using Random Forest and XGBoost machine learning models to compare predictive performance. This performance is quantitatively measured by the Mean Squared Error (MSE) metric. If the mechanism detects inadequate data quality (high MSE), an automated feedback loop activates GAN retraining, enabling continuous improvement. Experimental results demonstrate the effectiveness of the proposed system in generating high-quality synthetic data that closely aligns with real-world data characteristics. This study contributes a fully automated, feedback-driven synthetic data generation and quality assurance system, enhancing the potential for more robust and realistic software testing by providing validated test inputs applicable to domains with dynamic user behavior.