2024
Authors
Viana, D; Teixeira, R; Baptista, J; Pinto, T;
Publication
International Conference on Electrical, Computer, and Energy Technologies, ICECET 2024
Abstract
This article presents a comprehensive state of the art analysis of the challenging domain of synthetic data generation. Focusing on the problem of synthetic data generation, the paper explores various difficulties that are identified, especially in real-world problems such as those is the scope of power and, energy systems, including the amount of data, data privacy concerns, temporal considerations, dynamic generation, delays, and failures. The investigation delves into the multifaceted nature of the challenges presented by these factors in the synthesis process. The review thoroughly examines different models used in synthetic data generation, covering Generative Adversarial Networks (GANs), Variational Autoencoder (VAE), Synthetic Minority Oversampling Technique (SMOTE), Data Synthesizer (DS) and E. Non-Parametric SynthPop (SP-NP). Each model is dissected with respect to its advantages, disadvantages, and applicability in different data generation scenarios. Special attention is paid to the nuanced aspects of dynamic data generation and the mitigation of challenges such as delays and failures. The insights drawn from this review contribute to a deeper understanding of the landscape around synthetic data generation, providing a valuable resource for researchers, practitioners, and stakeholders who aim to harness the potential of synthetic data in addressing real-world data challenges. The paper concludes by outlining possible avenues for future research and development in this ever-evolving field. © 2024 IEEE.
2025
Authors
Viana, D; Teixeira, R; Soares, T; Baptista, J; Pinto, T;
Publication
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Abstract
This study explores models for synthetic data generation of time series. In order to improve the achieved results, i.e., the data generated, new ways of improvement are explored and different models of synthetic data generation are compared. The model addressed in this work is the Generative Adversarial Networks (GANs), known for generating data similar to the original basis data through the training of a generator. The GANs are applied using the datasets of Quinta de Santa Bárbara and the Pinhão region, with the main variables being the Average temperature, Wind direction, Average wind speed, Maximum instantaneous wind speed and Solar radiation. The model allowed to generate missing data in a given period and, in turn, enables to analyze the results and compare them with those of a multiple linear regression method, being able to evaluate the effectiveness of the generated data. In this way, through the study and analysis of the GANs we can see if the model presents effectiveness and accuracy in the synthetic generation of meteorological data. With the proper conclusions of the results, this information can be used in order to improve the search for different models and the ability to generate synthetic time series data, which is representative of the real, original, data. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
2024
Authors
Pinto, J; Grasel, B; Baptista, J;
Publication
Electronics
Abstract
2024
Authors
Schneider, S; Drexel, R; Zelger, T; Baptista, J;
Publication
BauSim Conference Proceedings - Proceedings of BauSim 2024: 10th Conference of IBPSA-Germany and Austria
Abstract
2024
Authors
Sousa, A; Grasel, B; Baptista, J;
Publication
Applied Sciences
Abstract
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.