Detalhes
Nome
Maria Eduarda SilvaCargo
Investigador CoordenadorDesde
01 janeiro 2022
Nacionalidade
PortugalCentro
Laboratório de Inteligência Artificial e Apoio à DecisãoContactos
+351220402963
maria.e.silva@inesctec.pt
2026
Autores
Vale, Jaime; Silva, Vanessa Freitas; Silva, Maria Eduarda; Silva, Fernando;
Publicação
Abstract
Time series data are essential for a wide range of applications, particularly in developing robust machine learning models. However, access to high-quality datasets is often limited due to privacy concerns, acquisition costs, and labeling challenges. Synthetic time series generation has emerged as a promising solution to address these constraints. In this work, we present a framework for generating synthetic time series by leveraging complex networks mappings. Specifically, we investigate whether time series transformed into Quantile Graphs (QG) -- and then reconstructed via inverse mapping -- can produce synthetic data that preserve the statistical and structural properties of the original. We evaluate the fidelity and utility of the generated data using both simulated and real-world datasets, and compare our approach against state-of-the-art Generative Adversarial Network (GAN) methods. Results indicate that our quantile graph-based methodology offers a competitive and interpretable alternative for synthetic time series generation.
2025
Autores
Silva, I; Silva, ME; Pereira, I;
Publicação
Springer Proceedings in Mathematics and Statistics
Abstract
The presence of missing data poses a common challenge for time series analysis in general since the most usual requirement is that the data is equally spaced in time and therefore imputation methods are required. For time series of counts, the usual imputation methods which usually produce real valued observations, are not adequate. This work employs Bayesian principles for handling missing data within time series of counts, based on first-order integer-valued autoregressive (INAR) models, namely Approximate Bayesian Computation (ABC) and Gibbs sampler with Data Augmentation (GDA) algorithms. The methodologies are illustrated with synthetic and real data and the results indicate that the estimates are consistent and present less bias when the percentage of missing observations decreases, as expected. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
2025
Autores
Silva, VF; Silva, ME; Ribeiro, P; Silva, F;
Publicação
INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS
Abstract
In recent years, there has been a surge in the prevalence of high- and multidimensional temporal data across various scientific disciplines. These datasets are characterized by their vast size and challenging potential for analysis. Such data typically exhibit serial and cross-dependency and possess high dimensionality, thereby introducing additional complexities to conventional time series analysis methods. To address these challenges, a recent and complementary approach has emerged, known as network-based analysis methods for multivariate time series. In univariate settings, quantile graphs have been employed to capture temporal transition properties and reduce data dimensionality by mapping observations to a smaller set of sample quantiles. To confront the increasingly prominent issue of high dimensionality, we propose an extension of quantile graphs into a multivariate variant, which we term Multilayer Quantile Graphs. In this innovative mapping, each time series is transformed into a quantile graph, and inter-layer connections are established to link contemporaneous quantiles of pairwise series. This enables the analysis of dynamic transitions across multiple dimensions. In this study, we demonstrate the effectiveness of this new mapping using synthetic and benchmark multivariate time series datasets. We delve into the resulting network's topological structures, extract network features, and employ these features for original dataset analysis. Furthermore, we compare our results with a recent method from the literature. The resulting multilayer network offers a significant reduction in the dimensionality of the original data while capturing serial and cross-dimensional transitions. This approach facilitates the characterization and analysis of large multivariate time series datasets through network analysis techniques.
2025
Autores
Silva, VF; Silva, ME; Ribeiro, P; Silva, F;
Publicação
DATA MINING AND KNOWLEDGE DISCOVERY
Abstract
Multivariate time series analysis is a vital but challenging task, with multidisciplinary applicability, tackling the characterization of multiple interconnected variables over time and their dependencies. Traditional methodologies often adapt univariate approaches or rely on assumptions specific to certain domains or problems, presenting limitations. A recent promising alternative is to map multivariate time series into high-level network structures such as multiplex networks, with past work relying on connecting successive time series components with interconnections between contemporary timestamps. In this work, we first define a novel cross-horizontal visibility mapping between lagged timestamps of different time series and then introduce the concept of multilayer horizontal visibility graphs. This allows describing cross-dimension dependencies via inter-layer edges, leveraging the entire structure of multilayer networks. To this end, a novel parameter-free topological measure is proposed and common measures are extended for the multilayer setting. Our approach is general and applicable to any kind of multivariate time series data. We provide an extensive experimental evaluation with both synthetic and real-world datasets. We first explore the proposed methodology and the data properties highlighted by each measure, showing that inter-layer edges based on cross-horizontal visibility preserve more information than previous mappings, while also complementing the information captured by commonly used intra-layer edges. We then illustrate the applicability and validity of our approach in multivariate time series mining tasks, showcasing its potential for enhanced data analysis and insights.
2024
Autores
Barbosa, S; Silva, ME; Rousseau, DD;
Publicação
NONLINEAR PROCESSES IN GEOPHYSICS
Abstract
Palaeoclimate time series, reflecting the state of Earth's climate in the distant past, occasionally display very large and rapid shifts showing abrupt climate variability. The identification and characterisation of these abrupt transitions in palaeoclimate records is of particular interest as this allows for understanding of millennial climate variability and the identification of potential tipping points in the context of current climate change. Methods that are able to characterise these events in an objective and automatic way, in a single time series, or across two proxy records are therefore of particular interest. In our study the matrix profile approach is used to describe Dansgaard-Oeschger (DO) events, abrupt warmings detected in the Greenland ice core, and Northern Hemisphere marine and continental records. The results indicate that canonical events DO-19 and DO-20, occurring at around 72 and 76 ka, are the most similar events over the past 110 000 years. These transitions are characterised by matching transitions corresponding to events DO-1, DO-8, and DO-12. They are abrupt, resulting in a rapid shift to warmer conditions, followed by a gradual return to cold conditions. The joint analysis of the delta 18O and Ca2+ time series indicates that the transition corresponding to the DO-19 event is the most similar event across the two time series.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.