Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
Publications

Publications by LIAAD

2023

Machine Learning and Principles and Practice of Knowledge Discovery in Databases

Authors
Koprinska, I; Mignone, P; Guidotti, R; Jaroszewicz, S; Fröning, H; Gullo, F; Ferreira, PM; Roqueiro, D; Ceddia, G; Nowaczyk, S; Gama, J; Ribeiro, R; Gavaldà, R; Masciari, E; Ras, Z; Ritacco, E; Naretto, F; Theissler, A; Biecek, P; Verbeke, W; Schiele, G; Pernkopf, F; Blott, M; Bordino, I; Danesi, IL; Ponti, G; Severini, L; Appice, A; Andresini, G; Medeiros, I; Graça, G; Cooper, L; Ghazaleh, N; Richiardi, J; Saldana, D; Sechidis, K; Canakoglu, A; Pido, S; Pinoli, P; Bifet, A; Pashami, S;

Publication
Communications in Computer and Information Science

Abstract

2023

Machine Learning and Principles and Practice of Knowledge Discovery in Databases

Authors
Koprinska, I; Mignone, P; Guidotti, R; Jaroszewicz, S; Fröning, H; Gullo, F; Ferreira, PM; Roqueiro, D; Ceddia, G; Nowaczyk, S; Gama, J; Ribeiro, R; Gavaldà, R; Masciari, E; Ras, Z; Ritacco, E; Naretto, F; Theissler, A; Biecek, P; Verbeke, W; Schiele, G; Pernkopf, F; Blott, M; Bordino, I; Danesi, IL; Ponti, G; Severini, L; Appice, A; Andresini, G; Medeiros, I; Graça, G; Cooper, L; Ghazaleh, N; Richiardi, J; Saldana, D; Sechidis, K; Canakoglu, A; Pido, S; Pinoli, P; Bifet, A; Pashami, S;

Publication
Communications in Computer and Information Science

Abstract

2023

Wavelet-based fuzzy clustering of interval time series

Authors
D'Urso, P; De Giovanni, L; Maharaj, EA; Brito, P; Teles, P;

Publication
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING

Abstract
We investigate the fuzzy clustering of interval time series using wavelet variances and covariances; in particular, we use a fuzzy c-medoids clustering algorithm. Traditional hierarchical and non-hierarchical clustering methods lead to the identification of mutually exclusive clusters whereas fuzzy clustering methods enable the identification of overlapping clusters, implying that one or more series could belong to more than one cluster simultaneously. An interval time series (ITS) which arises when interval-valued observa-tions are recorded over time is able to capture the variability of values within each interval at each time point. This is in contrast to single-point information available in a classical time series. Our main contribution is that by combining wavelet analysis, interval data analysis and fuzzy clustering, we are able to capture information which would otherwise have not been contemplated by the use of traditional crisp clustering methods on classical time series for which just a single value is recorded at each time point. Through simulation studies, we show that under some circumstances fuzzy c-medoids clustering performs better when applied to ITS than when it is applied to the corresponding traditional time series. Applications to exchange rates ITS and sea-level ITS show that the fuzzy clustering method reveals different and more meaningful results than when applied to associated single-point time series.

2023

Classification and Data Science in the Digital Age

Authors
Brito, P; Dias, JG; Lausen, B; Montanari, A; Nugent, R;

Publication
Studies in Classification, Data Analysis, and Knowledge Organization

Abstract

2023

Preface

Authors
Brito, P; Dias, G; Lausen, B; Montanari, A; Nugent, R;

Publication
Studies in Classification, Data Analysis, and Knowledge Organization

Abstract
[No abstract available]

2023

GASTeN: Generative Adversarial Stress Test Networks

Authors
Cunha, L; Soares, C; Restivo, A; Teixeira, LF;

Publication
ADVANCES IN INTELLIGENT DATA ANALYSIS XXI, IDA 2023

Abstract
Concerns with the interpretability of ML models are growing as the technology is used in increasingly sensitive domains (e.g., health and public administration). Synthetic data can be used to understand models better, for instance, if the examples are generated close to the frontier between classes. However, data augmentation techniques, such as Generative Adversarial Networks (GAN), have been mostly used to generate training data that leads to better models. We propose a variation of GANs that, given a model, generates realistic data that is classified with low confidence by a given classifier. The generated examples can be used in order to gain insights on the frontier between classes. We empirically evaluate our approach on two well-known image classification benchmark datasets, MNIST and Fashion MNIST. Results show that the approach is able to generate images that are closer to the frontier when compared to the original ones, but still realistic. Manual inspection confirms that some of those images are confusing even for humans.

  • 28
  • 440