Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
Publications

Publications by LIAAD

2018

Bandit-Based Automated Machine Learning

Authors
Das Dores, SCN; Soares, C; Ruiz, D;

Publication
2018 7TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS)

Abstract
Machine Learning (ML) has been successfully applied to a wide range of domains and applications. Since the number of ML applications is growing, there is a need for tools that boost the data scientist's productivity. Automated Machine Learning (AutoML) is the field of ML that aims to address these needs through the development of solutions which enable data science practitioners, experts and non-experts, to efficiently create fine-tuned predictive models with minimum intervention. In this paper, we present the application of the multi-armed bandit optimization algorithm Hyperband to address the AutoML problem of generating customized classification workflows, a combination of preprocessing methods and ML algorithms including hyperparameter optimization. Experimental results comparing the bandit-based approach against Auto ML Bayesian Optimization methods show that this new approach is superior to the state-of-the-art methods in the test evaluation and equivalent to them in a statistical analysis.

2018

Analysing the Footprint of Classifiers in Overlapped and Imbalanced Contexts

Authors
Mercier, M; Santos, MS; Abreu, PH; Soares, C; Soares, JP; Santos, J;

Publication
Advances in Intelligent Data Analysis XVII - 17th International Symposium, IDA 2018, 's-Hertogenbosch, The Netherlands, October 24-26, 2018, Proceedings

Abstract
It is recognised that the imbalanced data problem is aggravated by other difficulty factors, such as class overlap. Over the years, several research works have focused on this problematic, although presenting two major hitches: the limitation of test domains and the lack of a formulation of the overlap degree, which makes results hard to generalise. This work studies the performance degradation of classifiers with distinct learning biases in overlap and imbalanced contexts, focusing on the characteristics of the test domains (shape, dimensionality and imbalance ratio) and on to what extent our proposed overlapping measure (degOver) is aligned with the performance results observed. Our results show that MLP and CART classifiers are the most robust to high levels of class overlap, even for complex domains, and that KNN and linear SVM are the most aligned with degOver. Furthermore, we found that the dimensionality of data also plays an important role in explaining performance results. © Springer Nature Switzerland AG 2018.

2018

Label Expansion for Multi-Label Classification

Authors
Rivolli, A; Soares, C; de Carvalho, ACPLF;

Publication
2018 7TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS)

Abstract
In multi-label classification tasks, instances are simultaneously associated with multiple labels, representing different and, possibly, related concepts from a domain. One characteristic of these tasks is a high class-label imbalance. In order to obtain improved predictive models, several algorithms either have explored the label dependencies or have dealt with the problem of imbalanced labels. This work proposes a label expansion approach which combines both alternatives. For such, some labels are expanded with data from a related class label, making the labels more balanced and representative. Preliminary experiments show the effectiveness of this approach to improve the Binary Relevance strategy. Particularly, it reduced the number of labels that were never predicted in the test instances. Although the results are preliminary, they are potentially attractive, considering the scale and consistency of the improvement obtained, as well as the broad scope of the proposed approach.

2018

A Framework for Analytical Approaches to Combine Interpretable Models

Authors
Strecht, P; Moreira, JM; Soares, C;

Publication
Information Management and Big Data, 5th International Conference, SIMBig 2018, Lima, Peru, September 3-5, 2018, Proceedings.

Abstract
Analytic approaches to combine interpretable models, although presented in different contexts, can be generalized to highlight the components that can be specialized. We propose a framework that structures the combination process, formalizes the problems that can be solved in alternative ways and evaluates the combined models based on their predictive ability to replace the base ones, without loss of interpretability. The framework is illustrated with a case study using data from the University of Porto, Portugal, where experiments were carried out. The results show that grouping base models by scientific areas, ordering by the number of variables and intersecting their underlying rules creates conditions for the combined models to outperform them. © 2019, Springer Nature Switzerland AG.

2018

Generalizing Knowledge in Decentralized Rule-Based Models

Authors
Strecht, P; Moreira, JM; Soares, C;

Publication
ECML PKDD 2018 Workshops - DMLE 2018 and IoTStream 2018, Dublin, Ireland, September 10-14, 2018, Revised Selected Papers

Abstract
Knowledge generalization of ruled-based models, such as decision trees or decision rules, have emerged from different backgrounds. This particular kind of models, given their interpretability, offer several possibilities to be combined. Despite each distinct context, common patterns have emerged revealing the systemic nature of the problem. In this paper, we look at the problem of generalizing the knowledge contained in a set of models as a process formalizing the operations that can be addressed in alternative ways. We also include a set-up to evaluate gen-eralized models based on their ability to replace the base ones from a predictive performance perspective, without loss of interpretability.

2018

Providing proactiveness: Data analysis techniques portfolios

Authors
Sillitti, A; Anakabe, JF; Basurko, J; Dam, P; Ferreira, H; Ferreiro, S; Gijsbers, J; He, S; Hegedus, C; Holenderski, M; Hooghoudt, JO; Lecuona, I; Leturiondo, U; Marcelis, Q; Moldován, I; Okafor, E; de Sá, CR; Romero, R; Sarr, B; Schomaker, L; Shekar, AK; Soares, C; Sprong, H; Theodorsen, S; Tourwé, T; Urchegui, G; Webers, G; Yang, Y; Zubaliy, A; Zugasti, E; Zurutuza, U;

Publication
The MANTIS Book: Cyber Physical System Based Proactive Collaborative Maintenance

Abstract

  • 157
  • 429