Publicacoes - INESC TEC

Publicações

Publicações por LIAAD

2017

Mind the Gap: A Well Log Data Analysis

Autores
Lopes, RuiL.; Jorge, Alipio;

Publicação
CoRR

Abstract

2017

Proceedings of the Workshop on Data Mining for Oil and Gas

Autores
Jorge, AlipioMario; Larrazábal, German; Guillén, Pablo; Lopes, RuiL.;

Publicação
CoRR

Abstract

2017

Relevance-Based Evaluation Metrics for Multi-class Imbalanced Domains

Autores
Branco, P; Torgo, L; Ribeiro, RP;

Publicação
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2017, PT I

Abstract
The class imbalance problem is a key issue that has received much attention. This attention has been mostly focused on two-classes problems. Fewer solutions exist for the multi-classes imbalance problem. From an evaluation point of view, the class imbalance problem is challenging because a non-uniform importance is assigned to the classes. In this paper, we propose a relevance-based evaluation framework that incorporates user preferences by allowing the assignment of differentiated importance values to each class. The presented solution is able to overcome difficulties detected in existing measures and increases discrimination capability. The proposed framework requires the assignment of a relevance score to the problem classes. To deal with cases where the user is not able to specify each class relevance, we describe three mechanisms to incorporate the existing domain knowledge into the relevance framework. These mechanisms differ in the amount of information available and assumptions made regarding the domain. They also allow the use of our framework in common settings of multi-class imbalanced problems with different levels of information available. © 2017, Springer International Publishing AG.

FecharLer Abstract

2017

Exploring Resampling with Neighborhood Bias on Imbalanced Regression Problems

Autores
Branco, P; Torgo, L; Ribeiro, RP;

Publicação
PROGRESS IN ARTIFICIAL INTELLIGENCE (EPIA 2017)

Abstract
Imbalanced domains are an important problem that arises in predictive tasks causing a loss in the performance of the most relevant cases for the user. This problem has been intensively studied for classification problems. Recently it was recognized that imbalanced domains occur in several other contexts and for a diversity of types of tasks. This paper focus on imbalanced regression tasks. Resampling strategies are among the most successful approaches to imbalanced domains. In this work we propose variants of existing resampling strategies that are able to take into account the information regarding the neighborhood of the examples. Instead of performing sampling uniformly, our proposals bias the strategies for reinforcing some regions of the data sets. In an extensive set of experiments we provide evidence of the advantage of introducing a neighborhood bias in the resampling strategies.

FecharLer Abstract

2017

SMOGN: a Pre-processing Approach for Imbalanced Regression

Autores
Branco, P; Torgo, L; Ribeiro, RP;

Publicação
First International Workshop on Learning with Imbalanced Domains: Theory and Applications, LIDTA@PKDD/ECML 2017, 22 September 2017, Skopje, Macedonia

Abstract

2017

Proceedings of the Workshop on IoT Large Scale Learning from Data Streams co-located with the 2017 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD 2017), Skopje, Macedonia, September 18-22, 2017

Autores
Mouchaweh, MS; Bifet, A; Bouchachia, H; Gama, J; Ribeiro, RP;

Publicação
IOTSTREAMING@PKDD/ECML

Abstract