Cookies
O website necessita de alguns cookies e outros recursos semelhantes para funcionar. Caso o permita, o INESC TEC irá utilizar cookies para recolher dados sobre as suas visitas, contribuindo, assim, para estatísticas agregadas que permitem melhorar o nosso serviço. Ver mais
Aceitar Rejeitar
  • Menu
Publicações

Publicações por LIAAD

2015

Data Stream Classification Based on the Gamma Classifier

Autores
Valeria Uriarte Arcia, AV; Lopez Yanez, I; Yanez Marquez, C; Gama, J; Camacho Nieto, O;

Publicação
MATHEMATICAL PROBLEMS IN ENGINEERING

Abstract
The ever increasing data generation confronts us with the problem of handling online massive amounts of information. One of the biggest challenges is how to extract valuable information from these massive continuous data streams during single scanning. In a data stream context, data arrive continuously at high speed; therefore the algorithms developed to address this context must be efficient regarding memory and time management and capable of detecting changes over time in the underlying distribution that generated the data. This work describes a novel method for the task of pattern classification over a continuous data stream based on an associative model. The proposed method is based on the Gamma classifier, which is inspired by the Alpha-Beta associative memories, which are both supervised pattern recognition models. The proposed method is capable of handling the space and time constrain inherent to data stream scenarios. The Data Streaming Gamma classifier (DS-Gamma classifier) implements a sliding window approach to provide concept drift detection and a forgetting mechanism. In order to test the classifier, several experiments were performed using different data stream scenarios with real and synthetic data streams. The experimental results show that the method exhibits competitive performance when compared to other state-of-the-art algorithms.

2015

Concept Drift Detection with Clustering via Statistical Change Detection Methods

Autores
Sakamoto, Y; Fukui, K; Gama, J; Nicklas, D; Moriyama, K; Numao, M;

Publicação
2015 Seventh International Conference on Knowledge and Systems Engineering (KSE)

Abstract
We propose a concept drift detection method utilizing statistical change detection in which a drift detection method and the Page-Hinkley test are employed. Our method enables users to annotate clustering results without constructing a model of drift detection for every input. In our experiments using synthetic data, we evaluated our proposed method on the basis of detection delay and false detection, also revealed relations between the degree of drift and parameters of the method.

2015

Data Stream Classification Guided by Clustering on Nonstationary Environments and Extreme Verification Latency

Autores
Souza, VMAd; Silva, DF; Gama, J; Batista, GEAPA;

Publicação
Proceedings of the 2015 SIAM International Conference on Data Mining, Vancouver, BC, Canada, April 30 - May 2, 2015

Abstract
Data stream classification algorithms for nonstationary environments frequently assume the availability of class labels, instantly or with some lag after the classification. However, certain applications, mainly those related to sensors and robotics, involve high costs to obtain new labels during the classification phase. Such a scenario in which the actual labels of processed data are never available is called extreme verification latency. Extreme verification latency requires new classification methods capable of adapting to possible changes over time without external supervision. This paper presents a fast, simple, intuitive and accurate algorithm to classify nonstationary data streams in an extreme verification latency scenario, namely Stream Classification Algorithm Guided by Clustering - SCARGC. Our method consists of a clustering followed by a classification step applied repeatedly in a closed loop fashion. We show in several classification tasks evaluated in synthetic and real data that our method is faster and more accurate than the state-of-the-art. Copyright © SIAM.

2015

Links between Scores, Real Default and Pricing: Evidence from the Freddie Mac’s Loan-Level Dataset

Autores
Rocha Sousa, M; Gama, J; Brandão, E;

Publicação
Journal of Economics, Business and Management

Abstract

2015

Special track on data streams

Autores
Rodrigues, PP; Bifet, A; Krishnaswamy, S; Gama, J;

Publicação
Proceedings of the ACM Symposium on Applied Computing

Abstract

2015

Keynote speaker 2: Real time data mining

Autores
Gama, J;

Publicação
2015 IEEE International Conference on Evolving and Adaptive Intelligent Systems, EAIS 2015, Douai, France, December 1-3, 2015

Abstract

  • 240
  • 430