Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
Publications

Publications by LIAAD

2012

Next challenges for adaptive learning systems

Authors
Zliobaite, I; Bifet, A; Gaber, MM; Gabrys, B; Gama, J; Minku, LL; Musial, K;

Publication
SIGKDD Explorations

Abstract

2012

Identifying Relationships in Transactional Data

Authors
Rodrigues, M; Gama, J; Ferreira, CA;

Publication
ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2012

Abstract
Association rules is the traditional way used to study market basket or transactional data. One drawback of this analysis is the huge number of rules generated. As a complement to association rules, Association Rules Network (ARN), based on Social Network Analysis (SNA) has been proposed by several researchers. In this work we study a real market basket analysis problem, available in a Belgian supermarket, using ARNs. We learn ARNs by considering the relationships between items that appear more often in the consequent of the association rules. Moreover, we propose a more compact variant of ARNs: the Maximal Itemsets Social Network. In order to assess the quality of these structures, we compute SNA based metrics, like weighted degree and utility of community.

2012

Improving the offline clustering stage of data stream algorithms in scenarios with variable number of clusters

Authors
Faria, ER; Barros, RC; Gama, J; Carvalho, ACPLF;

Publication
Proceedings of the ACM Symposium on Applied Computing

Abstract
Many data stream clustering algorithms operate in two well-defined steps: (i) online statistical data collection stage; and (ii) offline macro-clustering stage. The well-known k-means algorithm is often employed for performing the offline macro-clustering step. The conventional k-means algorithm assumes that the number of clusters (k) is defined a priori by the user. Given the difficulty of defining the value of k a priori in real-world problems, we describe a new approach that allows estimating k dynamically from streams with variable number of clusters, which is a common scenario in data with a non-stationary distribution. In addition, we combine our dynamic approach with two different strategies for initializing the centroids during the offline clustering. Analysis of results suggest that, using the dynamic approach, the method k-means++ for centroids initialization present better results. © 2012 Authors.

2012

Very fast decision rules for multi-class problems

Authors
Kosina, P; Gama, J;

Publication
Proceedings of the ACM Symposium on Applied Computing

Abstract
Decision rules are one of the most interpretable and flexible models for data mining prediction tasks. Till now, few works presented online, any-time and one-pass algorithms for learning decision rules in the stream mining scenario. A quite recent algorithm, the Very Fast Decision Rules (VFDR), learns set of rules, where each rule discriminates one class from all the other. In this work we extend the VFDR algorithm by decomposing a multi-class problem into a set of two-class problems and inducing a set of discriminative rules for each binary problem. The proposed algorithm maintains all properties required when learning from stationary data streams: online and any-time classifiers, processing each example once. Moreover, it is able to learn ordered and unordered rule sets. The new approach is evaluated on various real and artificial datasets. The new algorithm improves the performance of the previous version and is competitive with the state-of-the-art decision tree learning method for data streams. © 2012 ACM.

2012

A density-based clustering approach for behavior change detection in data streams

Authors
Vallim, RMM; Filho, JAA; Carvalho, ACPLF; Gama, J;

Publication
Proceedings - Brazilian Symposium on Neural Networks, SBRN

Abstract
Mining data streams poses many challenges to existing Machine Learning algorithms. Algorithms designed to learn in this scenario need to constantly update their decision models in accordance with current data behavior. Therefore, the ability to detect when the behavior of the stream is changing is an important feature of any learning technique approaching data streams. This work is concerned with unsupervised behavior change detection. It suggests the use of density-based clustering and an entropy measurement for change detection that is independent of the number and format of clusters. The proposed approach uses a modified version of the Den Stream algorithm that is designed to better cope with the entropy calculation. Experimental results using synthetic data provide insight on how clustering and novelty detection algorithms can be used for change detection in data streams. © 2012 IEEE.

2012

Mobile data stream mining: From algorithms to applications

Authors
Krishnaswamy, S; Gama, J; Gaber, MM;

Publication
Proceedings - 2012 IEEE 13th International Conference on Mobile Data Management, MDM 2012

Abstract
This paper presents an overview of the current state-of-the-art in mobile data stream mining. This area of mobile data stream mining is significant for a number of new application domains such as mobile crowd sensing and mobile activity recognition. The paper presents the strategies and techniques for adaptation that are essential in order to perform real-time, continuous data mining on mobile devices. We present an overview of the algorithms research in this area. Finally, we discuss the key toolkits, systems and applications of mobile data stream mining. © 2012 IEEE.

  • 301
  • 430