Publicacoes - INESC TEC

Publicações

Publicações por Pedro Pereira Rodrigues

2011

Learning from medical data streams: An introduction

Autores
Rodrigues, PP; Pechenizkiy, M; Gaber, MM; Gama, J;

Publicação
CEUR Workshop Proceedings

Abstract
Clinical practice and research are facing a new challenge created by the rapid growth of health information science and technology, and the complexity and volume of biomedical data. Machine learning from medical data streams is a recent area of research that aims to provide better knowledge extraction and evidence-based clinical decision support in scenarios where data are produced as a continuous flow. This year's edition of AIME, the Conference on Artificial Intelligence in Medicine, enabled the sound discussion of this area of research, mainly by the inclusion of a dedicated workshop. This paper is an introduction to LEMEDS, the Learning from Medical Data Streams workshop, which highlights the contributed papers, the invited talk and expert panel discussion, as well as related papers accepted to the main conference.

FecharLer Abstract

2012

Editorial message: Special track on data streams

Autores
Rodrigues, PP; Bifet, A; Krishnaswamy, S; Gama, J;

Publicação
Proceedings of the ACM Symposium on Applied Computing

Abstract

2009

Special track on data streams

Autores
Gama, J; Carvalho, A; Rodrigues, PP; Aguilar, J;

Publicação
Proceedings of the ACM Symposium on Applied Computing

Abstract

2008

Hierarchical clustering of time-series data streams

Autores
Rodrigues, PP; Gama, J; Pedroso, JP;

Publicação
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING

Abstract
This paper presents and analyzes an incremental system for clustering streaming time series. The Online Divisive-Agglomerative Clustering (ODAC) system continuously maintains a tree-like hierarchy of clusters that evolves with data, using a top-down strategy. The splitting criterion is a correlation-based dissimilarity measure among time series, splitting each node by the farthest pair of streams. The system also uses a merge operator that reaggregates a previously split node in order to react to changes in the correlation structure between time series. The split and merge operators are triggered in response to changes in the diameters of existing clusters, assuming that in stationary environments, expanding the structure leads to a decrease in the diameters of the clusters. The system is designed to process thousands of data streams that flow at a high rate. The main features of the system include update time and memory consumption that do not depend on the number of examples in the stream. Moreover, the time and memory required to process an example decreases whenever the cluster structure expands. Experimental results on artificial and real data assess the processing qualities of the system, suggesting a competitive performance on clustering streaming time series, exploring also its ability to deal with concept drift.

FecharLer Abstract

2006

ODAC: Hierarchical Clustering of Time Series Data Streams

Autores
Rodrigues, PP; Gama, J; Pedroso, JP;

Publicação
PROCEEDINGS OF THE SIXTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING

Abstract
This paper presents a time series whole clustering system that incrementally constructs a tree-like hierarchy of clusters, using a top-down strategy. The Online Divisive-Agglomerative Clustering (ODAC) system uses a correlation-based dissimilarity measure between time series over a data stream and possesses an agglomerative phase to enhance a dynamic behavior capable of concept drift detection. Main features include splitting and agglomerative criteria based on the diameters of existing clusters and supported by a. significance level. At each new example, only the leaves are updated, reducing computation of unneeded dissimilarities and speeding up the process every time the structure grows. Experimental results on artificial and real data suggest competitive performance on clustering time series and show that the system is equivalent to a batch divisive clustering on stationary time series, being also capable of dealing with concept drift. With this work, we assure the possibility and importance of hierarchical incremental time series whole clustering in the data stream paradigm, presenting a. valuable and usable option.

FecharLer Abstract

2024

Achieving rapid and significant results in healthcare services by using the theory of constraints

Autores
Bacelar Silva, GM; Cox, JF III; Rodrigues, P;

Publicação
HEALTH SYSTEMS

Abstract
Lack of timeliness and capacity are seen as fundamental problems that jeopardise healthcare delivery systems everywhere. Many believe the shortage of medical providers is causing this timeliness problem. This action research presents how one doctor implemented the theory of constraints (TOC) to improve the throughput (quantity of patients treated) of his ophthalmology imaging practice by 64% in a few weeks with little to no expense. The five focusing steps (5FS) guided the TOC implementation - which included the drum-buffer-rope scheduling and buffer management - and occurred in a matter of days. The implementation provided significant bottom-line results almost immediately. This article explains each step of the 5FS in general terms followed by specific applications to healthcare services, as well as the detailed use in this action research. Although TOC successfully addressed the practice problems, this implementation was not sustained after the TOC champion left the organisation. However, this drawback provided valuable knowledge. The article provides insightful knowledge to help readers implement TOC in their environments to provide immediate and significant results at little to no expense.

FecharLer Abstract