Publicacoes - INESC TEC

Publicações

Publicações por José Luís Borges

2021

Knowledge-Assisted Visualization of Multi-Level Origin-Destination Flows Using Ontologies

Autores
Sobral, T; Galvao, T; Borges, J;

Publicação
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS

Abstract
Origin-destination matrices help understand the movement of people within cities. This work is built upon the premise that stakeholders, e.g. decision makers, need to analyze mobility flows from spatio-temporal perspectives that are appropriate to their context of analysis. The data retrieved from sensors and Intelligent Transportation Systems are useful for this purpose due to their lower acquisition costs and fine granularity, although it is complex to use such data in an integrated way, as they might have heterogeneous representations of spatio-temporal attributes and granularities. Most of the related works on the analysis of OD flows consider matrices with a fixed spatio-temporal aggregation level, and do not explore the intrinsic issue of data heterogeneity. Herein we report our findings on building the semantic foundation of knowledge-assisted visualization tools for analyzing OD matrices from multiple stakeholder levels. We propose a set of ontology design patterns for modeling the semantics of OD data, and the relations between the spatio-temporal constructs that stakeholders ought to choose when visualizing urban mobility flows. Our approach aims to be reusable by researchers and practitioners. We describe a practical implementation using estimated flows from smart card data from Porto, Portugal.

FecharLer Abstract

2022

Understanding Overlap in Automatic Root Cause Analysis in Manufacturing Using Causal Inference

Autores
Oliveira, EE; Migueis, VL; Borges, JL;

Publicação
IEEE ACCESS

Abstract
Overlap has been identified in previous works as a significant obstacle to automated diagnosis using data mining algorithms, since it makes it impossible to discern how each machine influences product quality. Several solutions that handle overlap have been proposed, but the final result is a list of potential overlapped root causes. The goal of this paper is to develop a solution resilient to overlap that can determine the true root cause from a list of possible root causes, when possible, and determine the conditions in which it is possible to identify the root causes. This allows for a better understanding of overlap, and enables the development of a fully automatic root cause analysis for manufacturing. To do so, we propose an automatic root cause analysis approach that uses causal inference and do calculus to determine the true root cause. The proposed approach was validated on simulated and real case-study data, and allowed for an estimation of the effect of a product passing through a certain machine while disregarding the effect of overlap, in certain conditions. The results were on par with the state-of-the-art solutions capable of handling overlap. The contributions of this paper are a graphical definition of overlap, the identification of the conditions in which is possible to overcome the effect of overlap, and a solution that can present a single true root cause when such conditions are met.

FecharLer Abstract

2022

On the influence of overlap in automatic root cause analysis in manufacturing

Autores
Oliveira, EE; Migueis, VL; Borges, JL;

Publicação
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH

Abstract
To improve manufacturing processes, it is essential to find the root causes of occurring problems, in order to solve them permanently. Automatic Root Cause Analysis (ARCA) solutions aid analysts in finding such root causes, by using automatic data analysis to improve the digital decision. When trying to locate the root cause of a problem in a manufacturing process, a phenomenon can occur that disrupts the application of ARCA solutions. Overlap, as we denominated, is a phenomenon where local synchronicities in the manufacturing process lead to data where it is impossible to discern the influence of each location in the quality of products, which impedes automated diagnosis, especially when using classifiers. This paper identifies and defines overlap, and proposes a two-phase ARCA solution that uses factor-ranking algorithms, instead of classifiers. The proposed solution is evaluated in simulated and real case-study data. Results proved the presence of overlap in the datasets, and its negative impact on classifiers. The proposed solution has a positive performance detecting root causes even in the presence of overlap.

FecharLer Abstract

2024

Multidimensional subgroup discovery on event logs

Autores
Ribeiro, J; Fontes, T; Soares, C; Borges, JL;

Publicação
EXPERT SYSTEMS WITH APPLICATIONS

Abstract
Subgroup discovery (SD) aims at finding significant subgroups of a given population of individuals characterized by statistically unusual properties of interest. SD on event logs provides insight into particular behaviors of processes, which may be a valuable complement to the traditional process analysis techniques, especially for low -structured processes. This paper proposes a scalable and efficient method to search significant SD rules on frequent sequences of events, exploiting their multidimensional nature. With this method, it is intended to identify significant subsequences of events where the distribution of values of some target aspect is significantly different than the same distribution for the entire event log. A publicly available real -life event log of a Dutch hospital is used as a running example to demonstrate the applicability of our method. The proposed approach was applied on a real -life case study based on the public transport of a medium size European city (Porto, Portugal), for which the event data consists of 133 million smartcard travel validations from buses, trams and trains. The results include a characterization of mobility flows over multiple aspects, as well as the identification of unexpected behaviors in the flow of commuters (public transport). The generated knowledge provided a useful insight into the behavior of travelers, which can be applied at operational, tactical and strategic business levels, enhancing the current view of the transport services to transport authorities and operators.

FecharLer Abstract

2022

VisAC: An interactive tool for visual analysis of consanguinity in the ancestry of individuals

Autores
Borges, J;

Publicação
INFORMATION VISUALIZATION

Abstract
Genealogy studies are growing in popularity, and researchers are increasingly using visualization methods to summarize and communicate their findings. A family tree is a visual representation of family members and their relationships that is commonly used to support the research of a family's history and publish the results. In some cases, an ancestor may occur in more than one place in the lineage of an individual, which is one of the reasons for the occurrence of consanguineous marriages, that is, marriages between blood relative spouses. Current methods for family tree visualization were not designed to analyze and assess the level of consanguinity in the ancestry of individuals. This paper proposes VisAC, an interactive tool to support the visual analysis of consanguinity in individuals' ancestry. The inbreeding coefficient is used as a measure of consanguinity. The coefficient corresponds to an estimate of the probability that two alleles (a variant of a given gene) in the DNA were inherited from the same individual. A visualization design and an interactive tool were developed with genealogists' support. In addition, the feedback collected through a questionnaire about two demo videos and tests with three target users strongly supports the effectiveness of the family tree visual representation and the adequacy of the interactive tool for the exploratory analysis task. Real-world examples are given to illustrate the usefulness of the visualization design, and an example of exploratory analysis is presented to illustrate the use of the interactive tool. In summary, this work formulates the task of visual analysis of consanguinity in ancestors' trees and proposes VisAC, a new visualization tool to support the task.

FecharLer Abstract

2023

Overlap in Automatic Root Cause Analysis in Manufacturing: An Information Theory-Based Approach

Autores
Oliveira, EE; Migueis, VL; Borges, JL;

Publicação
APPLIED SCIENCES-BASEL

Abstract
Automatic Root Cause Analysis solutions aid analysts in finding problems' root causes by using automatic data analysis. When trying to locate the root cause of a problem in a manufacturing process, an issue-denominated overlap can occur. Overlap can impede automated diagnosis using algorithms, as the data make it impossible to discern the influence of each machine on the quality of products. This paper proposes a new measure of overlap based on an information theory concept called Positive Mutual Information. This new measure allows for a more detailed analysis. A new approach is developed for automatically finding the root causes of problems when overlap occurs. A visualization that depicts overlapped locations is also proposed to ease practitioners' analysis. The proposed solution is validated in simulated and real case-study data. Compared to previous solutions, the proposed approach improves the capacity to pinpoint a problem's root causes.

FecharLer Abstract