Publications

Publications by LIAAD

2021

Report on the 4th international workshop on narrative extraction from texts (Text2Story 2021) at ECIR 2021

Authors
Campos, R; Jorge, AM; Jatowt, A; Bhatia, S; Finlayson, MA; Cordeiro, JP; Rocha, C; Ribeiro, A; Mansouri, B; Ansah, J; Pasquali, A;

Publication
SIGIR Forum

Abstract

2021

Improving Portuguese Semantic Role Labeling with Transformers and Transfer Learning

Authors
Oliveira, S; Loureiro, D; Jorge, A;

Publication
2021 IEEE 8TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA)

Abstract
The Natural Language Processing task of determining Who did what to whom is called Semantic Role Labeling. For English, recent methods based on Transformer models have allowed for major improvements in this task over the previous state of the art. However, for low resource languages, like Portuguese, currently available semantic role labeling models are hindered by scarce training data. In this paper, we explore a model architecture with only a pre-trained Transformer-based model, a linear layer, softmax and Viterbi decoding. We substantially improve the state-of-the-art performance in Portuguese by over 15 F1. Additionally, we improve semantic role labeling results in Portuguese corpora by exploiting cross-lingual transfer learning using multilingual pre-trained models, and transfer learning from dependency parsing in Portuguese, evaluating the various proposed approaches empirically.

CloseRead Abstract

2021

Transformers and Transfer Learning for Improving Portuguese Semantic Role Labeling

Authors
Oliveira, S; Loureiro, D; Jorge, A;

Publication
CoRR

Abstract

2021

A Survey on Data-Driven Predictive Maintenance for the Railway Industry

Authors
Davari, N; Veloso, B; Costa, GD; Pereira, PM; Ribeiro, RP; Gama, J;

Publication
SENSORS

Abstract
In the last few years, many works have addressed Predictive Maintenance (PdM) by the use of Machine Learning (ML) and Deep Learning (DL) solutions, especially the latter. The monitoring and logging of industrial equipment events, like temporal behavior and fault events-anomaly detection in time-series-can be obtained from records generated by sensors installed in different parts of an industrial plant. However, such progress is incipient because we still have many challenges, and the performance of applications depends on the appropriate choice of the method. This article presents a survey of existing ML and DL techniques for handling PdM in the railway industry. This survey discusses the main approaches for this specific application within a taxonomy defined by the type of task, employed methods, metrics of evaluation, the specific equipment or process, and datasets. Lastly, we conclude and outline some suggestions for future research.

CloseRead Abstract

2021

Predictive maintenance based on anomaly detection using deep learning for air production unit in the railway industry

Authors
Davari, N; Veloso, B; Ribeiro, RP; Pereira, PM; Gama, J;

Publication
2021 IEEE 8TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA)

Abstract
Predictive maintenance methods assist early detection of failures and errors in machinery before they reach critical stages. This study proposes a data-driven predictive maintenance framework for the air production unit (APU) system of a train of Metro do Porto by deep learning based on a sparse autoencoder (SAE) network that efficiently detects abnormal data and considerably reduces the false alarm rate. Several analog and digital sensors installed on the APU system allow the detection of behavioral changes and deviations from the normal pattern by analyzing the collected data. We implemented two versions of the SAE network in which we inputted analog sensors data and digital sensors data, and the experimental results show that the failures due to air leakage problems are predicted by analog sensors data while other types of failures are identified by digital sensors data. A low pass filter is applied to the output of the SAE network, and a sequence of abnormal data is used as an alarm for the APU system failure. Performance indicators of the SAE network with digital sensors data, in terms of F1 Score, Recall, and Precision, are respectively, about 33.6%, 42%, and 28% better than those of the SAE network with analog sensors data. For comparison purposes, we also implemented a variational autoencoder (VAE). The results show that SAE performance is better than that of VAE by 14%, 77%, and 37% respectively, for Recall, Precision and F1 Score.

CloseRead Abstract

2021

Current Trends in Learning from Data Streams

Authors
Gama, J; Veloso, B; Aminian, E; Ribeiro, RP;

Publication
9TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS, BDA 2021

Abstract
This article presents our recent work on the topic of learning from data streams. We focus on emerging topics, including fraud detection, learning from rare cases, and hyper-parameter tuning for streaming data. © 2021, Springer Nature Switzerland AG.

CloseRead Abstract