Publications

Publications by LIAAD

2024

Recommendation Systems in E-commerce: Link Prediction in Multilayer Bipartite Networks

Authors
Ramoa, L; Campos, P;

Publication
Digital Transformation and Enterprise Information Systems

Abstract
As we delve into how technology enhances supply chain management efficiency and tackles specific e-business challenges, we must recognize the critical synergy with recommendation systems. These systems align with digital transformation goals, enhancing customer experiences, enabling data-driven decisions, promoting innovation, and embracing a customer-centric approach. During the 2020 COVID-19 surge, e-commerce experienced increased activity, highlighting the significance of recommendation systems in forecasting new purchases. This chapter introduces a novel approach to understanding customer–product interactions through multilayer bipartite networks, employing a hybrid recommendation system with k-means and weighted slope one algorithms. This approach enhances clarity, explainability, and information gains, aiding tasks like inventory optimization. The study concludes that the model’s predicted results differ from the actual ratings and that the system is effective in improving decision-making processes and customer recommendations. © 2025 selection and editorial matter, Adelaide Martins and Carolina Machado.

CloseRead Abstract

2024

Federated Learning in Medical Image Analysis: A Systematic Survey

Authors
da Silva, FR; Camacho, R; Tavares, JMRS;

Publication
ELECTRONICS

Abstract
Medical image analysis is crucial for the efficient diagnosis of many diseases. Typically, hospitals maintain vast repositories of images, which can be leveraged for various purposes, including research. However, access to such image collections is largely restricted to safeguard the privacy of the individuals whose images are being stored, as data protection concerns come into play. Recently, the development of solutions for Automated Medical Image Analysis has gained significant attention, with Deep Learning being one solution that has achieved remarkable results in this area. One promising approach for medical image analysis is Federated Learning (FL), which enables the use of a set of physically distributed data repositories, usually known as nodes, satisfying the restriction that the data do not leave the repository. Under these conditions, FL can build high-quality, accurate deep-learning models using a lot of available data wherever it is. Therefore, FL can help researchers and clinicians diagnose diseases and support medical decisions more efficiently and robustly. This article provides a systematic survey of FL in medical image analysis, specifically based on Magnetic Resonance Imaging, Computed Tomography, X-radiography, and histology images. Hence, it discusses applications, contributions, limitations, and challenges and is, therefore, suitable for those who want to understand how FL can contribute to the medical imaging domain.

CloseRead Abstract

2024

Imitation learning for aerobatic maneuvering in fixed-wing aircraft

Authors
Freitas, H; Camacho, R; Silva, DC;

Publication
JOURNAL OF COMPUTATIONAL SCIENCE

Abstract
This study focuses on the task of developing automated models for complex aerobatic aircraft maneuvers. The approach employed here utilizes Behavioral Cloning, a technique in which human pilots supply a series of sample maneuvers. These maneuvers serve as training data for a Machine Learning algorithm, enabling the system to generate control models for each maneuver. The optimal instances for each maneuver were chosen based on a set of objective evaluation criteria. By utilizing these selected sets of examples, resilient models were developed, capable of reproducing the maneuvers performed by the human pilots who supplied the examples. In certain instances, these models even exhibited superior performance compared to the pilots themselves, a phenomenon referred to as the clean-up effect. We also explore the application of transfer learning to adapt the developed controllers to various airplane models, revealing compelling evidence that transfer learning is effective for refining them for targeted aircraft. A comprehensive set of intricate maneuvers was executed through a meta -controller capable of orchestrating the fundamental maneuvers acquired through imitation. This undertaking yielded promising outcomes, demonstrating the proficiency of several Machine Learning models in successfully executing highly intricate aircraft maneuvers.

CloseRead Abstract

2024

Spatio-Temporal Parallel Transformer Based Model for Traffic Prediction

Authors
Kumar, R; Mendes-moreira, J; Chandra, J;

Publication
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA

Abstract
Traffic forecasting problems involve jointly modeling the non-linear spatio-temporal dependencies at different scales. While graph neural network models have been effectively used to capture the non-linear spatial dependencies, capturing the dynamic spatial dependencies between the locations remains a major challenge. The errors in capturing such dependencies propagate in modeling the temporal dependencies between the locations, thereby severely affecting the performance of long-term predictions. While transformer-based mechanisms have been recently proposed for capturing the dynamic spatial dependencies, these methods are susceptible to fluctuations in data brought on by unforeseen events like traffic congestion and accidents. To mitigate these issues we propose an improvised spatio-temporal parallel transformer (STPT) based model for traffic prediction that uses multiple adjacency graphs passed through a pair of coupled graph transformer- convolution network units, operating in parallel, to generate more noise-resilient embeddings. We conduct extensive experiments on 4 real-world traffic datasets and compare the performance of STPT with several state-of-the-art baselines, in terms of measures like RMSE, MAE, and MAPE. We find that using STPT improves the performance by around 10 - 34% as compared to the baselines. We also investigate the applicability of the model on other spatio-temporal data in other domains. We use a Covid-19 dataset to predict the number of future occurrences in different regions from a given set of historical occurrences. The results demonstrate the superiority of our model for such datasets.

CloseRead Abstract

2024

KDBI special issue: Explainability feature selection framework application for LSTM multivariate time-series forecast self optimization

Authors
Rodrigues, EM; Baghoussi, Y; Mendes-Moreira, J;

Publication
EXPERT SYSTEMS

Abstract
Deep learning models are widely used in multivariate time series forecasting, yet, they have high computational costs. One way to reduce this cost is by reducing data dimensionality, which involves removing unimportant or low importance information with the proper method. This work presents a study on an explainability feature selection framework composed of four methods (IMV-LSTM Tensor, LIME-LSTM, Average SHAP-LSTM, and Instance SHAP-LSTM) aimed at using the LSTM black-box model complexity to its favour, with the end goal of improving the error metrics and reducing the computational cost on a forecast task. To test the framework, three datasets with a total of 101 multivariate time series were used, with the explainability methods outperforming the baseline methods in most of the data, be it in error metrics or computation time for the LSTM model training.

CloseRead Abstract

2024

Online boxplot derived outlier detection

Authors
Mazarei, A; Sousa, R; Mendes-Moreira, J; Molchanov, S; Ferreira, HM;

Publication
INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS

Abstract
Outlier detection is a widely used technique for identifying anomalous or exceptional events across various contexts. It has proven to be valuable in applications like fault detection, fraud detection, and real-time monitoring systems. Detecting outliers in real time is crucial in several industries, such as financial fraud detection and quality control in manufacturing processes. In the context of big data, the amount of data generated is enormous, and traditional batch mode methods are not practical since the entire dataset is not available. The limited computational resources further compound this issue. Boxplot is a widely used batch mode algorithm for outlier detection that involves several derivations. However, the lack of an incremental closed form for statistical calculations during boxplot construction poses considerable challenges for its application within the realm of big data. We propose an incremental/online version of the boxplot algorithm to address these challenges. Our proposed algorithm is based on an approximation approach that involves numerical integration of the histogram and calculation of the cumulative distribution function. This approach is independent of the dataset's distribution, making it effective for all types of distributions, whether skewed or not. To assess the efficacy of the proposed algorithm, we conducted tests using simulated datasets featuring varying degrees of skewness. Additionally, we applied the algorithm to a real-world dataset concerning software fault detection, which posed a considerable challenge. The experimental results underscored the robust performance of our proposed algorithm, highlighting its efficacy comparable to batch mode methods that access the entire dataset. Our online boxplot method, leveraging dataset distribution to define whiskers, consistently achieved exceptional outlier detection results. Notably, our algorithm demonstrated computational efficiency, maintaining constant memory usage with minimal hyperparameter tuning.

CloseRead Abstract