Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
Publications

Publications by Luís Paulo Reis

2023

Using Deep Reinforcement Learning for Navigation in Simulated Hallways

Authors
Leao, G; Almeida, F; Trigo, E; Ferreira, H; Sousa, A; Reis, LP;

Publication
2023 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS, ICARSC

Abstract
Reinforcement Learning (RL) is a well-suited paradigm to train robots since it does not require any previous information or database to train an agent. This paper explores using Deep Reinforcement Learning (DRL) to train a robot to navigate in maps containing different sorts of obstacles and which emulate hallways. Training and testing were performed using the Flatland 2D simulator and a Deep Q-Network (DQN) provided by OpenAI gym. Different sets of maps were used for training and testing. The experiments illustrate how well the robot is able to navigate in maps distinct from the ones used for training by learning new behaviours (namely following walls) and highlight the key challenges when solving this task using DRL, including the appropriate definition of the state space and reward function, as well as of the stopping criteria during training.

2022

Forecasting Omicron Variant of Covid-19 with ANN Model in European Countries – Number of Cases, Deaths, and ICU Patients

Authors
Carvalho, K; Reis, LP; Teixeira, JP;

Publication
Communications in Computer and Information Science

Abstract
Accurate predictions of time series are increasingly required to support judgments in a variety of decisions. Several predictive models are available to support these predictions, depending on how each field offers a data variety with varied behavior. The use of artificial neural networks (ANN) at the beginning of the COVID-19 pandemic was significant since the tool may offer forecasting data for various conditions and hence assist in governing critical choices. In this context, this paper describes a system for predicting the daily number of cases, fatalities, and Intensive Care Unit (ICU) patients for the next 28 days in five European countries: Portugal, the United Kingdom, France, Italy, and Germany. The database selection is based on comparable mitigation processes to analyze the impact of safety procedure flexibilization with the most recent numbers of COVID-19. Additionally, it is intended to check the algorithm's adaptability to different variants throughout time. The network's input data has been normalized to account for the size of the countries in the study and smoothed by seven days. The mean absolute error (MAE) was employed as a comparing criterion of two datasets, one with data from the beginning of the pandemic and another with data from the last year, since all variables (cases, deaths, and ICU patients) may be tendentious in percentage analysis. The best architecture produced a general MAE prediction for the 28 days ahead of 256,53 daily cases, 0,59 daily deaths, and 1,63 ICU patients, all numbers normalized by million people. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.

2023

Machine Learning Data Markets: Evaluating the Impact of Data Exchange on the Agent Learning Performance

Authors
Baghcheband, H; Soares, C; Reis, LP;

Publication
PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2023, PT I

Abstract
In recent years, the increasing availability of distributed data has led to a growing interest in transfer learning across multiple nodes. However, local data may not be adequate to learn sufficiently accurate models, and the problem of learning from multiple distributed sources remains a challenge. To address this issue, Machine Learning Data Markets (MLDM) have been proposed as a potential solution. In MLDM, autonomous agents exchange relevant data in a cooperative relationship to improve their models. Previous research has shown that data exchange can lead to better models, but this has only been demonstrated with only two agents. In this paper, we present an extended evaluation of a simple version of the MLDM framework in a collaborative scenario. Our experiments show that data exchange has the potential to improve learning performance, even in a simple version of MLDM. The findings conclude that there exists a direct correlation between the number of agents and the gained performance, while an inverse correlation was observed between the performance and the data batch sizes. The results of this study provide important insights into the effectiveness of MLDM and how it can be used to improve learning performance in distributed systems. By increasing the number of agents, a more efficient system can be achieved, while larger data batch sizes can decrease the global performance of the system. These observations highlight the importance of considering both the number of agents and the data batch sizes when designing distributed learning systems using the MLDM framework.

2023

Deep Reinforcement Learning for Creating Advanced Humanoid Robotic Soccer Skills

Authors
Reis, LP;

Publication
Proceedings of the 20th International Conference on Informatics in Control, Automation and Robotics, ICINCO 2023, Rome, Italy, November 13-15, 2023, Volume 1

Abstract

2023

Knowledge Discovery for Risk Assessment in Economic and Food Safety

Authors
Silva, MC; Faria, BM; Reis, LP;

Publication
Proceedings of the 15th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2023, Volume 1: KDIR, Rome, Italy, November 13-15, 2023.

Abstract

2023

Deep Reinforcement Learning for Creating Advanced Humanoid Robotic Soccer Skills

Authors
Reis, LP;

Publication
Proceedings of the 15th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2023, Volume 1: KDIR, Rome, Italy, November 13-15, 2023.

Abstract

  • 86
  • 88