Publications

Publications by HumanISE

2024

Enhancing Medical Imaging Through Data Augmentation: A Review

Authors
Teixeira, B; Pinto, G; Filipe, V; Teixeira, A;

Publication
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS-ICCSA 2024 WORKSHOPS, PT II

Abstract
This article conducts a comprehensive review of the existing literature on data augmentation and data generation techniques within the context of medical image processing. Addressing the challenges associated with building sizable medical image datasets, including the rarity of certain medical conditions, patient privacy concerns, the need for expert labeling, and the associated expenses, this review focuses on methodologies aimed at enhancing the volume and diversity of available data. Special emphasis is placed on techniques such as data augmentation and data generation, with a particular interest in their application to medical image datasets. The objective is to provide a synthesis of current research, methodologies, and advancements in this domain, offering insights into the state-of-the-art practices and identifying potential avenues for future developments in medical image data augmentation.

CloseRead Abstract

2024

Application of a Genetic Algorithm for Optimising the Location of Electric Vehicle Charging Stations

Authors
Pinto, J; Mejia, MA; Macedo, LH; Filipe, V; Pinto, T;

Publication
Progress in Artificial Intelligence - 23rd EPIA Conference on Artificial Intelligence, EPIA 2024, Viana do Castelo, Portugal, September 3-6, 2024, Proceedings, Part II

Abstract
The number of electric vehicles has been increasing significantly due to various factors, such as the higher prices of fossil fuels, concerns about the increasing pollution, and the resulting incentive to use energy from renewable sources. There are currently a few charging facilities, which are still quite scattered, and several are still experimental, requiring appropriate planning of this infrastructure in order to support the growing number of electric vehicles adequately. Thus, optimising the location of charging stations becomes a critical issue, which can be achieved through the application of mathematical models and data analysis tools. An example is genetic algorithms, which have demonstrated their versatility in solving complex optimisation problems, especially those involving multiple variables. This work presents a proposal for a more comprehensive genetic algorithm model that encompasses all variables from the perspectives of all entities involved. Its experimentation was conducted using real data, with the aim of finding the best combination of locations, minimising the total number of stations and maximising the coverage of the area under study. Thus, it is essential to carefully consider user preferences, accessibility, energy demand, and existing electrical infrastructure to ensure an effective and sustainable installation. The findings highlight the crucial role of these computing tools in addressing complex problems from various viewpoints, leading to solutions that cater to the needs of all parties involved. While not necessarily perfect, these solutions represent a balanced compromise across multiple dimensions of the problem. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

CloseRead Abstract

2024

Automatic Food Labels Reading System

Authors
Pires, D; Filipe, V; Gonçalves, L; Sousa, A;

Publication
Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST

Abstract
Growing obesity has been a worldwide issue for several years. This is the outcome of common nutritional disorders which results in obese individuals who are prone to many diseases. Managing diet while simultaneously dealing with the obligations of a working adult can be difficult. Today, people have a very fast-paced life and sometimes neglect food choices. In order to simplify the interpretation of the Nutri-score labeling this paper proposes a method capable of automatically reading food labels with this format. This method is intended to support users when choosing the products to buy based on the letter identification of the label. For this purpose, a dataset was created, and a prototype mobile application was developed using a deep learning network to recognize the Nutri-score information. Although the final solution is still in progress, the reading module, which includes the proposed method, achieved an encouraging and promising accuracy (above 90%). The upcoming developments of the model include information to the user about the nutritional value of the analyzed product combining it’s Nutri-score label and composition. © ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering 2024.

CloseRead Abstract

2024

Detection of Landmarks in X-Ray Images Through Deep Learning

Authors
Fernandes, M; Filipe, V; Sousa, A; Gonçalves, L;

Publication
Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST

Abstract
This paper presents a study on the automated detection of landmarks in medical x-ray images using deep learning techniques. In this work we developed two neural networks based on semantic segmentation to automatically detect landmarks in x-ray images, using a dataset of 200 encephalogram images: the UNet architecture and the FPN architecture. The UNet and FPN architectures are compared and it can be concluded that the FPN model, with IoU=0.91, is more robust and accurate in predicting landmarks. The study also had the goal of direct application in a medical context of diagnosing the models and their predictions. Our research team also developed a metric analysis, based on the encephalograms in the dataset, on the type of Mandibular Occlusion of the patients, thus allowing a fast and accurate response in the identification and classification of a diagnosis. The paper highlights the potential of deep learning for automating the detection of anatomical landmarks in medical imaging, which can save time, improve diagnostic accuracy, and facilitate treatment planning. We hope to develop a universal model in the future, capable of evaluating any type of metric using image segmentation. © ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering 2024.

CloseRead Abstract

2024

Pylung: A Supporting Tool for Comparative Study of ViT and CNN-Based Models Used for Lung Nodules Classification

Authors
Marques, F; Pestana, P; Filipe, V;

Publication
Lecture Notes in Networks and Systems

Abstract
Lung cancer is a significant global health concern, and accurate classification of lung nodules plays a crucial role in its early detection and treatment. This paper evaluates and compares the performance of Vision Transformer (ViT) and Convolutional Neural Network (CNN) models for lung nodule classification using the Pylung tool proposed in this work. The study aims to address the lack of research on ViT in lung nodule classification and proposes ViT as an alternative to CNN. The Lung Image Database Consortium and Image Database Resource Initiative (LIDC-IDRI) dataset is utilized for training and evaluation. The Pylung tool is employed for dataset preprocessing and comparison of models. Three models, ViT, VGG16, and ResNet50, are analyzed, and their hyperparameters are optimized using Optuna. The results show that ViT achieves the highest accuracy (99.06%) in nodule classification compared to VGG16 (98.71%) and ResNet50 (98.46%). The study contributes by introducing ViT as a model for lung nodule classification, presenting the Pylung tool for model comparison, and suggesting further investigations to improve the accuracy. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.

CloseRead Abstract

2024

Performance Analysis of CNN Models in the Detection and Classification of Diabetic Retinopathy

Authors
Lúcio, F; Filipe, V; Gonçalves, L;

Publication
Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST

Abstract
This study focuses on investigating different CNN architectures and assessing their effectiveness in classifying Diabetic Retinopathy, a diabetes-associated disease that ranks among the primary causes of adult blindness. However, early detection can significantly prevent its debilitating consequences. While regular screening is advised for diabetic patients, limited access to specialized medical professionals can hinder its implementation. To address this challenge, deep learning techniques provide promising solutions, primarily through their application in the analysis of fundus retina images for diagnosis. Several CNN architectures, including MobileNetV2, VGG16, VGG19, InceptionV3, InceptionResNetV2, Xception, DenseNet121, ResNet50, ResNet50V2, and EfficientNet (ranging from EfficientNetB0 to EfficientNetB6), were implemented to assess and analyze their performance in classifying Diabetic Retinopathy. The dataset comprised 3662 Fundus retina images. Prior to training, the networks underwent pre-training using the ImageNet database, with a Gaussian filter applied to the images as a preprocessing step. As a result, the Efficient-Net stands out for achieving the best performance results with a good balance between model size and computational efficiency. By utilizing the EfficientNetB2 network, a model was trained with an accuracy of 85% and a screening capability of 98% for Diabetic Retinopathy. This model holds the potential to be implemented during the screening stages of Diabetic Retinopathy, aiding in the early identification of individuals at risk. © ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering 2024.

CloseRead Abstract