Ricardo Pereira Cruz

O website necessita de alguns cookies e outros recursos semelhantes para funcionar. Caso o permita, o INESC TEC irá utilizar cookies para recolher dados sobre as suas visitas, contribuindo, assim, para estatísticas agregadas que permitem melhorar o nosso serviço. Ver mais

Instituição
Investigação
Domínios de Investigação
Inteligência Artificial

Bioengenharia

Comunicações

Ciência e Engenharia dos Computadores
Fotónica

Sistemas de Energia

Robótica

Engenharia e Gestão de Sistemas
CENTROS DE INVESTIGAÇÃO
Porto, Portugal

+351 222 094 000

info@inesctec.pt
Inovação
Inovação / Tec4

TEC4AGRO-FOOD

TEC4ENERGY

TEC4HEALTH

TEC4INDUSTRY

TEC4SEA

TECPARTNERSHIPS

Tecnologias Disponíveis
Porto, Portugal

+351 222 094 000

info@inesctec.pt
Laboratórios
Laboratórios de Investigação

iilab
Comunicação
Notícias

Eventos

Media

Boletim Informativo
Porto, Portugal

+351 222 094 000

info@inesctec.pt
Junte-se a nós
Contactos

Home
Pessoas
Ricardo Pereira Cruz

Ler apresentação completa

Ricardo P. M. Cruz é Professor Auxiliar na Faculdade de Engenharia da Universidade do Porto e investigador no INESC TEC. O seu trabalho centra-se em machine learning, particularmente em deep learning e visão computacional. Licenciado em Ciência da Computação (2012), Mestrado em Engenharia Matemática (2015), ambos pela Universidade do Porto, e Doutorado em Informática (2021) pela Universidade do Porto, Aveiro e Minho. Os seus tópicos abrangem aspetos transversais de machine learning com aplicações em saúde e condução autónoma, detalhados em mais de 20 publicações com mais de 100 citações.

Ler apresentação completa

Sobre

Ricardo P. M. Cruz é Professor Auxiliar na Faculdade de Engenharia da Universidade do Porto e investigador no INESC TEC. O seu trabalho centra-se em machine learning, particularmente em deep learning e visão computacional. Licenciado em Ciência da Computação (2012), Mestrado em Engenharia Matemática (2015), ambos pela Universidade do Porto, e Doutorado em Informática (2021) pela Universidade do Porto, Aveiro e Minho. Os seus tópicos abrangem aspetos transversais de machine learning com aplicações em saúde e condução autónoma, detalhados em mais de 20 publicações com mais de 100 citações.

Tópicos
de interesse

Detalhes

Nome
Ricardo Pereira Cruz
Cargo
Investigador Colaborador Externo
Desde
01 outubro 2013

Nacionalidade
Portugal
Centro
Centro de Telecomunicações e Multimédia
Contactos
+351222094299
ricardo.p.cruz@inesctec.pt

001

Publicações

Ler todas as publicações

2025

CNN explanation methods for ordinal regression tasks

Autores
Barbero-Gómez, J; Cruz, RPM; Cardoso, JS; Gutiérrez, PA; Hervás-Martínez, C;

Publicação
NEUROCOMPUTING

Abstract
The use of Convolutional Neural Network (CNN) models for image classification tasks has gained significant popularity. However, the lack of interpretability in CNN models poses challenges for debugging and validation. To address this issue, various explanation methods have been developed to provide insights into CNN models. This paper focuses on the validity of these explanation methods for ordinal regression tasks, where the classes have a predefined order relationship. Different modifications are proposed for two explanation methods to exploit the ordinal relationships between classes: Grad-CAM based on Ordinal Binary Decomposition (GradOBDCAM) and Ordinal Information Bottleneck Analysis (OIBA). The performance of these modified methods is compared to existing popular alternatives. Experimental results demonstrate that GradOBD-CAM outperforms other methods in terms of interpretability for three out of four datasets, while OIBA achieves superior performance compared to IBA.

FecharLer Abstract

2025

Learning Ordinality in Semantic Segmentation

Autores
Cruz, RPM; Cristino, R; Cardoso, JS;

Publicação
IEEE ACCESS

Abstract
Semantic segmentation consists of predicting a semantic label for each image pixel. While existing deep learning approaches achieve high accuracy, they often overlook the ordinal relationships between classes, which can provide critical domain knowledge (e.g., the pupil lies within the iris, and lane markings are part of the road). This paper introduces novel methods for spatial ordinal segmentation that explicitly incorporate these inter-class dependencies. By treating each pixel as part of a structured image space rather than as an independent observation, we propose two regularization terms and a new metric to enforce ordinal consistency between neighboring pixels. Two loss regularization terms and one metric are proposed for structural ordinal segmentation, which penalizes predictions of non-ordinal adjacent classes. Five biomedical datasets and multiple configurations of autonomous driving datasets demonstrate the efficacy of the proposed methods. Our approach achieves improvements in ordinal metrics and enhances generalization, with up to a 15.7% relative increase in the Dice coefficient. Importantly, these benefits come without additional inference time costs. This work highlights the significance of spatial ordinal relationships in semantic segmentation and provides a foundation for further exploration in structured image representations.

FecharLer Abstract

2024

Active Supervision: Human in the Loop

Autores
Cruz, RPM; Shihavuddin, ASM; Maruf, MH; Cardoso, JS;

Publicação
PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2023, PT I

Abstract
After the learning process, certain types of images may not be modeled correctly because they were not well represented in the training set. These failures can then be compensated for by collecting more images from the real-world and incorporating them into the learning process - an expensive process known as active learning. The proposed twist, called active supervision, uses the model itself to change the existing images in the direction where the boundary is less defined and requests feedback from the user on how the new image should be labeled. Experiments in the context of class imbalance show the technique is able to increase model performance in rare classes. Active human supervision helps provide crucial information to the model during training that the training set lacks.

FecharLer Abstract

2024

YOLOMM - You Only Look Once for Multi-modal Multi-tasking

Autores
Campos, F; Cerqueira, FG; Cruz, RPM; Cardoso, JS;

Publicação
PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2023, PT I

Abstract
Autonomous driving can reduce the number of road accidents due to human error and result in safer roads. One important part of the system is the perception unit, which provides information about the environment surrounding the car. Currently, most manufacturers are using not only RGB cameras, which are passive sensors that capture light already in the environment but also Lidar. This sensor actively emits laser pulses to a surface or object and measures reflection and time-of-flight. Previous work, YOLOP, already proposed a model for object detection and semantic segmentation, but only using RGB. This work extends it for Lidar and evaluates performance on KITTI, a public autonomous driving dataset. The implementation shows improved precision across all objects of different sizes. The implementation is entirely made available: https://github.com/filipepcampos/yolomm.

FecharLer Abstract

2024

Condition Invariance for Autonomous Driving by Adversarial Learning

Autores
Silva, DTE; Cruz, RPM;

Publicação
PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2023, PT I

Abstract
Object detection is a crucial task in autonomous driving, where domain shift between the training and the test set is one of the main reasons behind the poor performance of a detector when deployed. Some erroneous priors may be learned from the training set, therefore a model must be invariant to conditions that might promote such priors. To tackle this problem, we propose an adversarial learning framework consisting of an encoder, an object-detector, and a condition-classifier. The encoder is trained to deceive the condition-classifier and aid the object-detector as much as possible throughout the learning stage, in order to obtain highly discriminative features. Experiments showed that this framework is not very competitive regarding the trade-off between precision and recall, but it does improve the ability of the model to detect smaller objects and some object classes.

FecharLer Abstract

Teses
supervisionadas

Ricardo Pereira Cruz

Sobre

Detalhes

Nome

Cargo

Desde

Nacionalidade

Centro

Contactos

CLARE

CNN explanation methods for ordinal regression tasks

Learning Ordinality in Semantic Segmentation

Active Supervision: Human in the Loop

YOLOMM - You Only Look Once for Multi-modal Multi-tasking

Condition Invariance for Autonomous Driving by Adversarial Learning

Uncertainty-Driven Out-of-Distribution Detection in 3D LiDAR Object Detection for Autonomous Driving

Introducing Domain Knowledge to Scene Parsing in Autonomous Driving