Cookies
O website necessita de alguns cookies e outros recursos semelhantes para funcionar. Caso o permita, o INESC TEC irá utilizar cookies para recolher dados sobre as suas visitas, contribuindo, assim, para estatísticas agregadas que permitem melhorar o nosso serviço. Ver mais
Aceitar Rejeitar
  • Menu
Publicações

Publicações por Jaime Cardoso

2019

Importance of subject-dependent classification and imbalanced distributions in driver sleepiness detection in realistic conditions

Autores
Silveira, CS; Cardoso, JS; Lourenco, AL; Ahlstrom, C;

Publicação
IET INTELLIGENT TRANSPORT SYSTEMS

Abstract
The first in-depth study on the use of electrocardiogram and electrooculogram for subject-dependent classification in driver sleepiness/fatigue under realistic driving conditions is presented in this work. Since acquisitions in simulated environments may be misleading for sleepiness assessment, performing studies on road are required. For that purpose, the authors present a database resulting from a field driving study performed in the SleepEye project. Based on previous research, supervised machine learning methods are implemented and applied to 16 heart- and 25 eye-based extracted features, mostly related to heart rate variability and blink events, respectively, in order to study the influence of subject dependency in sleepiness classification, using different classifiers and dealing with imbalanced class distributions. Results showed a significantly worse performance in subject-independent classification: a decrease of similar to 40 and 20% in the detection rate of the 'sleepy' class for two and three classes, respectively. Since physiological signals are the ones that present the most individual characteristics, a subject-independent classification can be even harder to perform. Transfer learning techniques and methods for imbalanced distributions are promising approaches and need further investigation.

2019

Directional Support Vector Machines

Autores
Pernes, D; Fernande, K; Cardoso, JS;

Publicação
APPLIED SCIENCES-BASEL

Abstract
Several phenomena are represented by directional-angular or periodic-data; from time references on the calendar to geographical coordinates. These values are usually represented as real values restricted to a given range (e.g., [0, 2 pi)), hiding the real nature of this information. In order to handle these variables properly in supervised classification tasks, alternatives to the naive Bayes classifier and logistic regression were proposed in the past. In this work, we propose directional-aware support vector machines. We address several realizations of the proposed models, studying their kernelized counterparts and their expressiveness. Finally, we validate the performance of the proposed Support Vector Machines (SVMs) against the directional naive Bayes and directional logistic regression with real data, obtaining competitive results.

2019

The Challenges of Applying Deep Learning for Hemangioma Lesion Segmentation

Autores
Alves, PG; Cardoso, JS; Do Bom Sucesso, M;

Publicação
Proceedings - European Workshop on Visual Information Processing, EUVIP

Abstract
Infantile Hemangiomas (IH) make up the most common type of benign vascular tumors affecting children. They can grow for several months until beginning to involute. In present-day clinical practice there's no objective monitoring protocol. For more objective measures, an automatic evaluation system (CAD system) is needed to aid clinicians in assessing the effectiveness of a given patient's response to a treatment. One of the stages of these systems is the lesion segmentation. This work addresses the automatic segmentation of lesions in IH. Acknowledging that the methods in the literature for IH lesion segmentation lag behind the state-of-the-art in the image segmentation community, we conduct a comparison of various methodologies for the segmentation of the IH, including both shallow and deep methodologies. Acknowledging the lack of data in the field for a robust learning of deep models, we also evaluate transfer learning techniques to benefit from knowledge extracted in other skin lesions. The best results were obtained with the shortest path method and a multiscale convolutional neural network that merges two pipelines working at different scales. Although promising, the results put in evidence the need for better databases, collected under suitable acquisition protocols. © 2018 IEEE.

2019

Driver drowsiness detection: A comparison between intrusive and non-intrusive signal acquisition methods

Autores
Oliveira, L; Cardoso, JS; Lourenço, A; Ahlström, C;

Publicação
Proceedings - European Workshop on Visual Information Processing, EUVIP

Abstract
Driver drowsiness is a major cause of road accidents, many of which result in fatalities. A solution to this problem is the inclusion of a drowsiness detector in vehicles to alert the driver if sleepiness is detected. To detect drowsiness, physiologic, behavioral (visual) and vehicle-based methods can be used, however, only measures that can be acquired non-intrusively are viable in a real life application. This work uses data from a real-road experiment with sleep deprived drivers to compare the performance of driver drowsiness detection using intrusive acquisition methods, namely electrooculogram (EOG), with camera-based, non-intrusive, methods. A hybrid strategy, combining the described methods with electrocardiogram (ECG) measures, is also evaluated. Overall, the obtained results show that drowsiness detection performance is similar using non-intrusive camera-based measures or intrusive EOG measures. The detection performance increases when combining two methods (ECG + visual) or (ECG + EOG). © 2018 IEEE.

2019

Sparse Multi-Bending Snakes

Autores
Araujo, RJ; Fernandes, K; Cardoso, JS;

Publicação
IEEE TRANSACTIONS ON IMAGE PROCESSING

Abstract
Active contour models are one of the most emblematic algorithms of computer vision. Their strong theoretical foundations and high user interoperahility turned them into a reference approach for object segmentation and tracking tasks. A high number of modifications have already been proposed in order to overcome the known problems of traditional snakes, such as initialization dependence and poor convergence to concavities. In this paper, we address the scenario where the user wants to segment an object that has multiple dynamic regions but some of them do not correspond to the true object boundary. We propose a novel parametric active contour model, the Sparse Multi-Bending snake, which is capable of dividing the contour into a set of contiguous regions with different bending properties. We derive a new energy function that induces such behavior and presents a group optimization strategy that can be used to find the optimal bending resistance parameter for each point of the contour. We show the flexibility of our model in a set of synthetic images. In addition, we consider two real applications, lung segmentation in Computerized Tomography data and hand segmentation in depth images. We show how the proposed method is able to improve the segmentations obtained in both applications, when compared with other active contour models.

2019

On the role of multimodal learning in the recognition of sign language

Autores
Ferreira, PM; Cardoso, JS; Rebelo, A;

Publicação
MULTIMEDIA TOOLS AND APPLICATIONS

Abstract
Sign Language Recognition (SLR) has become one of the most important research areas in the field of human computer interaction. SLR systems are meant to automatically translate sign language into text or speech, in order to reduce the communicational gap between deaf and hearing people. The aim of this paper is to exploit multimodal learning techniques for an accurate SLR, making use of data provided by Kinect and Leap Motion. In this regard, single-modality approaches as well as different multimodal methods, mainly based on convolutional neural networks, are proposed. Our main contribution is a novel multimodal end-to-end neural network that explicitly models private feature representations that are specific to each modality and shared feature representations that are similar between modalities. By imposing such regularization in the learning process, the underlying idea is to increase the discriminative ability of the learned features and, hence, improve the generalization capability of the model. Experimental results demonstrate that multimodal learning yields an overall improvement in the sign recognition performance. In particular, the novel neural network architecture outperforms the current state-of-the-art methods for the SLR task.

  • 18
  • 59