Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
Publications

Publications by CTM

2019

Directional Support Vector Machines

Authors
Pernes, D; Fernande, K; Cardoso, JS;

Publication
APPLIED SCIENCES-BASEL

Abstract
Several phenomena are represented by directional-angular or periodic-data; from time references on the calendar to geographical coordinates. These values are usually represented as real values restricted to a given range (e.g., [0, 2 pi)), hiding the real nature of this information. In order to handle these variables properly in supervised classification tasks, alternatives to the naive Bayes classifier and logistic regression were proposed in the past. In this work, we propose directional-aware support vector machines. We address several realizations of the proposed models, studying their kernelized counterparts and their expressiveness. Finally, we validate the performance of the proposed Support Vector Machines (SVMs) against the directional naive Bayes and directional logistic regression with real data, obtaining competitive results.

2019

The Challenges of Applying Deep Learning for Hemangioma Lesion Segmentation

Authors
Alves, PG; Cardoso, JS; Do Bom Sucesso, M;

Publication
Proceedings - European Workshop on Visual Information Processing, EUVIP

Abstract
Infantile Hemangiomas (IH) make up the most common type of benign vascular tumors affecting children. They can grow for several months until beginning to involute. In present-day clinical practice there's no objective monitoring protocol. For more objective measures, an automatic evaluation system (CAD system) is needed to aid clinicians in assessing the effectiveness of a given patient's response to a treatment. One of the stages of these systems is the lesion segmentation. This work addresses the automatic segmentation of lesions in IH. Acknowledging that the methods in the literature for IH lesion segmentation lag behind the state-of-the-art in the image segmentation community, we conduct a comparison of various methodologies for the segmentation of the IH, including both shallow and deep methodologies. Acknowledging the lack of data in the field for a robust learning of deep models, we also evaluate transfer learning techniques to benefit from knowledge extracted in other skin lesions. The best results were obtained with the shortest path method and a multiscale convolutional neural network that merges two pipelines working at different scales. Although promising, the results put in evidence the need for better databases, collected under suitable acquisition protocols. © 2018 IEEE.

2019

Driver drowsiness detection: A comparison between intrusive and non-intrusive signal acquisition methods

Authors
Oliveira, L; Cardoso, JS; Lourenço, A; Ahlström, C;

Publication
Proceedings - European Workshop on Visual Information Processing, EUVIP

Abstract
Driver drowsiness is a major cause of road accidents, many of which result in fatalities. A solution to this problem is the inclusion of a drowsiness detector in vehicles to alert the driver if sleepiness is detected. To detect drowsiness, physiologic, behavioral (visual) and vehicle-based methods can be used, however, only measures that can be acquired non-intrusively are viable in a real life application. This work uses data from a real-road experiment with sleep deprived drivers to compare the performance of driver drowsiness detection using intrusive acquisition methods, namely electrooculogram (EOG), with camera-based, non-intrusive, methods. A hybrid strategy, combining the described methods with electrocardiogram (ECG) measures, is also evaluated. Overall, the obtained results show that drowsiness detection performance is similar using non-intrusive camera-based measures or intrusive EOG measures. The detection performance increases when combining two methods (ECG + visual) or (ECG + EOG). © 2018 IEEE.

2019

Sparse Multi-Bending Snakes

Authors
Araujo, RJ; Fernandes, K; Cardoso, JS;

Publication
IEEE TRANSACTIONS ON IMAGE PROCESSING

Abstract
Active contour models are one of the most emblematic algorithms of computer vision. Their strong theoretical foundations and high user interoperahility turned them into a reference approach for object segmentation and tracking tasks. A high number of modifications have already been proposed in order to overcome the known problems of traditional snakes, such as initialization dependence and poor convergence to concavities. In this paper, we address the scenario where the user wants to segment an object that has multiple dynamic regions but some of them do not correspond to the true object boundary. We propose a novel parametric active contour model, the Sparse Multi-Bending snake, which is capable of dividing the contour into a set of contiguous regions with different bending properties. We derive a new energy function that induces such behavior and presents a group optimization strategy that can be used to find the optimal bending resistance parameter for each point of the contour. We show the flexibility of our model in a set of synthetic images. In addition, we consider two real applications, lung segmentation in Computerized Tomography data and hand segmentation in depth images. We show how the proposed method is able to improve the segmentations obtained in both applications, when compared with other active contour models.

2019

On the role of multimodal learning in the recognition of sign language

Authors
Ferreira, PM; Cardoso, JS; Rebelo, A;

Publication
MULTIMEDIA TOOLS AND APPLICATIONS

Abstract
Sign Language Recognition (SLR) has become one of the most important research areas in the field of human computer interaction. SLR systems are meant to automatically translate sign language into text or speech, in order to reduce the communicational gap between deaf and hearing people. The aim of this paper is to exploit multimodal learning techniques for an accurate SLR, making use of data provided by Kinect and Leap Motion. In this regard, single-modality approaches as well as different multimodal methods, mainly based on convolutional neural networks, are proposed. Our main contribution is a novel multimodal end-to-end neural network that explicitly models private feature representations that are specific to each modality and shared feature representations that are similar between modalities. By imposing such regularization in the learning process, the underlying idea is to increase the discriminative ability of the learned features and, hence, improve the generalization capability of the model. Experimental results demonstrate that multimodal learning yields an overall improvement in the sign recognition performance. In particular, the novel neural network architecture outperforms the current state-of-the-art methods for the SLR task.

2019

Machine Learning Interpretability: A Survey on Methods and Metrics

Authors
Carvalho, DV; Pereira, EM; Cardoso, JS;

Publication
ELECTRONICS

Abstract
Machine learning systems are becoming increasingly ubiquitous. These systems's adoption has been expanding, accelerating the shift towards a more algorithmic society, meaning that algorithmically informed decisions have greater potential for significant social impact. However, most of these accurate decision support systems remain complex black boxes, meaning their internal logic and inner workings are hidden to the user and even experts cannot fully understand the rationale behind their predictions. Moreover, new regulations and highly regulated domains have made the audit and verifiability of decisions mandatory, increasing the demand for the ability to question, understand, and trust machine learning systems, for which interpretability is indispensable. The research community has recognized this interpretability problem and focused on developing both interpretable models and explanation methods over the past few years. However, the emergence of these methods shows there is no consensus on how to assess the explanation quality. Which are the most suitable metrics to assess the quality of an explanation? The aim of this article is to provide a review of the current state of the research field on machine learning interpretability while focusing on the societal impact and on the developed methods and metrics. Furthermore, a complete literature review is presented in order to identify future directions of work on this field.

  • 105
  • 324