Cookies
O website necessita de alguns cookies e outros recursos semelhantes para funcionar. Caso o permita, o INESC TEC irá utilizar cookies para recolher dados sobre as suas visitas, contribuindo, assim, para estatísticas agregadas que permitem melhorar o nosso serviço. Ver mais
Aceitar Rejeitar
  • Menu
Publicações

Publicações por CTM

2018

Physiological Inspired Deep Neural Networks for Emotion Recognition

Autores
Ferreira, PM; Marques, F; Cardoso, JS; Rebelo, A;

Publicação
IEEE ACCESS

Abstract
Facial expression recognition (FER) is currently one of the most active research topics due to its wide range of applications in the human-computer interaction field. An important part of the recent success of automatic FER was achieved thanks to the emergence of deep learning approaches. However, training deep networks for FER is still a very challenging task, since most of the available FER data sets are relatively small. Although transfer learning can partially alleviate the issue, the performance of deep models is still below of its full potential as deep features may contain redundant information from the pre-trained domain. Instead, we propose a novel end-to-end neural network architecture along with a well-designed loss function based on the strong prior knowledge that facial expressions are the result of the motions of some facial muscles and components. The loss function is defined to regularize the entire learning process so that the proposed neural network is able to explicitly learn expression-specific features. Experimental results demonstrate the effectiveness of the proposed model in both lab-controlled and wild environments. In particular, the proposed neural network provides quite promising results, outperforming in most cases the current state-of-the-art methods.

2018

Towards Complementary Explanations Using Deep Neural Networks

Autores
Silva, W; Fernandes, K; Cardoso, MJ; Cardoso, JS;

Publicação
Understanding and Interpreting Machine Learning in Medical Image Computing Applications - First International Workshops MLCN 2018, DLF 2018, and iMIMIC 2018, Held in Conjunction with MICCAI 2018, Granada, Spain, September 16-20, 2018, Proceedings

Abstract
Interpretability is a fundamental property for the acceptance of machine learning models in highly regulated areas. Recently, deep neural networks gained the attention of the scientific community due to their high accuracy in vast classification problems. However, they are still seen as black-box models where it is hard to understand the reasons for the labels that they generate. This paper proposes a deep model with monotonic constraints that generates complementary explanations for its decisions both in terms of style and depth. Furthermore, an objective framework for the evaluation of the explanations is presented. Our method is tested on two biomedical datasets and demonstrates an improvement in relation to traditional models in terms of quality of the explanations generated. © Springer Nature Switzerland AG 2018.

2018

Elastic deformations for data augmentation in breast cancer mass detection

Autores
Castro, E; Cardoso, JS; Pereira, JC;

Publicação
2018 IEEE EMBS International Conference on Biomedical & Health Informatics, BHI 2018, Las Vegas, NV, USA, March 4-7, 2018

Abstract
Two limitations hamper performance of deep architectures for classification and/or detection in medical imaging: (i) the small amount of available data, and (ii) the class imbalance scenario. While millions of labeled images are available today to build classification tools for natural scenes, the amount of available annotated data for automatic breast cancer screening is limited to a few thousand images, at best. We address these limitations with a method for data augmentation, based on the introduction of random elastic deformations on images of mammograms. We validate this method on three publicly available datasets. Our proposed Convolutional Neural Network (CNN) archi-tecture is trained for mass classification - in a conventional way - , and then used in the more interesting problem of mass detection in full mammograms by transforming the CNN into a Fully Convolutional Network (FCN). © 2018 IEEE.

2018

Multimodal Hierarchical Face Recognition using Information from 2.5D Images

Autores
Monteiro, JC; Freitas, T; Cardoso, JS;

Publicação
U.Porto Journal of Engineering

Abstract
Facial recognition under uncontrolled acquisition environments faces major challenges that limit the deployment of real-life systems. The use of 2.5D information can be used to improve discriminative power of such systems in conditions where RGB information alone would fail. In this paper we propose a multimodal extension of a previous work, based on SIFT descriptors of RGB images, integrated with LBP information obtained from depth scans, modeled by an hierarchical framework motivated by principles of human cognition. The framework was tested on EURECOM dataset and proved that the inclusion of depth information improved significantly the results in all the tested conditions, compared to independent unimodal approaches.

2018

The Challenges of Applying Deep Learning for Hemangioma Lesion Segmentation

Autores
Alves, PG; Cardoso, JS; do Bom Sucesso, M;

Publicação
PROCEEDINGS OF THE 2018 7TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP)

Abstract
Infantile Hemangiomas (IH) make up the most common type of benign vascular tumors affecting children. They can grow for several months until beginning to involute. In present-day clinical practice there's no objective monitoring protocol. For more objective measures, an automatic evaluation system (CAD system) is needed to aid clinicians in assessing the effectiveness of a given patient's response to a treatment. One of the stages of these systems is the lesion segmentation. This work addresses the automatic segmentation of lesions in IH. Acknowledging that the methods in the literature for IH lesion segmentation lag behind the state-of-the-art in the image segmentation community, we conduct a comparison of various methodologies for the segmentation of the IH, including both shallow and deep methodologies. Acknowledging the lack of data in the field for a robust learning of deep models, we also evaluate transfer learning techniques to benefit from knowledge extracted in other skin lesions. The best results were obtained with the shortest path method and a multiscale convolutional neural network that merges two pipelines working at different scales. Although promising, the results put in evidence the need for better databases, collected under suitable acquisition protocols.

2018

Driver drowsiness detection: a comparison between intrusive and non-intrusive signal acquisition methods

Autores
Oliveira, L; Cardoso, JS; Lourenco, A; Ahlstrom, C;

Publicação
PROCEEDINGS OF THE 2018 7TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP)

Abstract
Driver drowsiness is a major cause of road accidents, many of which result in fatalities. A solution to this problem is the inclusion of a drowsiness detector in vehicles to alert the driver if sleepiness is detected. To detect drowsiness, physiologic, behavioral (visual) and vehicle-based methods can be used, however, only measures that can be acquired non-intrusively are viable in a real life application. This work uses data from a real-road experiment with sleep deprived drivers to compare the performance of driver drowsiness detection using intrusive acquisition methods, namely electrooculogram (EOG), with camera based, non-intrusive, methods. A hybrid strategy, combining the described methods with electrocardiogram (ECG) measures, is also evaluated. Overall, the obtained results show that drowsiness detection performance is similar using non-intrusive camera based measures or intrusive EOG measures. The detection performance increases when combining two methods (ECG + visual) or (ECG + EOG).

  • 128
  • 324