Publicacoes - INESC TEC

Publicações

Publicações por Pedro Miguel Carvalho

2018

BMOG: boosted Gaussian Mixture Model with controlled complexity for background subtraction

Autores
Martins, I; Carvalho, P; Corte Real, L; Alba Castro, JL;

Publicação
PATTERN ANALYSIS AND APPLICATIONS

Abstract
Developing robust and universal methods for unsupervised segmentation of moving objects in video sequences has proved to be a hard and challenging task that has attracted the attention of many researchers over the last decades. State-of-the-art methods are, in general, computationally heavy preventing their use in real-time applications. This research addresses this problem by proposing a robust and computationally efficient method, coined BMOG, that significantly boosts the performance of a widely used method based on a Mixture of Gaussians. The proposed solution explores a novel classification mechanism that combines color space discrimination capabilities with hysteresis and a dynamic learning rate for background model update. The complexity of BMOG is kept low, proving its suitability for real-time applications. BMOG was objectively evaluated using the ChangeDetection.net 2014 benchmark. An exhaustive set of experiments was conducted, and a detailed analysis of the results, using two complementary types of metrics, revealed that BMOG achieves an excellent compromise in performance versus complexity.

FecharLer Abstract

2018

GymApp: a Real Time Physical Activity Trainner on Wearable Devices

Autores
Viana, P; Ferreira, T; Castro, L; Soares, M; Pinto, JP; Andrade, T; Carvalho, P;

Publicação
2018 11TH INTERNATIONAL CONFERENCE ON HUMAN SYSTEM INTERACTION (HSI)

Abstract
Technological advances are pushing into the mass market innovative wearable devices featuring increasing processing and sensing capacity, non-intrusiveness and ubiquitous use. Sensors built-in those devices, enable acquiring different types of data and by taking advantage of the available processing power, it is possible to run intelligent applications that process the sensed data to offer added -value to the user in multiple domains. Although not new to the modern society, it is unquestionable that the present exercise boom is rapidly spreading across all age groups. However, in a great majority of cases, people perform their physical activity on their own, either due to time or budget constraints and may easily get discouraged if they do not see results or perform exercises inadequately. This paper presents an application, running on a wearable device, aiming at operating as a personal trainer that validates a set of proposed exercises in a sports' session. The developed solution uses inertial sensors of an Android Wear smartwatch and, based on a set of pattern recognition algorithms, detects the rate of success in the execution of a planned workout. The fact that all processing can be executed on the device is a differentiator factor to other existing solutions.

FecharLer Abstract

2020

Efficient CIEDE2000-Based Color Similarity Decision for Computer Vision

Autores
Pereira, A; Carvalho, P; Coelho, G; Corte Real, L;

Publicação
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY

Abstract
Color and color differences are critical aspects in many image processing and computer vision applications. A paradigmatic example is object segmentation, where color distances can greatly influence the performance of the algorithms. Metrics for color difference have been proposed in the literature, including the definition of standards such as CIEDE2000, which quantifies the change in visual perception of two given colors. This standard has been recommended for industrial computer vision applications, but the benefits of its application have been impaired by the complexity of the formula. This paper proposes a new strategy that improves the usability of the CIEDE2000 metric when a maximum acceptable distance can be imposed. We argue that, for applications where a maximum value, above which colors are considered to be different, can be established, then it is possible to reduce the amount of calculations of the metric, by preemptively analyzing the color features. This methodology encompasses the benefits of the metric while overcoming its computational limitations, thus broadening the range of applications of CIEDE2000 in both the computer vision algorithms and computational resource requirements.

FecharLer Abstract Ler Publicação Completa

2019

Face Detection in Thermal Images with YOLOv3

Autores
Silva, G; Monteiro, R; Ferreira, A; Carvalho, P; Corte Real, L;

Publicação
ADVANCES IN VISUAL COMPUTING, ISVC 2019, PT II

Abstract
The automotive industry is currently focusing on automation in their vehicles, and perceiving the surroundings of an automobile requires the ability to detect and identify objects, events and persons, not only from the outside of the vehicle but also from the inside of the cabin. This constitutes relevant information for defining intelligent responses to events happening on both environments. This work presents a new method for in-vehicle monitoring of passengers, specifically the task of real-time face detection in thermal images, by applying transfer learning with YOLOv3. Using this kind of imagery for this purpose brings some advantages, such as the possibility of detecting faces during the day and in the dark without being affected by illumination conditions, and also because it's a completely passive sensing solution. Due to the lack of suitable datasets for this type of application, a database of in-vehicle images was created, containing images from 38 subjects performing different head poses and at varying ambient temperatures. The tests in our database show an AP50 of 99.7% and an AP of 78.5%.

FecharLer Abstract

2019

Stereo vision system for human motion analysis in a rehabilitation context

Autores
Matos, AC; Terroso, TA; Corte Real, L; Carvalho, P;

Publicação
COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION

Abstract
The present demographic trends point to an increase in aged population and chronic diseases which symptoms can be alleviated through rehabilitation. The applicability of passive 3D reconstruction for motion tracking in a rehabilitation context was explored using a stereo camera. The camera was used to acquire depth and color information from which the 3D position of predefined joints was recovered based on: kinematic relationships, anthropometrically feasible lengths and temporal consistency. Finally, a set of quantitative measures were extracted to evaluate the performed rehabilitation exercises. Validation study using data provided by a marker based as ground-truth revealed that our proposal achieved errors within the range of state-of-the-art active markerless systems and visual evaluations done by physical therapists. The obtained results are promising and demonstrate that the developed methodology allows the analysis of human motion for a rehabilitation purpose.

FecharLer Abstract

2018

Improving Audiovisual Content Annotation Through a Semi-automated Process Based on Deep Learning

Autores
Vilaça, L; Viana, P; Carvalho, P; Andrade, MT;

Publicação
Proceedings of the Tenth International Conference on Soft Computing and Pattern Recognition, SoCPaR 2018, Porto, Portugal, December 13-15, 2018

Abstract
Over the last years, Deep Learning has become one of the most popular research fields of Artificial Intelligence. Several approaches have been developed to address conventional challenges of AI. In computer vision, these methods provide the means to solve tasks like image classification, object identification and extraction of features. In this paper, some approaches to face detection and recognition are presented and analyzed, in order to identify the one with the best performance. The main objective is to automate the annotation of a large dataset and to avoid the costy and time-consuming process of content annotation. The approach follows the concept of incremental learning and a R-CNN model was implemented. Tests were conducted with the objective of detecting and recognizing one personality within image and video content. Results coming from this initial automatic process are then made available to an auxiliary tool that enables further validation of the annotations prior to uploading them to the archive. Tests show that, even with a small size dataset, the results obtained are satisfactory. © 2020, Springer Nature Switzerland AG.

FecharLer Abstract