Cookies
O website necessita de alguns cookies e outros recursos semelhantes para funcionar. Caso o permita, o INESC TEC irá utilizar cookies para recolher dados sobre as suas visitas, contribuindo, assim, para estatísticas agregadas que permitem melhorar o nosso serviço. Ver mais
Aceitar Rejeitar
  • Menu
Sobre

Sobre

Sou Professora Coordenadora no Politécnico do Porto e Investigadora no INESC TEC, no Centro de Telecomunicações e Multimédia, onde lidero a área de Tecnologias de Comunicação Multimédia. Tenho um Doutoramento em Engenharia Electrotécnica e de Computadores pela Universidade do Porto, com um foco na àrea da Gestão de Conteúdos Audiovisuais. Enquanto investigadora do INESC TEC, tenho sido responsável por diversos projectos Europeus e Nacionais, envolvendo parceiros da área da indústria, media e academia. Autora de diversas publicações, sou também revisora activa de artigos submetidos a conferências e revistas, membro de comissões científicas e de organização de conferências. Recentemente, organizei a série de Workshops com o tema "Immersive Media Experiences" (2013-2015) na maior conferência na área de multimédia (ACM Multimedia). Participo frequentemente como perita da Comissão Europeia ou de organismos nacionais na avaliação de propostas de investigação. Os meus interesses de investigação centram-na na área dos sistema de comunicação multimedia, incluindo televisão e novos serviços, gestão de conteúdos, personalização e recomendação, novos formatos e conteúdos imersivos e interactivos.

Tópicos
de interesse
Detalhes

Detalhes

  • Nome

    Paula Viana
  • Cargo

    Responsável de Área
  • Desde

    01 janeiro 1993
020
Publicações

2024

A Machine Learning App for Monitoring Physical Therapy at Home

Autores
Pereira, B; Cunha, B; Viana, P; Lopes, M; Melo, ASC; Sousa, ASP;

Publicação
SENSORS

Abstract
Shoulder rehabilitation is a process that requires physical therapy sessions to recover the mobility of the affected limbs. However, these sessions are often limited by the availability and cost of specialized technicians, as well as the patient's travel to the session locations. This paper presents a novel smartphone-based approach using a pose estimation algorithm to evaluate the quality of the movements and provide feedback, allowing patients to perform autonomous recovery sessions. This paper reviews the state of the art in wearable devices and camera-based systems for human body detection and rehabilitation support and describes the system developed, which uses MediaPipe to extract the coordinates of 33 key points on the patient's body and compares them with reference videos made by professional physiotherapists using cosine similarity and dynamic time warping. This paper also presents a clinical study that uses QTM, an optoelectronic system for motion capture, to validate the methods used by the smartphone application. The results show that there are statistically significant differences between the three methods for different exercises, highlighting the importance of selecting an appropriate method for specific exercises. This paper discusses the implications and limitations of the findings and suggests directions for future research.

2024

Improving Efficiency in Facial Recognition Tasks Through a Dataset Optimization Approach

Autores
Vilça, L; Viana, P; Carvalho, P; Andrade, MT;

Publicação
IEEE ACCESS

Abstract
It is well known that the performance of Machine Learning techniques, notably when applied to Computer Vision (CV), depends heavily on the amount and quality of the training data set. However, large data sets lead to time-consuming training loops and, in many situations, are difficult or even impossible to create. Therefore, there is a need for solutions to reduce their size while ensuring good levels of performance, i.e., solutions that obtain the best tradeoff between the amount/quality of training data and the model's performance. This paper proposes a dataset reduction approach for training data used in Deep Learning methods in Facial Recognition (FR) problems. We focus on maximizing the variability of representations for each subject (person) in the training data, thus favoring quality instead of size. The main research questions are: 1) Which facial features better discriminate different identities? 2) Will it be possible to significantly reduce the training time without compromising performance? 3) Should we favor quality over quantity for very large datasets in FR? This analysis uses a pipeline to discriminate a set of features suitable for capturing the diversity and a cluster-based sampling to select the best images for each training subject, i.e., person. Results were obtained using VGGFace2 and Labeled Faces in the Wild (for benchmarking) and show that, with the proposed approach, a data reduction is possible while ensuring similar levels of accuracy.

2024

Movie trailer genre classification using multimodal pretrained features

Autores
Sulun, S; Viana, P; Davies, MEP;

Publicação
EXPERT SYSTEMS WITH APPLICATIONS

Abstract
We introduce a novel method for movie genre classification, capitalizing on a diverse set of readily accessible pretrained models. These models extract high-level features related to visual scenery, objects, characters, text, speech, music, and audio effects. To intelligently fuse these pretrained features, we train small classifier models with low time and memory requirements. Employing the transformer model, our approach utilizes all video and audio frames of movie trailers without performing any temporal pooling, efficiently exploiting the correspondence between all elements, as opposed to the fixed and low number of frames typically used by traditional methods. Our approach fuses features originating from different tasks and modalities, with different dimensionalities, different temporal lengths, and complex dependencies as opposed to current approaches. Our method outperforms state-of-the-art movie genre classification models in terms of precision, recall, and mean average precision (mAP). To foster future research, we make the pretrained features for the entire MovieNet dataset, along with our genre classification code and the trained models, publicly available.

2023

A Review of Recent Advances and Challenges in Grocery Label Detection and Recognition

Autores
Guimaraes, V; Nascimento, J; Viana, P; Carvalho, P;

Publicação
APPLIED SCIENCES-BASEL

Abstract
When compared with traditional local shops where the customer has a personalised service, in large retail departments, the client has to make his purchase decisions independently, mostly supported by the information available in the package. Additionally, people are becoming more aware of the importance of the food ingredients and demanding about the type of products they buy and the information provided in the package, despite it often being hard to interpret. Big shops such as supermarkets have also introduced important challenges for the retailer due to the large number of different products in the store, heterogeneous affluence and the daily needs of item repositioning. In this scenario, the automatic detection and recognition of products on the shelves or off the shelves has gained increased interest as the application of these technologies may improve the shopping experience through self-assisted shopping apps and autonomous shopping, or even benefit stock management with real-time inventory, automatic shelf monitoring and product tracking. These solutions can also have an important impact on customers with visual impairments. Despite recent developments in computer vision, automatic grocery product recognition is still very challenging, with most works focusing on the detection or recognition of a small number of products, often under controlled conditions. This paper discusses the challenges related to this problem and presents a review of proposed methods for retail product label processing, with a special focus on assisted analysis for customer support, including for the visually impaired. Moreover, it details the public datasets used in this topic and identifies their limitations, and discusses future research directions of related fields.

2023

A Dataset for User Visual Behaviour with Multi-View Video Content

Autores
da Costa, TS; Andrade, MT; Viana, P; Silva, NC;

Publicação
PROCEEDINGS OF THE 2023 PROCEEDINGS OF THE 14TH ACM MULTIMEDIA SYSTEMS CONFERENCE, MMSYS 2023

Abstract
Immersive video applications impose unpractical bandwidth requirements for best-effort networks. With Multi-View(MV) streaming, these can be minimized by resorting to view prediction techniques. SmoothMV is a multi-view system that uses a non-intrusive head tracking mechanism to detect the viewer's interest and select appropriate views. By coupling Neural Networks (NNs) to anticipate the viewer's interest, a reduction of view-switching latency is likely to be obtained. The objective of this paper is twofold: 1) Present a solution for acquisition of gaze data from users when viewing MV content; 2) Describe a dataset, collected with a large-scale testbed, capable of being used to train NNs to predict the user's viewing interest. Tracking data from head movements was obtained from 45 participants using an Intel Realsense F200 camera, with 7 video playlists, each being viewed a minimum of 17 times. This dataset is publicly available to the research community and constitutes an important contribution to reducing the current scarcity of such data. Tools to obtain saliency/heat maps and generate complementary plots are also provided as an open-source software package.

Teses
supervisionadas

2023

Enhancing Indoor Localisation: a Bluetooth Low Energy (BLE) Beacon Placement approach

Autor
JOÃO PEDRO DA SILVA DIAS

Instituição
IPP-ISEP

2023

Solução de Mobilidade numa Cidade Inteligente: Um Sistema de Informação ao Público em Tempo-real

Autor
RODRIGO TEIXEIRA GUILHERME AGUIAR RODRIGUES

Instituição
IPP-ISEP

2023

Image Processing of Grocery Labels for Assisted Analysis

Autor
Jéssica Mireie Fernandes do Nascimento

Instituição
IPP-ISEP

2023

Deteção de Veículos Industriais e Pedestres em armazéns utilizando YOLOv3

Autor
EDUARDO DA SILVA MIRANDA

Instituição
IPP-ISEP

2023

BatEval - Study on different battery technologies for IoT

Autor
AFONSO SERRA DUQUE

Instituição
IPP-ISEP