Publications

Publications by CTM

2022

Wi-Fi Rate Adaptation using a Simple Deep Reinforcement Learning Approach

Authors
Queirós, R; Almeida, EN; Fontes, H; Ruela, J; Campos, R;

Publication
CoRR

Abstract

2022

Boosting color similarity decisions using the CIEDE2000_PF Metric

Authors
Pereira, A; Carvalho, P; Corte Real, L;

Publication
SIGNAL IMAGE AND VIDEO PROCESSING

Abstract
Color comparison is a key aspect in many areas of application, including industrial applications, and different metrics have been proposed. In many applications, this comparison is required to be closely related to human perception of color differences, thus adding complexity to the process. To tackle this, different approaches were proposed through the years, culminating in the CIEDE2000 formulation. In our previous work, we showed that simple color properties could be used to reduce the computational time of a color similarity decision process that employed this metric, which is recognized as having high computational complexity. In this paper, we show mathematically and experimentally that these findings can be adapted and extended to the recently proposed CIEDE2000 PF metric, which has been recommended by the CIE for industrial applications. Moreover, we propose new efficient models that not only achieve lower error rates, but also outperform the results obtained for the CIEDE2000 metric.

2022

Substrate Integrated Waveguide Cavity Backed Slot Antennas for Millimeter-Wave Applications

Authors
Finich, S; Salgado, HM; Pinho, P;

Publication
2022 16TH EUROPEAN CONFERENCE ON ANTENNAS AND PROPAGATION (EUCAP)

Abstract
A low-cost single-layer substrate-integrated waveguide (SIW) cavity-backed slot antenna is proposed for millimeter-wave applications. The structure is designed to operate in the W-band. The T-shaped slot antenna is placed on the back side of the SIW and fed by a grounded coplanar waveguide (GCPW) transmission line. A transition between the GCPW and the SIW is also designed. Simulated results show that the antenna has a stable gain over the 98.79-100.56 GHz frequency range, with a maximum value of around 6 dBi, as well as high radiation efficiency.

2022

A Gaussian Window for Interference Mitigation in Ka-band Digital Beamforming Systems

Authors
Tavares, JS; Avelar, HH; Salgado, HM; Pessoa, LM;

Publication
2022 13th International Symposium on Communication Systems, Networks and Digital Signal Processing, CSNDSP 2022

Abstract
This paper proposes the use of a Gaussian window on the array factor as an interference mitigation method, aiming to avoid the computational complexity of the MVDR algorithm at the cost of a slight performance reduction. We show that by optimizing the parameters of the Gaussian window, it is possible to effectively mitigate the interfering signal if it is received within a certain angular range from the desired signal, while still being effective beyond that range. Finally, we show that the effectiveness of this approach is maintained across the full frequency reception range of the Ka-band, and confirm its validity using 8 × 8 and 16 × 16 array sizes. © 2022 IEEE.

2022

Photo2Video: Semantic-Aware Deep Learning-Based Video Generation from Still Content

Authors
Viana, P; Andrade, MT; Carvalho, P; Vilaca, L; Teixeira, IN; Costa, T; Jonker, P;

Publication
JOURNAL OF IMAGING

Abstract
Applying machine learning (ML), and especially deep learning, to understand visual content is becoming common practice in many application areas. However, little attention has been given to its use within the multimedia creative domain. It is true that ML is already popular for content creation, but the progress achieved so far essentially addresses textual content or the identification and selection of specific types of content. A wealth of possibilities is yet to be explored by bringing the use of ML into the multimedia creative process, allowing the knowledge inferred by the former to automatically influence how new multimedia content is created. The work presented in this article contributes towards this goal in three distinct ways: firstly, it proposes a methodology to re-train popular neural network models to identify new thematic concepts in static visual content and attach meaningful annotations to the detected regions of interest; secondly, it presents varied visual digital effects and corresponding tools that can be automatically called upon to apply such effects to a previously analyzed photo; thirdly, it defines a complete automated creative workflow, from the acquisition of a photograph and corresponding contextual data, through the ML region-based annotation, to the automatic application of digital effects and generation of a semantically aware multimedia story driven by the previously derived situational and visual contextual data. Additionally, it presents a variant of this automated workflow that offers the user the possibility of manipulating the automatic annotations in an assisted manner. The final aim is to transform a static digital photo into a short video clip, taking into account the information acquired. The final result strongly contrasts with current standard approaches of creating random movements, by implementing an intelligent content- and context-aware video.

2022

Improving word embeddings in Portuguese: increasing accuracy while reducing the size of the corpus

Authors
Pinto, JP; Viana, P; Teixeira, I; Andrade, M;

Publication
PEERJ COMPUTER SCIENCE

Abstract
The subjectiveness of multimedia content description has a strong negative impact on tag-based information retrieval. In our work, we propose enhancing available descriptions by adding semantically related tags. To achieve this objective, we use a word embedding technique based on the Word2Vec neural network, parameterized and trained using a new dataset built from online newspapers. A large number of news stories were scraped and pre-processed to build this dataset. Our target language is Portuguese, one of the most spoken languages worldwide. The results achieved significantly outperform similar existing solutions developed in the scope of different languages, including Portuguese. Contributions also include an online application and API available for external use. Although the presented work has been designed to enhance multimedia content annotation, it can be used in several other application areas.
