Presentation

Telecommunications and Multimedia

At CTM, our vision is to promote a lively and sustainable world where networked intelligence enables ubiquitous interaction with sensory-rich content. Our mission is to develop advanced systems and technologies to enable high capacity, efficient, and secure communications, media knowledge extraction, and immersive ubiquitous multimedia applications.

We work in 4 main areas of research: Optical and Electronic Technologies, Wireless Networks, Multimedia and Communications Technologies, and VCMI (Visual Computing and Machine Intelligence).

Latest News

INESC TEC with five FCT exploratory projects approved in four R&D areas

Telecommunications and Multimedia, Applied Photonics, High-assurance Software and Advanced Computing Systems – these are the four domains that INESC TEC researchers will explore within the scope of the five projects that were approved through the Call for Exploratory Projects promoted by the Foundation for Science and Technology (FCT).

2nd October 2024

Artificial Intelligence

The first European project led by INESC TEC in the health domain is under way

It is called AI4Lungs and aims to develop computational tools and models based on Artificial Intelligence to optimise the diagnosis and treatment of lung diseases. Through a holistic and multimodal approach, the researchers will create a personalised healthcare solution for respiratory diseases. At the end of February, representatives of the project's 18 partner entities, from 10 countries, met at INESC TEC to mark the launch of AI4Lungs.

1st April 2024

Communications

Europe discusses collaboration opportunities in high-frequency wireless communications

Smart propagation environments, improvements in signal processing for the sixth generation of mobile communications, and 6G-centred network and location developments were some of the topics discussed at an event organised by the European projects TERRAMETA (coordinated by INESC TEC), 6G-SHINE and TIMES, in collaboration with RESTART-IN – an Italian PRR.

6th March 2024

Artificial Intelligence

INESC TEC researchers work on the first prototype that applies AI to colorectal diagnosis developed in Portugal

The work behind the first prototype that uses Artificial Intelligence (AI) for colorectal diagnosis was fully developed by Portuguese researchers from INESC TEC and the IMP Diagnostics Molecular & Anatomic Pathology laboratory; the work was featured in the renowned international scientific journal npj Precision Oncology (https://www.nature.com/articles/s41698-024-00539-4).

5th March 2024

INESC TEC researchers led discussion on wireless communications and computer vision at GLOBECOM

After almost one year, the CONVERGE project (coordinated by INESC TEC) has already shown relevant outcomes at one of the main conferences of the IEEE Communications Society, GLOBECOM (Malaysia), namely through the organisation of a panel. The panel, “Convergence of wireless communications and computer vision: a new paradigm created by the CONVERGE project”, sought to discuss the new opportunities and potential challenges associated with the use of tools that combine radio with computer vision.

23rd January 2024

194

Featured Projects

INESCTEC.OCEAN

Centre of Excellence in Ocean Research and Engineering

2025-2030

AIMaCoV

AI Mapping and Compilation for CGRAs on RISC-V Hosts

2024-2026

SEAGUARD

Sea Environmental Awareness and Guard enhanced with Unmanned AI Robotic Detection

2024-2027

REPLICA

Replicable Cellular Networking Experiments using ns-3

2024-2025

RFIDCORK

Definition of a Reference Architecture for RFID Implementation

2024-2024

TestBed5G_1

First TestBed 5G Pilots

2024-2025

OBJECT

OBJECT: Where do our object mental symbols come from? Learning with a neurosymbolic model

2024-2025

AI4LUNGS

AI-BASED PERSONALISED CARE FOR RESPIRATORY DISEASE USING MULTI-MODAL DATA IN PATIENT STRATIFICATION

2024-2027

OPITDEV

O-PITDEV – redesign of product electronics

2024-2024

AICare4U

AI-based Robotic Solution Addressing Compensatory Patterns for Upper Limb Rehabilitation

2024-2026

PHASE IV AI

Privacy compliant health data as a service for AI development

2023-2026

PFAI4_4eD

Advanced Training Programme Industry 4 - 4th edition

2023-2023

UNIFY

Compilation Abstraction and Hardware Adaptation for Specialized and General-Purpose Computing Unification

2023-2026

CELLO

The sound of cells: An acoustic platform aiming towards biophysical cell fingerprints for label-free precision medicine

2023-2026

WATSON

A holistic framework with Anticounterfeit and Intelligence-based technologies that will assist food chain stakeholders in rapidly identifying and preventing the spread of fraudulent practices

2023-2026

TORIS

Towards fully printed reconfigurable intelligent surfaces

2023-2025

LUCCA

AI-based Models for Lung Cancer Characterization: a Multimodal and Causal Approach

2023-2025

Shielding

Shielding measurements of materials

2023-2023

CAGING

Causality-driven Generative Models for Privacy-preserving Case-based Explanations

2023-2024

CONVERGE

Telecommunications and computer vision convergence tools for research infrastructures

2023-2026

EADIGIFOLK

An European and Ibero-American approach for the digital collection, analysis and dissemination of folk music

2023-2026

A-IQ Ready

Artificial Intelligence using Quantum measured Information for realtime distributed systems at the edge

2023-2025

SuperIoT

Truly sustainable printed electronics-based IoT combining optical and radio wireless technologies

2023-2025

TERRAMETA

Terahertz reconfigurable metasurfaces for ultra-high rate wireless communications

2023-2025

AEROGANP

Creation of a cross-border axis for research and knowledge transfer in the aeronautics and space sector in the Galicia-Northern Portugal Euroregion

2023-2026

A-MoVeR

Mobilizing Agenda for the Development of Products & Systems towards an Intelligent and Green Mobility

2022-2025

OVERWATCH

Integrated holographic management map for safety and crisis events

2022-2025

AURORA

In-vehicle activity detection

2022-2023

NEXUS

Innovation Pact - Digital and Green Transition

2022-2025

Vision2Control

Vision-based Quality Control of Bearings

2022-2023

NewSpacePortugal

Agenda New Space Portugal

2022-2025

SUSTAINABLE PLASTICS

Mobilising Agenda for Sustainable Plastics

2022-2025

Produtech_R3

Mobilising Agenda of the Production Technologies Sector for Reindustrialisation

2022-2025

IWOW2022

IWOW2022 - NEWFOCUS COST action meeting and workshop

2022-2022

CINDERELLA

Clinical Validation of an AI-based approach to improve the shared decision-making process and outcomes in Breast Cancer Patients proposed for Locoregional treatment

2022-2026

PFAI4_3ed

Advanced Training Programme Industry 4 - 3rd edition

2022-2022

DivaX

Services for company Europeanisation

2022-2022

ABIS

Automated Biometric Identification System

2022-2022

FORM_I40

Industry 4.0 Training

2022-2022

THEIA

Automated Perception Driving

2022-2023

CIRCUMSTANCE

Circulating Microbial Signatures for Early Diagnosis of Cancer

2022-2025

OpenMinds

Synchronising creative minds for social cohesion and radical inclusion

2021-2023

HfPT

Health from Portugal

2021-2025

vCardID4

Digital fingerprint enhanced model - 4

2021-2022

WaveCorkCal

Calibration of the microwave hygrometer

2021-2023

CholdaDigital

Advanced Consultancy in Information Systems and Communication Networks for Quinta da Cholda

2021-2022

CadPath

Computer-Aided Diagnosis in Pathology

2021-2022

MATinMOL

Matter Waves in Moiré Lattices

2021-2025

5GforUtilities

5G Cellular Technology

2021-2022

DECARBONIZE

DEvelopment of strategies and policies based on energy and non-energy applications towards CARBON neutral cities via digitalization for citIZEns and society

2021-2023

Training4DS

Advanced Training in Data Science - Altice Labs

2020-2020

PFAI4.0

Advanced Training Programme Industry 4.0

2020-2021

iiLab

Expansion of INESC TEC's Technological Infrastructure for the Digital Transformation of Industry

2020-2023

Continental FoF

Continental Advanced Antenna Factory of the Future

2020-2023

FLY_PT

Mobilising the national aeronautical industry for disruption in the urban air transport of the future

2020-2023

TAMI

Transparent Artificial Medical Intelligence

2020-2023

WiFi4DSO

Wi-Fi technology applied to DSO scenarios

2019-2019

LeGeM

Learning Representations and Generative Models for 3D Breast Data

2019-2021

CorkNetmon

Network Infrastructure Monitoring for Remote Cork Manufacturing

2019-2020

Inphinit

"LA CAIXA" Inphinit Doctoral Fellowship

2019-2022

SLID

Invitation to collaborate

2019-2022

ProLab

Professional consultancy using the electronics laboratory

2019-2019

TenisApp2

Mobile application for tennis match analysis - 2

2019-2021

InterConnect

Interoperable Solutions Connecting Smart Homes, Buildings and Grids

2019-2024

NFCAD

Near Field Contact Antenna Development

2019-2020

EuConNeCts4

European Conferences on Networks and Communications

2019-2022

SCA

Antenna characterisation service

2019-2019

STRx

Electronically steered signal transmission and reception system for the next generation of satellite constellations (LEO and MEO).

2019-2022

RESPONDRONE

NOVEL INTEGRATED SOLUTION OF OPERATING A FLEET OF DRONES WITH MULTIPLE SYNCHRONIZED MISSIONS FOR DISASTER RESPONSES

2019-2022

InterCork

Remote Cork Manufacturing

2019-2019

CLOUD4CANDY

Cloud for CANDY

2019-2019

Evo3DModel

Consultancy for improving the INSIGHT system

2019-2020

OpenInnoTrain

Research Translation and Applied Knowledge Exchange in Practice through University-Industry-Cooperation

2019-2024

FollicleCounter

Provision of research and development services in image processing for more reliable assessment of implanted follicles

2018-2021

NB-IoT

Consultancy on Narrowband Internet of Things technology

2018-2019

XPERIMUS

Experimentation in music in Portuguese culture: History, contexts and practices in the 20th and 21st centuries

2018-2022

Blueenergy

Blue energy generation using hybrid triboelectric/photovoltaic systems for the long term deployment of Autonomous Underwater Vehicles

2018-2020

PEPCC

Power efficiency and performance for embedded and HPC systems with custom CGRAs

2018-2021

GROW

Long-range broadband underwater wireless communications

2018-2021

AUTOMOTIVE

AUTOmatic multiMOdal drowsiness detecTIon for smart Vehicles

2018-2021

HELP-MD

The emotional and healing power of music and dance

2018-2022

NeurOxide

Integration of oxide thin film transistors and memristors in neuromorphic networks

2018-2022

LUCAS

Lung cancer screening - A non-invasive methodology for early diagnosis

2018-2022

HEMOSwimmers

Hemodynamic optimization around 3D swimming microbots

2018-2022

CLARE

Computer-aided cervical cancer screening

2018-2021

ENDURANCE

Underwater wireless energy and communications enabling long-term deep-sea presence

2018-2020

S-MODE

Screening of antibiotic contamination by mobile devices

2018-2021

UnWSNet

Underwater Wireless Sensor Networks

2018-2018

SIMBED

Fed4fire testbed for experimentation

2018-2019

5G

Components and Services for 5G Networks

2018-2021

FotoInMotion

Repurposing and enriching images for immersive storytelling through smart digital tools

2018-2020

ConnectedRefinery

Wireless communications network for the Galp Energia facilities in Leixões

2018-2019

Arquitetura_IoT

Consultancy on the reference architecture for implementing IoT-based information services

2017-2018

CHIC

Cooperative Holistic view on Internet and Content

2017-2020

CompMash

Music compatibility models for interactive mashup applications

2017-2019

UGREEN

Energy Consumption Optimisation of LTE-U and Wi-Fi Networks in Coexistence Scenarios

2017-2019

TERAPOD

Terahertz based Ultra High Bandwidth Wireless Access Networks

2017-2021

SURGEONMATE

Video processing for surgery analysis

2017-2017

TEC4Sea

Modular Platform for Research, Test and Validation of Technologies supporting a Sustainable Blue Economy

2017-2022

ROMOVI

ROMOVI: Modular and cooperative Robot for hillside Vineyards

2017-2019

BCCT.Plan

BCCT.plan: 3D tool for planning breast cancer conservative treatment

2016-2020

WI-GREEN

WI-GREEN: Traffic-pattern-aware energy consumption optimisation of Wi-Fi networks

2016-2018

RAWFIE

Road-, Air- and Water-based Future Internet Experimentation

2016-2019

Cloud-Setup

Platform for preparing audiovisual content for cloud ingest

2016-2019

EVOXANT

Bacterial evolution beyond the cultured isolates - Xanthomonas arboricola pv. juglandis as a paradigm

2016-2020

WISE

TrafficAware Flying Backhaul Mesh Networks

2016-2019

MareCom

Community maritime networks and services

2016-2018

STRONGMAR-CRAS

STRengthening MARitime Technology Research Center

2016-2018

CORAL-TOOLS

CORAL – Sustainable Ocean Exploitation: Tools and Sensors

2016-2018

BLUECOM+

Connecting Humans and Systems at Remote Ocean Areas using Cost-effective Broadband Communications

2015-2017

ENDURE

Enabling Long-Term Deployments of Underwater Robotic Platforms in Remote Oceanic Locations

2015-2017

FOUREYES

TEC4Growth - RL FourEyes - Intelligence, Interaction, Immersion and Innovation for media industries

2015-2019

NanoStima-RL5

NanoSTIMA - Advanced Methodologies for Computer-Aided Detection and Diagnosis

2015-2019

NanoStima-RL1

NanoSTIMA - Macro-to-Nano Human Sensing Technologies

2015-2019

SMILES

SMILES - Smart, Mobile, Intelligent and Large scale Sensing and analytics

2015-2019

VAMOS

Viable Alternative Mine Operating System

2015-2019

iBROW

Innovative ultra-BROadband ubiquitous Wireless communications through terahertz transceivers

2015-2018

SCREEN

Space Cognitive Radio for Electromagnetic Environment maNagement

2015-2016

AnyPLACE

Adaptable Platform for Active Services Exchange

2015-2018

SmarterEMC2

Smarter Grid: Empowering SG Market Actors through Information and Communication Technologies

2015-2017

SEAD

Statistically Enhanced Mixed-Signal and Analog Design

2014-2016

MDX

Simulation Models

2014-2016

Unisat

Broadband services simultaneous with satellite television reception

2014-2014

TWAVE

Phase conjugated twin waves to unlock the potential of future spatial division multiplexed systems (TWave)

2014-2015

HiperWireless

Microwave Point-to-Multipoint Communications in Free Hiperlan Band (17 GHz)

2014-2015

PGLobal

Development of software to be integrated into a platform for the automatic collection and selection of content from participating newspapers in several countries

2014-2015

SUNNY

Smart UNmanned aerial vehicle sensor Network for detection of border crossing and illegal entrY

2014-2018

PCSA

Place characterisation from sensing and acting

2013-2014

SIVIC

Wearable Integrated Cardiovascular Surveillance System

2013-2015

Creation

Cognitive Radio Transceiver Design for energy Efficient Data Transmission

2013-2016

TDT

Digital terrestrial television (DTT) signal monitoring

2013-2014

PICTURE

Patient Information Combined for the Assessment of specific surgical outcomes in breast cancer

2013-2016

Confine

Community Networks Testbed for the Future Internet

2013-2015

MAT

Media Arts and Technology

2013-2015

Sensing

Network Sensing for Critical Systems Monitoring

2013-2015

SmartGrids

Smart Grids

2013-2015

Cooperation

Cooperation and Perception for Augmented Autonomy

2013-2015

ASSIST

Retail futsal statistics

2012-2013

MOGTIDT

Station to obtain 3D images

2012-2012

MTGrid

Multi Technology Communication Infrastructure for the Smart Grid

2012-2015

RETAIL_PRO

Integrated Platform to Strategically Manage Retail Environments

2012-2015

WiSAT

Waveguide passive devices in S and Ka band

2012-2012

SARA

Asset Management System for Road Networks

2012-2015

CPT

Cartesian Polar Transmitter

2012-2014

MC-WMNs

Multiple Context-based Wireless Mesh Networks (MC-WMNs)

2012-2015

SENSEIVER

Low-cost and energy-efficient LTCC sensor/IR-UWB transceiver solutions for a sustainable healthy environment

2011-2015

MIRes

Roadmap for Music Information Research

2011-2013

AdChrono

Automatic optimisation of online advertising

2011-2013

3dBCT

3D Models for Aesthetic Evaluations and Result Predictions in Breast Cancer Procedures

2011-2014

CASA

Computational Auditory Scene Analysis Framework for Sound Segregation in Music Signals

2011-2014

SHAKEIT

Mechanisms of Musical Groove and applications

2011-2013

MultiRadioAccess

Multi-radio (HSPA and WiFi) aggregation to provide a higher rate than those of the individual networks

2011-2011

Steering

Steering of light in nonlinear waveguides with resonant interactions

2011-2014

CNG

New Generation Contents for Education and Vocational Training

2011-2014

AAL4ALL

Ambient Assisted Living for All

2011-2015

SUM

Sensing and Understanding human Motion dynamics

2011-2013

User-Tracking2.0

User-Tracking for Web traffic

2010-2011

SELF-PVP

Self-organizing power management for photovoltaic power plants

2010-2014

ImTV

On demand Immersive-TV for communities of Media Producers and Consumers

2010-2014

NeTS

Next Generation Network Operations and Management

2010-2013

EscolinhasCriativas

Creative Spaces for Creative Kids

2010-2013

Convergence

Content-centric, publish-subscribe service model for the Internet

2010-2013

SafeHomeHealthCare

Interference-free Home Health-Care Smart Spaces using Search Algorithms and Meta-Reality Reflection

2010-2013

OSP

Optical Signal Processing Using Highly Nonlinear Fibers

2010-2013

ProLimb

Electronic sensing for the prophylaxis of lower limb pathologies

2010-2013

ContextAware

Context-aware and personalized multimedia services

2010-2011

REIVE

Intelligent electric networks with plug-in electric vehicles

2010-2012

Alicante

MediA Ecosystem Deployment through Ubiquitous Content-Aware Network Environments

2010-2013

P3.net

On-line daily news platform for young people

2010-2012

LUL

Living Usability Lab for Next Generation Networks

2010-2012

Hotel3.0

Web3.0 Platform for the hospitality market

2010-2012

WOWI

Wireless-optical-wireless interfaces for picocellular access networks

2010-2013

RobVigil

Collaborative and intelligent surveillance robot for the security area

2010-2012

Daphne

Developing aircraft photonic networks

2009-2013

SWIOP

Intelligent and secure Webmail System to Support Personal Organization

2009-2011

SITMe

Metropolitan multi-technology wireless network for public transportation systems

2009-2012

Mobiles

Sustainable electric mobility - Solutions for the logistics associated with electric vehicle battery charging

2009-2012

V-SAT

Passive devices for VSAT (Very Small Aperture Terminal) applications

2009-2009

PortalDouro

Tourism portal for the Douro region

2009-2011

KINETIC

Controller driven adaptive and dynamic music composition systems

2009-2011

ASP-RedesDomesticas

Authentication, Security and Privacy (ASP) Solutions for home networks

2009-2010

MuMoMgt

Multicast and mobility management in heterogeneous access networks

2009-2012

ReCoop

Cooperative Wireless Networks

2009-2012

SemanticPACS

Picture Archiving and Communication System with Semantic Search Engine

2009-2011

Palco3.0

Intelligent Web system to support the management of a social network on music

2008-2011

AHRS

Attitude-Heading Reference System based on MEMS technology

2008-2010

GeCLIFmcast

Management of multicast sessions for IP based services created by users

2008-2009

DR-Vids

Dynamic reconfiguration of logical resources for real time foreground/background video segmentation

2007-2010

Vector

Compilation and Synthesis of Image Processing Algorithms in MATLAB for FPGA-based Custom Vector Units

2007-2011

OMR

Optical recognition system for handwritten music scores

2007-2010

BCCT

Advanced objective method for the evaluation of the aesthetic result of Breast Cancer Conservative Treatment

2007-2010

ROFWDM

Design and Optimisation of WDM Millimetre-Wave Fibre-Radio Systems

2007-2010

EDCine

Enhanced Digital Cinema

2007-2009

VISNETII

Networked Audiovisual Media Technologies

2006-2009

Team

Laboratories

Laboratory of Sound and Music Computing

Optical and Electronic Technologies Research Laboratory

Publications

CTM Publications


2025

A survey on cell nuclei instance segmentation and classification: Leveraging context and attention

Authors
Nunes, JD; Montezuma, D; Oliveira, D; Pereira, T; Cardoso, JS;

Publication
MEDICAL IMAGE ANALYSIS

Abstract
Nuclear-derived morphological features and biomarkers provide relevant insights regarding the tumour microenvironment, while also allowing diagnosis and prognosis in specific cancer types. However, manually annotating nuclei from the gigapixel Haematoxylin and Eosin (H&E)-stained Whole Slide Images (WSIs) is a laborious and costly task, meaning automated algorithms for cell nuclei instance segmentation and classification could alleviate the workload of pathologists and clinical researchers and at the same time facilitate the automatic extraction of clinically interpretable features for artificial intelligence (AI) tools. But due to high intra- and inter-class variability of nuclei morphological and chromatic features, as well as H&E stains' susceptibility to artefacts, state-of-the-art algorithms cannot correctly detect and classify instances with the necessary performance. In this work, we hypothesize context and attention inductive biases in artificial neural networks (ANNs) could increase the performance and generalization of algorithms for cell nuclei instance segmentation and classification. To understand the advantages, use-cases, and limitations of context and attention-based mechanisms in instance segmentation and classification, we start by reviewing works in computer vision and medical imaging. We then conduct a thorough survey on context and attention methods for cell nuclei instance segmentation and classification from H&E-stained microscopy imaging, while providing a comprehensive discussion of the challenges being tackled with context and attention. Besides, we illustrate some limitations of current approaches and present ideas for future research. As a case study, we extend both a general (Mask-RCNN) and a customized (HoVer-Net) instance segmentation and classification methods with context- and attention-based mechanisms and perform a comparative analysis on a multicentre dataset for colon nuclei identification and counting.
Although pathologists rely on context at multiple levels while paying attention to specific Regions of Interest (RoIs) when analysing and annotating WSIs, our findings suggest translating that domain knowledge into algorithm design is no trivial task, but to fully exploit these mechanisms in ANNs, the scientific understanding of these methods should first be addressed.

2025

Causal representation learning through higher-level information extraction

Authors
Silva, F; Oliveira, HP; Pereira, T;

Publication
ACM COMPUTING SURVEYS

Abstract
The large gap between the generalization level of state-of-the-art machine learning and human learning systems calls for the development of artificial intelligence (AI) models that are truly inspired by human cognition. In tasks related to image analysis, searching for pixel-level regularities has reached a power of information extraction still far from what humans capture with image-based observations. This leads to poor generalization when even small shifts occur at the level of the observations. We explore a perspective on this problem that is directed to learning the generative process with causality-related foundations, using models capable of combining symbolic manipulation, probabilistic reasoning, and pattern recognition abilities. We briefly review and explore connections of research from machine learning, cognitive science, and related fields of human behavior to support our perspective for the direction to more robust and human-like artificial learning systems.

2025

Evaluation of Lyrics Extraction from Folk Music Sheets Using Vision Language Models (VLMs)

Authors
Sales Mendes, A; Lozano Murciego, Á; Silva, LA; Jiménez Bravo, M; Navarro Cáceres, M; Bernardes, G;

Publication
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Abstract
Monodic folk music has traditionally been preserved in physical documents. It constitutes a vast archive that needs to be digitized to facilitate comprehensive analysis using AI techniques. A critical component of music score digitization is the transcription of lyrics, an extensively researched process in Optical Character Recognition (OCR) and document layout analysis. These fields typically require the development of specific models that operate in several stages: first, to detect the bounding boxes of specific texts, then to identify the language, and finally, to recognize the characters. Recent advances in vision language models (VLMs) have introduced multimodal capabilities, such as processing images and text, which are competitive with traditional OCR methods. This paper proposes an end-to-end system for extracting lyrics from images of handwritten musical scores. We aim to evaluate the performance of two state-of-the-art VLMs to determine whether they can eliminate the need to develop specialized text recognition and OCR models for this task. The results of the study, obtained from a dataset in a real-world application environment, are presented along with promising new research directions in the field. This progress contributes to preserving cultural heritage and opens up new possibilities for global analysis and research in folk music.

2025

Model compression techniques in biometrics applications: A survey

Authors
Caldeira, E; Neto, PC; Huber, M; Damer, N; Sequeira, AF;

Publication
INFORMATION FUSION

Abstract
The development of deep learning algorithms has extensively empowered humanity's task automatization capacity. However, the huge improvement in the performance of these models is highly correlated with their increasing level of complexity, limiting their usefulness in human-oriented applications, which are usually deployed in resource-constrained devices. This led to the development of compression techniques that drastically reduce the computational and memory costs of deep learning models without significant performance degradation. These compressed models are especially essential when implementing multi-model fusion solutions where multiple models are required to operate simultaneously. This paper aims to systematize the current literature on this topic by presenting a comprehensive survey of model compression techniques in biometrics applications, namely quantization, knowledge distillation and pruning. We conduct a critical analysis of the comparative value of these techniques, focusing on their advantages and disadvantages and presenting suggestions for future work directions that can potentially improve the current methods. Additionally, we discuss and analyze the link between model bias and model compression, highlighting the need to direct compression research toward model fairness in future works.

2024

Incremental Redundancy HARQ Communication Schemes applied to Energy Efficient IoT Systems

Authors
Silva, SM; Almeida, NT;

Publication
2024 IEEE 22ND MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, MELECON 2024

Abstract
The rapid proliferation of Internet of Things (IoT) systems, encompassing a wide range of devices and sensors with limited battery life, has highlighted the critical need for energy-efficient solutions to extend the operational lifespan of these battery-powered devices. One effective strategy for reducing energy consumption is minimizing the number and size of retransmitted packets in case of communication errors. Among the potential solutions, Incremental Redundancy Hybrid Automatic Repeat reQuest (IR-HARQ) communication schemes have emerged as particularly compelling options by adopting the best aspects of error control, namely, automatic repetition and variable redundancy. This work addresses the challenge by developing a simulator capable of executing and analysing several (H)ARQ schemes using different channel models, such as the Additive White Gaussian Noise (AWGN) and Gilbert-Elliott (GE) models. The primary objective is to compare their performance across multiple metrics, enabling a thorough evaluation of their capabilities. The results indicate that IR-HARQ outperforms alternative methods, especially in the presence of burst errors. Furthermore, its potential for further adaptation and enhancement opens up new ways for optimizing energy consumption and extending the lifespan of battery-powered IoT devices.

Facts & Figures

2 R&D Employees

2020

15 Academic Staff

2020

82 Researchers

2016

Contacts