Cookies
O website necessita de alguns cookies e outros recursos semelhantes para funcionar. Caso o permita, o INESC TEC irá utilizar cookies para recolher dados sobre as suas visitas, contribuindo, assim, para estatísticas agregadas que permitem melhorar o nosso serviço. Ver mais
Aceitar Rejeitar
  • Menu
Publicações

2024

Uncovering Manipulated Files Using Mathematical Natural Laws

Autores
Fernandes, P; Ciardhuáin, SO; Antunes, M;

Publicação
PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2023, PT I

Abstract
The data exchange between different sectors of society has led to the development of electronic documents supported by different reading formats, namely portable PDF format. These documents have characteristics similar to those used in programming languages, allowing the incorporation of potentially malicious code, which makes them a vector for cyberattacks. Thus, detecting anomalies in digital documents, such as PDF files, has become crucial in several domains, such as finance, digital forensic analysis and law enforcement. Currently, detection methods are mostly based on machine learning and are characterised by being complex, slow and mainly inefficient in detecting zero-day attacks. This paper aims to propose a Benford Law (BL) based model to uncover manipulated PDF documents by analysing potential anomalies in the first digit extracted from the PDF document's characteristics. The proposed model was evaluated using the CIC Evasive PDFMAL-2022 dataset, consisting of 1191 documents (278 benign and 918 malicious). To classify the PDF documents, based on BL, into malicious or benign documents, three statistical models were used in conjunction with the mean absolute deviation: the parametric Pearson and the non-parametric Spearman and Cramer-Von Mises models. The results show a maximum F1 score of 87.63% in detecting malicious documents using Pearson's model, demonstrating the suitability and effectiveness of applying Benford's Law in detecting anomalies in digital documents to maintain the accuracy and integrity of information and promoting trust in systems and institutions.

2024

A Machine Learning App for Monitoring Physical Therapy at Home

Autores
Pereira, B; Cunha, B; Viana, P; Lopes, M; Melo, ASC; Sousa, ASP;

Publicação
SENSORS

Abstract
Shoulder rehabilitation is a process that requires physical therapy sessions to recover the mobility of the affected limbs. However, these sessions are often limited by the availability and cost of specialized technicians, as well as the patient's travel to the session locations. This paper presents a novel smartphone-based approach using a pose estimation algorithm to evaluate the quality of the movements and provide feedback, allowing patients to perform autonomous recovery sessions. This paper reviews the state of the art in wearable devices and camera-based systems for human body detection and rehabilitation support and describes the system developed, which uses MediaPipe to extract the coordinates of 33 key points on the patient's body and compares them with reference videos made by professional physiotherapists using cosine similarity and dynamic time warping. This paper also presents a clinical study that uses QTM, an optoelectronic system for motion capture, to validate the methods used by the smartphone application. The results show that there are statistically significant differences between the three methods for different exercises, highlighting the importance of selecting an appropriate method for specific exercises. This paper discusses the implications and limitations of the findings and suggests directions for future research.

2024

Painless Artificial Intelligence Point-of-Care hemogram diagnosis in Companion Animals

Autores
Barroso, TG; Costa, JM; Gregório, AH; Martins, RC;

Publicação

Abstract
Quantification of erythrocytes and leukocytes is an essential aspect of hemogram diagno- 23 sis in Veterinary Medicine. Flow cytometry analysis, laser scattering, and impedance detection are 24 standard laboratory techniques, verified by manual microscopy counting. Although single-cell scat- 25 tering is already used as a standard technology for differentiating cell counts in flow cytometry, it 26 requires capillary cell separation. The current study investigates the scattering characteristics of 27 whole blood to identify correlations with erythrocytes and leukocytes counts. The scattering infor- 28 mation present in blood samples can be classified into three types: i) geometrical scattering, which 29 occurs when non-absorbed light is reflected and scattered, ii) Mie scattering, which happens when 30 light is scattered by particles of a similar size to the wavelength, and iii) Rayleigh scattering, which occurs when light is scattered by particles that are smaller than the incident light wavelength. In 32 this study, we investigate the scattering correction coefficients of dog blood absorption spectra in 33 the visible-near infrared range, to establish direct correlations with erythrocytes and leukocytes 34 counts, using multivariate linear regression. Our findings demonstrate the possibility of using the 35 scattering properties of dog blood, which is a step towards the existence of a portable and miniatur- 36 ized hemogram diagnosis in Veterinary Clinics worldwide.

2024

Probabilistic Positioning of a Mooring Cable in Sonar Images for In-Situ Calibration of Marine Sensors

Autores
Oliveira A.J.; Ferreira B.M.; Cruz N.A.; Diamant R.;

Publicação
IEEE Transactions on Mobile Computing

Abstract
The calibration of sensors stationed along a cable in marine observatories is a time-consuming and expensive operation that involves taking the mooring out of the water periodically. In this paper, we present a method that allows an underwater vehicle to approach a mooring, in order to take reference measurements along the cable for in-situ sensor calibration. We use the vehicle's Mechanically Scanned Imaging Sonar (MSIS) to identify the cable's reflection within the sonar image. After pre-processing the image to remove noise, enhance contour lines, and perform smoothing, we employ three detection steps: 1) selection of regions of interest that fit the cable's reflection pattern, 2) template matching, and 3) a track-before-detect scheme that utilized the vehicle's motion. The later involves building a lattice of template matching responses for a sequence of sonar images, and using the Viterbi algorithm to find the most probable sequence of cable locations that fits the maximum speed assumed for the surveying vessel. Performance is explored in pool and sea trials, and involves an MSIS onboard an underwater vehicle scanning its surrounding to identify a steel-core cable. The results show a sub-meter accuracy in the multi-reverberant pool environment and in the sea trial. For reproducibility, we share our implementation code.

2024

Comparative Evaluation of Remote Sensing Platforms for Almond Yield Prediction

Autores
Guimarães N.; Fraga H.; Sousa J.J.; Pádua L.; Bento A.; Couto P.;

Publicação
AgriEngineering

Abstract
Almonds are becoming a central element in the gastronomic and food industry worldwide. Over the last few years, almond production has increased globally. Portugal has become the third most important producer in Europe, where this increasing trend is particularly evident. However, the susceptibility of almond trees to changing climatic conditions presents substantial risks, encompassing yield reduction and quality deterioration. Hence, yield forecasts become crucial for mitigating potential losses and aiding decisionmakers within the agri-food sector. Recent technological advancements and new data analysis techniques have led to the development of more suitable methods to model crop yields. Herein, an innovative approach to predict almond yields in the Trás-os-Montes region of Portugal was developed, by using machine learning regression models (i.e., the random forest regressor, XGBRegressor, gradient boosting regressor, bagging regressor, and AdaBoost regressor), coupled with remote sensing data obtained from different satellite platforms. Satellite data from both proprietary and free platforms at different spatial resolutions were used as features in the study (i.e., the GSMP: 11.13 km, Terra: 1 km, Landsat 8: 30 m, Sentinel-2: 10 m, and PlanetScope: 3 m). The best possible combination of features was analyzed and hyperparameter tuning was applied to enhance the prediction accuracy. Our results suggest that high-resolution data (PlanetScope) combined with irrigation information, vegetation indices, and climate data significantly improves almond yield prediction. The XGBRegressor model performed best when using PlanetScope data, reaching a coefficient of determination (R2) of 0.80. However, alternative options using freely available data with lower spatial resolution, such as GSMaP and Terra MODIS LST, also showed satisfactory performance (R2 = 0.68). This study highlights the potential of integrating machine learning models and remote sensing data for accurate crop yield prediction, providing valuable insights for informed decision support in the almond sector, contributing to the resilience and sustainability of this crop in the face of evolving climate dynamics.

2024

Branching pomsets: Design, expressiveness and applications to choreographies

Autores
Edixhoven, L; Jongmans, SS; Proença, J; Castellani, I;

Publicação
JOURNAL OF LOGICAL AND ALGEBRAIC METHODS IN PROGRAMMING

Abstract
Choreographic languages describe possible sequences of interactions among a set of agents. Typical models are based on languages or automata over sending and receiving actions. Pomsets provide a more compact alternative by using a partial order to explicitly represent causality and concurrency between these actions. However, pomsets offer no representation of choices, thus a set of pomsets is required to represent branching behaviour. For example, if an agent Alice can send one of two possible messages to Bob three times, one would need a set of 2 x 2 x 2 distinct pomsets to represent all possible branches of Alice's behaviour. This paper proposes an extension of pomsets, named branching pomsets, with a branching structure that can represent Alice's behaviour using 2 + 2 + 2 ordered actions. We compare the expressiveness of branching pomsets with that of several forms of event structures from the literature. We encode choreographies as branching pomsets and show that the pomset semantics of the encoded choreographies are bisimilar to their operational semantics. Furthermore, we define well-formedness conditions on branching pomsets, inspired by multiparty session types, and we prove that the well-formedness of a branching pomset is a sufficient condition for the realisability of the represented com-munication protocol. Finally, we present a prototype tool that implements our theory of branching pomsets, focusing on its applications to choreographies. (c) 2023 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons .org /licenses /by /4 .0/).

  • 19
  • 3789