2024
Authors
Vieira, M; Gonçalves, T; Silva, W; Sequeira, F;
Publication
BIOSIG 2024 - Proceedings of the 23rd International Conference of the Biometrics Special Interest Group
Abstract
The proliferation of explicit material online, particularly pornography, has emerged as a paramount concern in our society. While state-of-the-art pornography detection models already show some promising results, their decision-making processes are often opaque, raising ethical issues. This study focuses on uncovering the decision-making process of such models, specifically fine-tuned convolutional neural networks and transformer architectures. We compare various explainability techniques to illuminate the limitations, potential improvements, and ethical implications of using these algorithms. Results show that models trained on diverse and dynamic datasets tend to be more robust and generalisable than models trained on static datasets. Additionally, transformer models demonstrate superior performance and generalisation compared to convolutional ones. Furthermore, we implemented a privacy-preserving framework during explanation retrieval, which contributes to developing secure and ethically sound biometric applications. © 2024 IEEE.
2024
Authors
Stelter L.; Corbetta V.; Beets-Tan R.; Silva W.;
Publication
Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
Abstract
Federated Learning (FL) is emerging in the medical field to address the need for diverse datasets while complying with data protection regulations. This decentralised learning paradigm allows hospitals (clients) to train machine learning models locally, ensuring that patient data remains within the confines of its originating institution. Nonetheless, FL by itself is not enough to guarantee privacy, as the central aggregation process may still be susceptible to identity-exposing attacks, potentially compromising data protection compliance. To strengthen privacy, differential privacy (DP) is often introduced. In this work, we conduct a comprehensive comparative analysis to evaluate the impact of DP in both traditional Centralised Learning (CL) frameworks and FL for polyp segmentation, a common medical image analysis task. Experiments are performed on PolypGen, a publicly available multi-centre dataset designed for polyp segmentation. The results show a clear drop in performance with the introduction of DP, exposing the trade-off between privacy and performance and highlighting the need to develop novel privacy-preserving techniques.
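As a rough illustration of where the privacy/performance trade-off mentioned above comes from (a generic DP-SGD-style sketch, not the paper's actual implementation; the function name, clipping norm, and noise multiplier are illustrative assumptions): each client update is clipped to a bounded L2 norm and perturbed with Gaussian noise before aggregation, and it is this injected noise that degrades segmentation performance.

```python
import numpy as np

def clip_and_noise(update, clip_norm=1.0, noise_multiplier=1.1, rng=None):
    """Clip a model update to L2 norm <= clip_norm, then add Gaussian noise
    scaled by noise_multiplier * clip_norm (the Gaussian mechanism)."""
    rng = rng or np.random.default_rng(0)
    update = np.asarray(update, dtype=float)
    norm = np.linalg.norm(update)
    # Scale down only if the update exceeds the clipping threshold.
    clipped = update * min(1.0, clip_norm / max(norm, 1e-12))
    # Larger noise_multiplier -> stronger privacy, but noisier aggregate.
    noise = rng.normal(0.0, noise_multiplier * clip_norm, size=update.shape)
    return clipped + noise

# A hypothetical client update with L2 norm 5.0 gets clipped to norm 1.0
# before noise is added.
u = np.array([3.0, 4.0])
private_u = clip_and_noise(u, clip_norm=1.0)
```

Raising `noise_multiplier` tightens the privacy guarantee but moves each noisy update further from the true one, which is the drop in performance the study measures.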
2024
Authors
Latorre, L; Petrychenko, L; Beets-Tan, R; Kopytova, T; Silva, W;
Publication
Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
Abstract
We explore deep generative models to generate case-based explanations in a medical federated learning setting. Explaining AI model decisions through case-based interpretability is paramount to increasing trust and allowing widespread adoption of AI in clinical practice. However, medical AI training paradigms are shifting towards federated learning settings in order to comply with data protection regulations. In a federated scenario, past data is inaccessible to the current user. Thus, we use a deep generative model to generate synthetic examples that protect privacy and explain decisions. Our proof-of-concept focuses on pleural effusion diagnosis and uses publicly available Chest X-ray data. © 2024 IEEE.
2024
Authors
Eijpe, A; Corbetta, V; Chupetlovska, K; Beets-Tan, R; Silva, W;
Publication
Lecture Notes in Computer Science - Deep Generative Models
Abstract
2024
Authors
Neto, PC; Mamede, RM; Albuquerque, C; Gonçalves, T; Sequeira, AF;
Publication
2024 IEEE 18th International Conference on Automatic Face and Gesture Recognition, FG 2024
Abstract
Face recognition applications have grown in parallel with the size of datasets, the complexity of deep learning models, and computational power. However, while deep learning models evolve to become more capable and computational power keeps increasing, the datasets available are being retracted and removed from public access. Privacy and ethical concerns are relevant topics within these domains. Through generative artificial intelligence, researchers have put efforts into the development of completely synthetic datasets that can be used to train face recognition systems. Nonetheless, recent advances have not been sufficient to achieve performance comparable to state-of-the-art models trained on real data. To study the gap between the performance of models trained on real and synthetic datasets, we leverage a massive attribute classifier (MAC) to create annotations for four datasets: two real and two synthetic. From these annotations, we conduct studies on the distribution of each attribute within all four datasets. Additionally, we further inspect the differences between real and synthetic datasets on the attribute set. Comparing the distributions through the Kullback-Leibler divergence, we found clear differences between real and synthetic samples. Interestingly, we verified that while real samples suffice to explain the synthetic distribution, the reverse does not hold.
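The asymmetry noted in this abstract follows directly from the definition of the Kullback-Leibler divergence, which is not symmetric in its arguments. A minimal sketch (the function and the attribute frequencies below are illustrative assumptions, not the paper's data):

```python
import numpy as np

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for discrete distributions; eps avoids log(0)."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    p = p / p.sum()  # normalise to valid probability vectors
    q = q / q.sum()
    return float(np.sum(p * np.log((p + eps) / (q + eps))))

# Hypothetical frequencies of one binary attribute (e.g. "smiling")
# in a real vs a synthetic dataset.
real = [0.55, 0.45]
synthetic = [0.70, 0.30]

# KL(real || synthetic) generally differs from KL(synthetic || real),
# which is why one direction can "explain" the other while the
# reverse does not hold.
d_rs = kl_divergence(real, synthetic)
d_sr = kl_divergence(synthetic, real)
```

Because the divergence weights log-ratios by the first argument's probabilities, regions that the synthetic data under-covers are penalised differently in each direction.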
2024
Authors
Mamede, RM; Neto, PC; Sequeira, AF;
Publication
CoRR
Abstract