2021
Autores
Oliveira, M; Oliveira, J; Camacho, R; Ferreira, C;
Publicação
BIOSIGNALS: PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES - VOL 4: BIOSIGNALS
Abstract
Cardiovascular diseases are one of the leading causes of death in the world. In low income countries, heart auscultation is of capital importance since it is an efficient and low cost method to monitor the heart. In this paper, we propose a multi-spot system that aims to detect cardiac anomalies and to support a diagnosis in remote areas with limited heath care response. Our proposed solutions exploits data collected from the four main auscultation spots: Mitral, Pulmonary, Tricuspid and Aorta in a asynchronous way. From the several multi-spot systems implemented, the best results were obtained using a bi-modal system that only processes the Mitral and the Pulmonary spot simultaneously. Using these two spots we have achieved an accuracy between 85.7% (smallest value, using ANN) and the best value of 91.4% (obtained with a logistic regression algorithm). Taking into a account the pediatric population and the incident cardiac pathologies, it happens to be the spots where the observed murmurs were most audible. We have also find out that when using four auscultation spots, the choice of the algorithm is of secondary priority, which does not seem to be the case for a single auscultation spot system. With one single auscultation we have an average of 4% of difference between the results obtained with the algorithms and with four auscultation spots we have a smaller average of 2.1%.
2021
Autores
Cavadas, B; Leite, M; Pedro, N; Magalhaes, AC; Melo, J; Correia, M; Maximo, V; Camacho, R; Fonseca, NA; Figueiredo, C; Pereira, L;
Publicação
MICROORGANISMS
Abstract
The continuous characterization of genome-wide diversity in population and case-cohort samples, allied to the development of new algorithms, are shedding light on host ancestry impact and selection events on various infectious diseases. Especially interesting are the long-standing associations between humans and certain bacteria, such as the case of Helicobacter pylori, which could have been strong drivers of adaptation leading to coevolution. Some evidence on admixed gastric cancer cohorts have been suggested as supporting Homo-Helicobacter coevolution, but reliable experimental data that control both the bacterium and the host ancestries are lacking. Here, we conducted the first in vitro coinfection assays with dual human- and bacterium-matched and -mismatched ancestries, in African and European backgrounds, to evaluate the genome wide gene expression host response to H. pylori. Our results showed that: (1) the host response to H. pylori infection was greatly shaped by the human ancestry, with variability on innate immune system and metabolism; (2) African human ancestry showed signs of coevolution with H. pylori while European ancestry appeared to be maladapted; and (3) mismatched ancestry did not seem to be an important differentiator of gene expression at the initial stages of infection as assayed here.
2021
Autores
Goncalves, CAO; Camacho, R; Goncalves, CT; Vieira, AS; Diz, LB; Iglesias, EL;
Publicação
APPLIED SCIENCES-BASEL
Abstract
The exponential growth of documents in the web makes it very hard for researchers to be aware of the relevant work being done within the scientific community. The task of efficiently retrieving information has therefore become an important research topic. The objective of this study is to test how the efficiency of the text classification changes if different weights are previously assigned to the sections that compose the documents. The proposal takes into account the place (section) where terms are located in the document, and each section has a weight that can be modified depending on the corpus. To carry out the study, an extended version of the OHSUMED corpus with full documents have been created. Through the use of WEKA, we compared the use of abstracts only with that of full texts, as well as the use of section weighing combinations to assess their significance in the scientific article classification process using the SMO (Sequential Minimal Optimization), the WEKA Support Vector Machine (SVM) algorithm implementation. The experimental results show that the proposed combinations of the preprocessing techniques and feature selection achieve promising results for the task of full text scientific document classification. We also have evidence to conclude that enriched datasets with text from certain sections achieve better results than using only titles and abstracts.
2022
Autores
Ferreira, P; Ladeiras, J; Camacho, R;
Publicação
PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS, PACBB 2021
Abstract
Cancer is one of the diseases with the highest mortality rate in the world. To understand the different origins of the disease, and to facilitate the development of new ways to treat it, laboratories cultivate, in vitro, cancer cells (cell lines), taken from patients with cancer. These cell lines enable researchers to test new approaches and to have an appropriate procedure for comparison of results. The methods used in an initial study at EMBL-EBI Institute (Cambridge, UK) were based on algorithms that construct propositional like models. The results reported were promising but we believe that they can be improved. A relevant limitation of the algorithms used in the original study is the absence or severe lack of comprehensibility of the models constructed. In Life Sciences, the possibility of understanding a model is an asset to help the specialist to understand the phenomenon that produced the data. With our study we have improved the performance of forecasting models and constructed understandable models. To meet these objectives we have used Graph Mining and Inductive Logic Programming algorithms.
2022
Autores
Teixeira, R; Rodrigues, C; Moreira, C; Barros, H; Camacho, R;
Publicação
SCIENTIFIC REPORTS
Abstract
The timely identification of cohort participants at higher risk for attrition is important to earlier interventions and efficient use of research resources. Machine learning may have advantages over the conventional approaches to improve discrimination by analysing complex interactions among predictors. We developed predictive models of attrition applying a conventional regression model and different machine learning methods. A total of 542 very preterm (< 32 gestational weeks) infants born in Portugal as part of the European Effective Perinatal Intensive Care in Europe (EPICE) cohort were included. We tested a model with a fixed number of predictors (Baseline) and a second with a dynamic number of variables added from each follow-up (Incremental). Eight classification methods were applied: AdaBoost, Artificial Neural Networks, Functional Trees, J48, J48Consolidated, K-Nearest Neighbours, Random Forest and Logistic Regression. Performance was compared using AUC- PR (Area Under the Curve-Precision Recall), Accuracy, Sensitivity and F-measure. Attrition at the four follow-ups were, respectively: 16%, 25%, 13% and 17%. Both models demonstrated good predictive performance, AUC-PR ranging between 69 and 94.1 in Baseline and from 72.5 to 97.1 in Incremental model. Of the whole set of methods, Random Forest presented the best performance at all follow-ups [AUC-PR1: 94.1 (2.0); AUC-PR2: 91.2 (1.2); AUC-PR3: 97.1 (1.0); AUC-PR4: 96.5 (1.7)]. Logistic Regression performed well below Random Forest. The top-ranked predictors were common for both models in all follow-ups: birthweight, gestational age, maternal age, and length of hospital stay. Random Forest presented the highest capacity for prediction and provided interpretable predictors. Researchers involved in cohorts can benefit from our robust models to prepare for and prevent loss to follow-up by directing efforts toward individuals at higher risk.
2022
Autores
Goncalves, CA; Vieira, AS; Goncalves, CT; Camacho, R; Iglesias, EL; Diz, LB;
Publicação
INFORMATION
Abstract
Multi-view ensemble learning exploits the information of data views. To test its efficiency for full text classification, a technique has been implemented where the views correspond to the document sections. For classification and prediction, we use a stacking generalization based on the idea that different learning algorithms provide complementary explanations of the data. The present study implements the stacking approach using support vector machine algorithms as the baseline and a C4.5 implementation as the meta-learner. Views are created with OHSUMED biomedical full text documents. Experimental results lead to the sustained conclusion that the application of multi-view techniques to full texts significantly improves the task of text classification, providing a significant contribution for the biomedical text mining research. We also have evidence to conclude that enriched datasets with text from certain sections are better than using only titles and abstracts.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.