2022
Autores
Ferreira, P; Ladeiras, J; Camacho, R;
Publicação
PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS, PACBB 2021
Abstract
Cancer is one of the diseases with the highest mortality rate in the world. To understand the different origins of the disease, and to facilitate the development of new ways to treat it, laboratories cultivate, in vitro, cancer cells (cell lines), taken from patients with cancer. These cell lines enable researchers to test new approaches and to have an appropriate procedure for comparison of results. The methods used in an initial study at EMBL-EBI Institute (Cambridge, UK) were based on algorithms that construct propositional like models. The results reported were promising but we believe that they can be improved. A relevant limitation of the algorithms used in the original study is the absence or severe lack of comprehensibility of the models constructed. In Life Sciences, the possibility of understanding a model is an asset to help the specialist to understand the phenomenon that produced the data. With our study we have improved the performance of forecasting models and constructed understandable models. To meet these objectives we have used Graph Mining and Inductive Logic Programming algorithms.
2022
Autores
Teixeira, R; Rodrigues, C; Moreira, C; Barros, H; Camacho, R;
Publicação
SCIENTIFIC REPORTS
Abstract
The timely identification of cohort participants at higher risk for attrition is important to earlier interventions and efficient use of research resources. Machine learning may have advantages over the conventional approaches to improve discrimination by analysing complex interactions among predictors. We developed predictive models of attrition applying a conventional regression model and different machine learning methods. A total of 542 very preterm (< 32 gestational weeks) infants born in Portugal as part of the European Effective Perinatal Intensive Care in Europe (EPICE) cohort were included. We tested a model with a fixed number of predictors (Baseline) and a second with a dynamic number of variables added from each follow-up (Incremental). Eight classification methods were applied: AdaBoost, Artificial Neural Networks, Functional Trees, J48, J48Consolidated, K-Nearest Neighbours, Random Forest and Logistic Regression. Performance was compared using AUC- PR (Area Under the Curve-Precision Recall), Accuracy, Sensitivity and F-measure. Attrition at the four follow-ups were, respectively: 16%, 25%, 13% and 17%. Both models demonstrated good predictive performance, AUC-PR ranging between 69 and 94.1 in Baseline and from 72.5 to 97.1 in Incremental model. Of the whole set of methods, Random Forest presented the best performance at all follow-ups [AUC-PR1: 94.1 (2.0); AUC-PR2: 91.2 (1.2); AUC-PR3: 97.1 (1.0); AUC-PR4: 96.5 (1.7)]. Logistic Regression performed well below Random Forest. The top-ranked predictors were common for both models in all follow-ups: birthweight, gestational age, maternal age, and length of hospital stay. Random Forest presented the highest capacity for prediction and provided interpretable predictors. Researchers involved in cohorts can benefit from our robust models to prepare for and prevent loss to follow-up by directing efforts toward individuals at higher risk.
2022
Autores
Goncalves, CA; Vieira, AS; Goncalves, CT; Camacho, R; Iglesias, EL; Diz, LB;
Publicação
INFORMATION
Abstract
Multi-view ensemble learning exploits the information of data views. To test its efficiency for full text classification, a technique has been implemented where the views correspond to the document sections. For classification and prediction, we use a stacking generalization based on the idea that different learning algorithms provide complementary explanations of the data. The present study implements the stacking approach using support vector machine algorithms as the baseline and a C4.5 implementation as the meta-learner. Views are created with OHSUMED biomedical full text documents. Experimental results lead to the sustained conclusion that the application of multi-view techniques to full texts significantly improves the task of text classification, providing a significant contribution for the biomedical text mining research. We also have evidence to conclude that enriched datasets with text from certain sections are better than using only titles and abstracts.
2022
Autores
Goncalves M.; Henriques A.; Costa A.R.; Correia D.; Severo M.; Severo M.; Lucas R.; Lucas R.; Barros H.; Santos A.C.; Ribeiro A.I.; Rocha A.; Lopes C.; Correia D.; Ramos E.; Gonçalves G.; Barros H.; Araújo J.; Talih M.; Tavares M.; Lunet N.; Meireles P.; Duarte R.; Camacho R.; Fraga S.; Correia S.; Silva S.; Leão T.;
Publicação
SLEEP MEDICINE
Abstract
Objective/background: To describe and characterize insomnia symptoms and nightmare profiles in Portugal during the first six weeks of a national lockdown due to COVID-19. Patients/methods: An open cohort study was conducted to collect information of the general population during the first wave of SARS-CoV-2/COVID-19 pandemic in Portugal. We analyzed data from 5011 participants (>= 16 years) who answered a weekly questionnaire about their well-being. Two questions about the frequency of insomnia and nightmares about COVID-19 were consecutively applied during six weeks (March-May 2020). Latent class analysis was conducted and different insomnia and nightmare profiles were identified. Associations between individual characteristics and both profiles were estimated using odds ratios (ORs) and 95% confidence intervals (CI). Results: Five insomnia (No insomnia, Stable-mild, Decreasing-moderate, Stable-severe, Increasing-severe) and three nightmares profiles (Stable-mild, Stable-moderate, Stable-severe) were identified. Being female, younger, perceiving their income as insufficient and feelings of fear towards COVID-19 were associated with higher odds of insomnia (Women: OR = 6.98 95%CI: 4.18-11.64; >= 60 years: OR = 0.30 95%CI: 0.18-0.53; Insufficient income: adjusted OR (aOR) = 8.413 95% CI: 3.93-16.84; Often presenting fear of being infected with SARS-CoV-2 infection: aOR = 9.13 95%CI: 6.36-13.11), and nightmares (Women: OR = 2.60 95%CI: 1.74-3.86; >= 60 years: OR = 0.45 95%CI: 0.28-0.74; Insufficient income: aOR = 2.60 95%CI: 1.20e5.20; Often/almost always presenting fear of being infected with SARS-CoV-2 infection: aOR = 6.62 95%CI: 5.01-8.74). Having a diagnosis of SARS-CoV-2 virus infection was associated with worse patterns of nightmares about the pandemic. Conclusions: Social and psychological individual factors are important characteristics to consider in the developmentof therapeutic strategies to supportpeoplewithsleep problems during the COVID-19 pandemic.
2022
Autores
Bhanu, M; Kumar, R; Roy, S; Mendes-Moreira, J; Chandra, J;
Publicação
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT I
Abstract
Capturing complex spatio-temporal features of thousands of correlated taxi-demand time-series in the city makes the traffic flow prediction problem a challenging task. Hence, several Deep Neural Network (DNN) models have been developed to mimic the latent spatio-temporal behaviour of taxi-demand time-series in a city to improve the prediction results. Despite, good performance of recent DNN based traffic prediction techniques, such models can only identify either adjacent or connected regions with direct or transitive connection; hence they fail to capture spatio-temporal correlation among regions that exhibit implicit or latent connection. Additionally, the dependency of the recent DNN models on recursive components facilitates error propagation during feature aggregation without any counter strategy for it. In view of these existing glitches, we introduce a novel DNN model, graph Multi-Head Convolution for patio-Temporal Aggregation (gMHC-STA) which supports capturing spatio-temporal correlation among regions with explicit and implicit connection both. Moreover, gMHC-STA aggregates both spatial and temporal characteristics using multi-head attention; thus overriding recursive RNN or its variant approach to prevent noise propagation. The experimental results of gMHC-STA on two real-world city taxi-demand datasets report minimum of 6.5-10% improvement over the best state-of-the-art on standard benchmark metric in varying experimental conditions.
2022
Autores
Couceiro, M; Lima, IR; Ulisses, A; Neves, TM; Moreira, JM;
Publicação
Proceedings of the 10th International Conference on Sport Sciences Research and Technology Support, icSPORTS 2022, Valletta, Malta, October 27-28, 2022.
Abstract
The broadcast of audio-video sports content is a field with increasingly larger audiences demanding higher quality content and involvement. This growth creates the necessity to develop more content to engage the users and keep this trend. Otherwise, it may stall or even diminish. Therefore, enhancing the user experience, engagement, and involvement during live sports event broadcasts is of utmost importance. This paper proposes a solution to extract event’s information from video, resorting to Computer Vision techniques and Deep Learning algorithms. More specifically, the project encompassed the definition and implementation of field registration, object detection and tracking tasks. Focusing on football sports events, a novel dataset combining several video sources was created and used for analysis and metadata extraction. In particular, the proposed solution can detect and track players with acceptable precision using state-of-the-art methods, like YOLOv5 and DeepSORT. Furthermore, resorting to unsupervised learning techniques, the system provides team segmentation based on the colour of the players’ kits. A series of visual representations regarding the players’ movements on the field enables broadcast enrichment and increased user experience. The presented solution is framed in the H2020 DataCloud project and will be deployed in a cloud environment simplifying its access and utilisation. Copyright © 2022 by SCITEPRESS – Science and Technology Publications, Lda. All rights reserved.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.