Cookies
O website necessita de alguns cookies e outros recursos semelhantes para funcionar. Caso o permita, o INESC TEC irá utilizar cookies para recolher dados sobre as suas visitas, contribuindo, assim, para estatísticas agregadas que permitem melhorar o nosso serviço. Ver mais
Aceitar Rejeitar
  • Menu
Publicações

Publicações por LIAAD

2019

Prediction Model for Prevalence of Type-2 Diabetes Complications with ANN Approach Combining with K-Fold Cross Validation and K-Means Clustering

Autores
Munna, MTA; Alam, MM; Allayear, SM; Sarker, K; Ara, SJF;

Publicação
Lecture Notes in Networks and Systems - Advances in Information and Communication

Abstract

2019

Prediction model for prevalence of type-2 diabetes complications with ann approach combining with K-fold cross validation and K-means clustering

Autores
Munna M.T.A.; Alam M.M.; Allayear S.M.; Sarker K.; Ara S.J.F.;

Publicação
Advances in Intelligent Systems and Computing

Abstract
In today’s era, most of the people are suffering with chronic diseases because of their lifestyle, food habits and reduction in physical activities. Diabetes is one of the most common chronic diseases which has affected to the people of all ages. Diabetes complication arises in human body due to increase of blood glucose (sugar) level than the normal level. Type-2 diabetes is considered as one of the most prevalent endocrine disorders. In this circumstance, we have tried to apply Machine learning algorithm to create the statistical prediction based model that people having diabetes can be aware of their prevalence. The aim of this paper is to detect the prevalence of diabetes relevant complications among patients with Type-2 diabetes mellitus. The processing and statistical analysis we used are Scikit-Learn, and Pandas for Python. We also have used unsupervised Machine Learning approaches known as Artificial Neural Network (ANN) and K-means Clustering for developing classification system based prediction model to judge Type-2 diabetes mellitus chronic diseases.

2019

An Iterative Oversampling Approach for Ordinal Classification

Autores
Marques, F; Duarte, H; Santos, J; Domingues, I; Amorim, JP; Abreu, PH;

Publicação
SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING

Abstract
The machine learning field has grown considerably in the last years. There are, however, some problems still to be solved. The characteristics of the training sets, for instance, are known to affect the classifiers performance. Here, and inspired by medical applications, we are interested in studying datasets that are both ordinal and imbalanced. Ordinal datasets present labels where only the relative ordering between different values is significant. Imbalanced datasets have very different quantity of examples per class. Building upon our previous work, we make three new contributions, (1) extend the number of classifiers, (2) evaluate two techniques to balance intermediate train sets in binary decomposition methods (often used in multi-class contexts and ordinal ones in particular), and (3) propose a new, iterative, classifier-based oversampling algorithm that we name InCuBAtE. Experiments were made on 6 private datasets, concerning the assessment of response to treatment on oncologic diseases, and 15 public datasets widely used in the literature. When compared with our previous work, results have improved (or remained the same) for 4 of the 6 private datasets and for 11 out of the 15 public datasets.

2019

Generating Synthetic Missing Data: A Review by Missing Mechanism

Autores
Santos, MS; Pereira, RC; Costa, AF; Soares, JP; Santos, J; Abreu, PH;

Publicação
IEEE ACCESS

Abstract
The performance evaluation of imputation algorithms often involves the generation of missing values. Missing values can be inserted in only one feature (univariate configuration) or in several features (multivariate configuration) at different percentages (missing rates) and according to distinct missing mechanisms, namely, missing completely at random, missing at random, and missing not at random. Since the missing data generation process defines the basis for the imputation experiments (configuration, missing rate, and missing mechanism), it is essential that it is appropriately applied; otherwise, conclusions derived from ill-defined setups may be invalid. The goal of this paper is to review the different approaches to synthetic missing data generation found in the literature and discuss their practical details, elaborating on their strengths and weaknesses. Our analysis revealed that creating missing at random and missing not at random scenarios in datasets comprising qualitative features is the most challenging issue in the related work and, therefore, should be the focus of future work in the field.

2019

Computer Vision in Esophageal Cancer: A Literature Review

Autores
Domingues, I; Sampaio, IL; Duarte, H; Santos, JAM; Abreu, PH;

Publicação
IEEE ACCESS

Abstract
Esophageal cancer is a disease with a high prevalence that can be evaluated by a variety of imaging modalities, including endoscopy, computed tomography, and positron emission tomography. Computer-aided techniques could provide a valuable help in the analysis of these images, decreasing the medical workflow time and human errors. The goal of this paper is to review the existing literature on the application of computer vision techniques in the domain of esophageal cancer. After an initial phase where a set of keywords was chosen, the selected terms were used to retrieve papers from four well-known databases: Web of Science, Scopus, PubMed, and Springer. The results were scanned by merging identical entries, and eliminating the out of scope works, resulting in 47 selected papers. These were organized according to the image modality. Major results were then summarized and compared, and main shortcomings were identified. It could be concluded that, even though the scientific community has already paid attention to the esophageal cancer problem, there are still several open issues. Two majorfindings of this review are the nonexistence of works on MRI data and the under-exploration of recent techniques using deep learning strategies, showing the need for further investigation.

2019

Multiple-Choice Questions in Programming Courses: Can We Use Them and Are Students Motivated by Them?

Autores
Abreu, PH; Silva, DC; Gomes, A;

Publicação
ACM TRANSACTIONS ON COMPUTING EDUCATION

Abstract
Low performance of nontechnical engineering students in programming courses is a problem that remains unsolved. Over the years, many authors have tried to identify the multiple causes for that failure, but there is unanimity on the fact that motivation is a key factor for the acquisition of knowledge by students. To better understand motivation, a new evaluation strategy has been adopted in a second programming course of a nontechnical degree, consisting of 91 students. The goals of the study were to identify if those students felt more motivated to answer multiple-choice questions in comparison to development questions, and what type of question better allows for testing student knowledge acquisition. Possibilities around the motivational qualities of multiple-choice questions in programming courses will be discussed in light of the results. In conclusion, it seems clear that student performance varies according to the type of question. Our study points out that multiple-choice questions can be seen as a motivational factor for engineering students and it might also be a good way to test acquired programming concepts. Therefore, this type of question could be further explored in the evaluation points.

  • 150
  • 429