Publicacoes - INESC TEC

Publicações

Publicações por LIAAD

2018

Relative Direction: Location Path Providing Method for Allied Intelligent Agent

Autores
Kabir, SR; Alam, MM; Allayear, SM; Munna, MTA; Hossain, SS; Rahman, SSMM;

Publicação
Communications in Computer and Information Science - Advances in Computing and Data Sciences

Abstract

2018

Haar Cascade Classifier and Lucas–Kanade Optical Flow Based Realtime Object Tracker with Custom Masking Technique

Autores
Mohiuddin, K; Alam, MM; Das, AK; Munna, MTA; Allayear, SM; Ali, MH;

Publicação
Advances in Intelligent Systems and Computing - Advances in Information and Communication Networks

Abstract

2018

A computational technique for intelligent computers to learn and identify the human's relative directions

Autores
Kabir S.; Allayear S.; Alam M.; Munna M.;

Publicação
Proceedings of the International Conference on Intelligent Sustainable Systems, ICISS 2017

Abstract
The most broadly perceived relative directions are right, left, up, down, backward and forward. This research paper presents a new computational technique to learn human's relative directions, where one intelligent computer can learn any human's right, left, up, down, backward and forward or different relative directions. The present paper portrays models describing the essential structures of relative direction learning process between human and intelligent machine. We developed two proficient algorithms for solving this approach. In our experiment we propose Human Relative Direction Learning (HRDL) algorithm for learning human's relative directions and Human Direction Identification (HDI) algorithm for tracking any human position and identity human's relative directions from different direction points.

FecharLer Abstract

2018

Cross-Validation for Imbalanced Datasets: Avoiding Overoptimistic and Overfitting Approaches

Autores
Santos, MS; Soares, JP; Abreu, PH; Araujo, H; Santos, J;

Publicação
IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE

Abstract
Although cross-validation is a standard procedure for performance evaluation, its joint application with oversampling remains an open question for researchers farther from the imbalanced data topic. A frequent experimental flaw is the application of oversampling algorithms to the entire dataset, resulting in biased models and overly-optimistic estimates. We emphasize and distinguish overoptimism from overfitting, showing that the former is associated with the cross-validation procedure, while the latter is influenced by the chosen oversampling algorithm. Furthermore, we perform a thorough empirical comparison of well-established oversampling algorithms, supported by a data complexity analysis. The best oversampling techniques seem to possess three key characteristics: use of cleaning procedures, cluster-based example synthetization and adaptive weighting of minority examples, where Synthetic Minority Oversampling Technique coupled with Tomek Links and Majority Weighted Minority Oversampling Technique stand out, being capable of increasing the discriminative power of data.

FecharLer Abstract

2018

BI-RADS CLASSIFICATION OF BREAST CANCER: A NEW PRE-PROCESSING PIPELINE FOR DEEP MODELS TRAINING

Autores
Domingues, I; Abreu, PH; Santos, J;

Publicação
2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP)

Abstract
One of the main difficulties in the use of deep learning strategies in medical contexts is the training set size. While these methods need large annotated training sets, these datasets are costly to obtain in medical contexts and suffer from intra and inter-subject variability. In the present work, two new pre-processing techniques are introduced to improve a deep classifier performance. First, data augmentation based on co-registration is suggested. Then, multi-scale enhancement based on Difference of Gaussians is proposed. Results are accessed in a public mammogram database, the InBreast, in the context of an ordinal problem, the BI-RADS classification. Moreover, a pre-trained Convolutional Neural Network with the AlexNet architecture was used as a base classifier. The multi-class classification experiments show that the proposed pipeline with the Difference of Gaussians and the data augmentation technique outperforms using the original dataset only and using the original dataset augmented by mirroring the images.

FecharLer Abstract

2018

Exploring the effects of data distribution in missing data imputation

Autores
Pompeu Soares, J; Seoane Santos, M; Henriques Abreu, P; Araújo, H; Santos, J;

Publicação
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Abstract
In data imputation problems, researchers typically use several techniques, individually or in combination, in order to find the one that presents the best performance over all the features comprised in the dataset. This strategy, however, neglects the nature of data (data distribution) and makes impractical the generalisation of the findings, since for new datasets, a huge number of new, time consuming experiments need to be performed. To overcome this issue, this work aims to understand the relationship between data distribution and the performance of standard imputation techniques, providing a heuristic on the choice of proper imputation methods and avoiding the needs to test a large set of methods. To this end, several datasets were selected considering different sample sizes, number of features, distributions and contexts and missing values were inserted at different percentages and scenarios. Then, different imputation methods were evaluated in terms of predictive and distributional accuracy. Our findings show that there is a relationship between features’ distribution and algorithms’ performance, and that their performance seems to be affected by the combination of missing rate and scenario at state and also other less obvious factors such as sample size, goodness-of-fit of features and the ratio between the number of features and the different distributions comprised in the dataset. © Springer Nature Switzerland AG 2018.

FecharLer Abstract