2013
Authors
Costa, J; Silva, C; Ribeiro, B; Antunes, M;
Publication
2013 8TH INTERNATIONAL WORKSHOP ON SEMANTIC AND SOCIAL MEDIA ADAPTATION AND PERSONALIZATION (SMAP 2013)
Abstract
Crowdsourcing is a bubbling research topic that has the potential to be applied in numerous online and social scenarios. It consists on obtaining services or information by soliciting contributions from a large group of people. However, the question of defining the appropriate scope of a crowd to tackle each scenario is still open. In this work we compare two approaches to define the scope of a crowd in a classification problem, casted as a recommendation system. We propose a similarity measure to determine the closeness of a specific user to each crowd contributor and hence to define the appropriate crowd scope. We compare different levels of customization using crowd-based information, allowing non-experts classification by crowds to be tuned to substitute the user profile definition. Results on a real recommendation data set show the potential of making crowds more personal, i.e. of tuning the crowd to the crowdtarget.
2013
Authors
Costa, J; Silva, C; Antunes, M; Ribeiro, B;
Publication
EXPERT SYSTEMS WITH APPLICATIONS
Abstract
Traditional classification algorithms can be limited in their performance when a specific user is targeted. User preferences, e.g. in recommendation systems, constitute a challenge for learning algorithms. Additionally, in recent years user's interaction through crowdsourcing has drawn significant interest, although its use in learning settings is still underused. In this work we focus on an active strategy that uses crowd-based non-expert information to appropriately tackle the problem of capturing the drift between user preferences in a recommendation system. The proposed method combines two main ideas: to apply active strategies for adaptation to each user; to implement crowdsourcing to avoid excessive user feedback. A similitude technique is put forward to optimize the choice of the more appropriate similitude-wise crowd, under the guidance of basic user feedback. The proposed active learning framework allows non-experts classification performed by crowds to be used to define the user profile, mitigating the labeling effort normally requested to the user. The framework is designed to be generic and suitable to be applied, to different' scenarios, whilst customizable for each specific user. A case study on humor classification scenario is used to demonstrate experimentally that the approach can improve baseline active results.
2013
Authors
Costa, J; Silva, C; Antunes, M; Ribeiro, B;
Publication
ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, ICANNGA 2013
Abstract
Given the wide spread of social networks, research efforts to retrieve information using tagging from social networks communications have increased. In particular, in Twitter social network, hashtags are widely used to define a shared context for events or topics. While this is a common practice often the hashtags freely introduced by the user become easily biased. In this paper, we propose to deal with this bias defining semantic meta-hashtags by clustering similar messages to improve the classification. First, we use the user-defined hashtags as the Twitter message class labels. Then, we apply the meta-hashtag approach to boost the performance of the message classification. The meta-hashtag approach is tested in a Twitter-based dataset constructed by requesting public tweets to the Twitter API. The experimental results yielded by comparing a baseline model based on user-defined hashtags with the clustered meta-hashtag approach show that the overall classification is improved. It is concluded that by incorporating semantics in the meta-hashtag model can have impact in different applications, e.g. recommendation systems, event detection or crowdsourcing.
2013
Authors
Ferreira, P; Vinhoza, TTV; Castro, A; Mourato, F; Tavares, T; Mattos, S; Dutra, I; Coimbra, M;
Publication
2013 IEEE 26TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS)
Abstract
We evaluated a population of 7199 children between 2 and 19 years old to study the relations between the observed demographic and physiological features in the occurrence of a pathological/non-pathological heart condition. The data was collected at the Real Hospital Portugues, Pernambuco, Brazil, We performed a feature importance study, with the aim of categorizing the most relevant variables, indicative of abnormalities. Results show that second heart sound, weight, heart rate, height and secondary reason for consultation are important features, but not nearly as decisive as the presence of heart murmurs. Quantitatively speaking. systolic murmurs and a hyperphonetic second heart sound increase the odds of having a pathology by a factor of 320 and 6, respectively.
2013
Authors
Kuusisto, F; Dutra, I; Nassif, H; Wu, Y; Klein, ME; Neuman, HB; Shavlik, J; Burnside, ES;
Publication
2013 IEEE 15th International Conference on e-Health Networking, Applications and Services, Healthcom 2013
Abstract
When mammography reveals a suspicious finding, a core needle biopsy is usually recommended. In 5% to 15% of these cases, the biopsy diagnosis is non-definitive and a more invasive surgical excisional biopsy is recommended to confirm a diagnosis. The majority of these cases will ultimately be proven benign. The use of excisional biopsy for diagnosis negatively impacts patient quality of life and increases costs to the healthcare system. In this work, we employ a multi-relational machine learning approach to predict when a patient with a non-definitive core needle biopsy diagnosis need not undergo an excisional biopsy procedure because the risk of malignancy is low. © 2013 IEEE.
2013
Authors
Martinez Angeles, CA; Dutra, I; Costa, VS; Buenabad Chavez, J;
Publication
Technische Berichte des Instituts fur Informatik der Christian-Albrechts-Universitat zu Kiel
Abstract
We present the design and evaluation of a Datalog engine for execution in Graphics Processing Units (GPUs). The engine evaluates recursive and non-recursive Datalog queries using a bottom-up approach based on typical relational operators. It includes a memory management scheme that automatically swaps data between memory in the host platform (a multicore) and memory in the GPU in order to reduce the number of memory transfers. To evaluate the performance of the engine, three Datalog queries were run on the engine and on a single CPU in the multicore host. One query runs up to 200 times faster on the (GPU) engine than on the CPU. © Springer-Verlag Berlin Heidelberg 2011.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.