2020
Autores
Nobrega, FAA; Jorge, AM; Brazdil, P; Pardo, TAS;
Publicação
COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2020
Abstract
The task of Sentence Compression aims at producing a shorter version of a given sentence. This task may assist many other applications, as Automatic Summarization and Text Simplification. In this paper, we investigate methods for Sentence Compression for Portuguese. We focus on machine learning-based algorithms and propose new strategies. We also create reference corpora/datasets for the area, allowing to train and to test the methods of interest. Our results show that some of our methods outperform previous initiatives for Portuguese and produce competitive results with a state of the art method in the area.
2019
Autores
Correia, A; Soares, C; Jorge, A;
Publicação
Discovery Science - 22nd International Conference, DS 2019, Split, Croatia, October 28-30, 2019, Proceedings
Abstract
Machine Learning algorithms are often too complex to be studied from a purely analytical point of view. Alternatively, with a reasonably large number of datasets one can empirically observe the behavior of a given algorithm in different conditions and hypothesize some general characteristics. This knowledge about algorithms can be used to choose the most appropriate one given a new dataset. This very hard problem can be approached using metalearning. Unfortunately, the number of datasets available may not be sufficient to obtain reliable meta-knowledge. Additionally, datasets may change with time, by growing, shrinking and editing, due to natural actions like people buying in a e-commerce site. In this paper we propose dataset morphing as the basis of a novel methodology that can help overcome these drawbacks and can be used to better understand ML algorithms. It consists of manipulating real datasets through the iterative application of gradual transformations (morphing) and by observing the changes in the behavior of learning algorithms while relating these changes with changes in the meta features of the morphed datasets. Although dataset morphing can be envisaged in a much wider framework, we focus on one very specific instance: the study of collaborative filtering algorithms on binary data. Results show that the proposed approach is feasible and that it can be used to identify useful metafeatures to predict the best collaborative filtering algorithm for a given dataset. © Springer Nature Switzerland AG 2019.
2019
Autores
Ramalho, MS; Vinagre, J; Jorge, AM; Bastos, R;
Publicação
2nd Workshop on Online Recommender Systems and User Modeling, ORSUM@RecSys 2019, 19 September 2019, Copenhagen, Denmark
Abstract
The present paper sets a milestone on incremental recommender systems approaches by comparing several state-of-the-art algorithms with two different mathematical foundations - matrix and tensor factorization. Traditional Pairwise Interaction Tensor Factorization is revisited and converted into a scalable and incremental option that yields the best predictive power. A novel tensor inspired approach is described. Finally, experiments compare contextless vs context-aware scenarios, the impact of noise on the algorithms, discrepancies between time complexity and execution times, and are run on five different datasets from three different recommendation areas - music, gross retail and garment. Relevant conclusions are drawn that aim to help choosing the most appropriate algorithm to use when faced with a novel recommender tasks. © 2019 M.S. Ramalho, J. Vinagre, A.M. Jorge & R. Bastos.
2019
Autores
Vinagre, J; Jorge, AM; Bifet, A; Ghossein, MA;
Publicação
ORSUM@RecSys
Abstract
2018
Autores
Loureiro, D; Jorge, A;
Publicação
Proceedings of the First Workshop on Fact Extraction and VERification, FEVER@EMNLP 2018, Brussels, Belgium, November 1, 2018
Abstract
2020
Autores
Vinagre, J; Jorge, AM; Ghossein, MA; Bifet, A;
Publicação
RecSys 2020: Fourteenth ACM Conference on Recommender Systems, Virtual Event, Brazil, September 22-26, 2020
Abstract
Modern online web-based systems continuously generate data at very fast rates. This continuous flow of data encompasses web content - e.g. posts, news, products, comments -, but also user feedback - e.g. ratings, views, reads, clicks, thumbs up -, as well as context information - device used, geographic info, social network, current user activity, weather. This is potentially overwhelming for systems and algorithms design to train in offline batches, given the continuous and potentially fast change of content, context and user preferences. Therefore it is important to investigate online methods to be able to transparently adapt to the inherent dynamics of online systems. Incremental models that learn from data streams are gaining attention in the recommender systems community, given their natural ability to deal with data generated in dynamic, complex environments. User modeling and personalization can particularly benefit from algorithms capable of maintaining models incrementally and online, as data is generated. The objective of this workshop is to foster contributions and bring together a growing community of researchers and practitioners interested in online, adaptive approaches to user modeling, recommendation and personalization, as well as other related tasks, such as evaluation, reproducibility, privacy and explainability. © 2020 Owner/Author.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.