Publicacoes - INESC TEC

Publicações

Publicações por Alípio Jorge

2020

Sentence Compression for Portuguese

Autores
Nobrega, FAA; Jorge, AM; Brazdil, P; Pardo, TAS;

Publicação
COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2020

Abstract
The task of Sentence Compression aims at producing a shorter version of a given sentence. This task may assist many other applications, as Automatic Summarization and Text Simplification. In this paper, we investigate methods for Sentence Compression for Portuguese. We focus on machine learning-based algorithms and propose new strategies. We also create reference corpora/datasets for the area, allowing to train and to test the methods of interest. Our results show that some of our methods outperform previous initiatives for Portuguese and produce competitive results with a state of the art method in the area.

FecharLer Abstract

2019

Dataset Morphing to Analyze the Performance of Collaborative Filtering

Autores
Correia, A; Soares, C; Jorge, A;

Publicação
Discovery Science - 22nd International Conference, DS 2019, Split, Croatia, October 28-30, 2019, Proceedings

Abstract
Machine Learning algorithms are often too complex to be studied from a purely analytical point of view. Alternatively, with a reasonably large number of datasets one can empirically observe the behavior of a given algorithm in different conditions and hypothesize some general characteristics. This knowledge about algorithms can be used to choose the most appropriate one given a new dataset. This very hard problem can be approached using metalearning. Unfortunately, the number of datasets available may not be sufficient to obtain reliable meta-knowledge. Additionally, datasets may change with time, by growing, shrinking and editing, due to natural actions like people buying in a e-commerce site. In this paper we propose dataset morphing as the basis of a novel methodology that can help overcome these drawbacks and can be used to better understand ML algorithms. It consists of manipulating real datasets through the iterative application of gradual transformations (morphing) and by observing the changes in the behavior of learning algorithms while relating these changes with changes in the meta features of the morphed datasets. Although dataset morphing can be envisaged in a much wider framework, we focus on one very specific instance: the study of collaborative filtering algorithms on binary data. Results show that the proposed approach is feasible and that it can be used to identify useful metafeatures to predict the best collaborative filtering algorithm for a given dataset. © Springer Nature Switzerland AG 2019.

FecharLer Abstract

2019

Incremental Multi-Dimensional Recommender Systems: Co-Factorization vs Tensors

Autores
Ramalho, MS; Vinagre, J; Jorge, AM; Bastos, R;

Publicação
2nd Workshop on Online Recommender Systems and User Modeling, ORSUM@RecSys 2019, 19 September 2019, Copenhagen, Denmark

Abstract
The present paper sets a milestone on incremental recommender systems approaches by comparing several state-of-the-art algorithms with two different mathematical foundations - matrix and tensor factorization. Traditional Pairwise Interaction Tensor Factorization is revisited and converted into a scalable and incremental option that yields the best predictive power. A novel tensor inspired approach is described. Finally, experiments compare contextless vs context-aware scenarios, the impact of noise on the algorithms, discrepancies between time complexity and execution times, and are run on five different datasets from three different recommendation areas - music, gross retail and garment. Relevant conclusions are drawn that aim to help choosing the most appropriate algorithm to use when faced with a novel recommender tasks. © 2019 M.S. Ramalho, J. Vinagre, A.M. Jorge & R. Bastos.

FecharLer Abstract

2019

2nd Workshop on Online Recommender Systems and User Modeling, ORSUM@RecSys 2019, 19 September 2019, Copenhagen, Denmark

Autores
Vinagre, J; Jorge, AM; Bifet, A; Ghossein, MA;

Publicação
ORSUM@RecSys

Abstract

2018

Affordance Extraction and Inference based on Semantic Role Labeling

Autores
Loureiro, D; Jorge, A;

Publicação
Proceedings of the First Workshop on Fact Extraction and VERification, FEVER@EMNLP 2018, Brussels, Belgium, November 1, 2018

Abstract

2020

ORSUM - Workshop on Online Recommender Systems and User Modeling

Autores
Vinagre, J; Jorge, AM; Ghossein, MA; Bifet, A;

Publicação
RecSys 2020: Fourteenth ACM Conference on Recommender Systems, Virtual Event, Brazil, September 22-26, 2020

Abstract
Modern online web-based systems continuously generate data at very fast rates. This continuous flow of data encompasses web content - e.g. posts, news, products, comments -, but also user feedback - e.g. ratings, views, reads, clicks, thumbs up -, as well as context information - device used, geographic info, social network, current user activity, weather. This is potentially overwhelming for systems and algorithms design to train in offline batches, given the continuous and potentially fast change of content, context and user preferences. Therefore it is important to investigate online methods to be able to transparently adapt to the inherent dynamics of online systems. Incremental models that learn from data streams are gaining attention in the recommender systems community, given their natural ability to deal with data generated in dynamic, complex environments. User modeling and personalization can particularly benefit from algorithms capable of maintaining models incrementally and online, as data is generated. The objective of this workshop is to foster contributions and bring together a growing community of researchers and practitioners interested in online, adaptive approaches to user modeling, recommendation and personalization, as well as other related tasks, such as evaluation, reproducibility, privacy and explainability. © 2020 Owner/Author.

FecharLer Abstract