Publications

Publications by LIAAD

2018

A Text Feature Based Automatic Keyword Extraction Method for Single Documents

Authors
Campos, R; Mangaravite, V; Pasquali, A; Jorge, AM; Nunes, C; Jatowt, A

Publication
ADVANCES IN INFORMATION RETRIEVAL (ECIR 2018)

Abstract
In this work, we propose a lightweight approach for keyword extraction and ranking based on an unsupervised methodology to select the most important keywords of a single document. To understand the merits of our proposal, we compare it against the RAKE, TextRank and SingleRank methods (three well-known unsupervised approaches) and the TF-IDF baseline, over four different collections to illustrate the generality of our approach. The experimental results suggest that extracting keywords from documents using our method yields superior effectiveness compared to similar approaches.
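
For orientation, the TF-IDF baseline mentioned in the abstract can be sketched as follows. This is a minimal illustration, not the authors' experimental setup: the tokenizer, the unsmoothed `log(N / df)` weighting, the helper names and the toy documents are all assumptions made for the example. It also shows why TF-IDF depends on a document collection, whereas the proposed method works on a single document.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens (assumed tokenizer)."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_keywords(docs, doc_index, top_k=3):
    """Rank the terms of docs[doc_index] by TF-IDF computed over the collection."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    tf = Counter(tokenized[doc_index])
    total = sum(tf.values())
    scores = {
        term: (count / total) * math.log(n_docs / df[term])
        for term, count in tf.items()
    }
    # Sort by descending score; break ties alphabetically for determinism.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_k]


# Toy collection, invented for this example only.
docs = [
    "keyword extraction ranks keyword candidates of the document",
    "the recommender system predicts the items of interest",
    "the workshop covers narrative extraction from the text",
]
top_terms = tfidf_keywords(docs, doc_index=0)
```

A term that occurs in every document, such as "the" here, receives a score of zero, while a term repeated only in the target document rises to the top of the ranking.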

2018

YAKE! Collection-Independent Automatic Keyword Extractor

Authors
Campos, R; Mangaravite, V; Pasquali, A; Jorge, AM; Nunes, C; Jatowt, A

Publication
ADVANCES IN INFORMATION RETRIEVAL (ECIR 2018)

Abstract
In this paper, we present YAKE!, a novel feature-based system for multi-lingual keyword extraction from single documents, which supports texts of different sizes, domains or languages. Unlike most systems, YAKE! does not rely on dictionaries or thesauri, nor is it trained on any corpus. Instead, we follow an unsupervised approach which builds upon features extracted from the text, thus making it applicable to documents written in many different languages without the need for external knowledge. This can be beneficial for a large number of tasks and a plethora of situations where access to training corpora is either limited or restricted. In this demo, we offer an easy-to-use, interactive session, where users from both academia and industry can try our system, either by using a sample document or by introducing their own text. As an add-on, we compare our extracted keywords against the output produced by the IBM Natural Language Understanding (IBM NLU) and RAKE systems. The YAKE! demo is available at http://bit.ly/YakeDemoECIR2018. A Python implementation of YAKE! is also available on the PyPI repository (https://pypi.python.org/pypi/yake/).

2018

Forgetting techniques for stream-based matrix factorization in recommender systems

Authors
Matuszyk, P; Vinagre, J; Spiliopoulou, M; Jorge, AM; Gama, J

Publication
KNOWLEDGE AND INFORMATION SYSTEMS

Abstract
Forgetting is often considered a malfunction of intelligent agents; however, in a changing world forgetting has an essential advantage. It provides a means of adapting to changes by removing the effects of obsolete (not necessarily old) information from models. This also applies to intelligent systems, such as recommender systems, which learn users' preferences and predict future items of interest. In this work, we present unsupervised forgetting techniques that make recommender systems adapt to changes in users' preferences over time. We propose eleven techniques that select obsolete information and three algorithms that enforce the forgetting in different ways. In our evaluation on real-world datasets, we show that forgetting obsolete information significantly improves the predictive power of recommender systems.
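
To make the idea of selecting information to forget concrete, here is a minimal sketch of the simplest conceivable criterion: a per-user sliding window that keeps only the most recent interactions. It is an assumption made for illustration and is not claimed to be one of the paper's eleven techniques; note that it equates "obsolete" with "old", which the abstract explicitly says need not hold. The class name `SlidingWindowProfile` and its methods are hypothetical, and the step that removes the forgotten item's effect from the matrix factorization model is omitted.

```python
from collections import defaultdict, deque


class SlidingWindowProfile:
    """Keeps only each user's most recent interactions (hypothetical helper)."""

    def __init__(self, window_size):
        self.window_size = window_size
        # One bounded queue per user; created lazily on first access.
        self.history = defaultdict(lambda: deque(maxlen=window_size))

    def observe(self, user, item):
        """Record a new interaction and return the item forgotten, if any."""
        window = self.history[user]
        forgotten = window[0] if len(window) == self.window_size else None
        window.append(item)  # a full deque drops its oldest entry automatically
        return forgotten

    def profile(self, user):
        """Return the interactions currently retained for this user."""
        return list(self.history[user])


# Stream of interactions for a single, invented user.
profiles = SlidingWindowProfile(window_size=3)
for item in ["a", "b", "c", "d"]:
    forgotten = profiles.observe("user-1", item)
```

In a stream-based recommender, the returned `forgotten` item would be handed to whatever routine enforces the forgetting in the model; that routine is outside the scope of this sketch.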

2018

Proceedings of the First Workshop on Narrative Extraction From Text (Text2Story 2018) co-located with 40th European Conference on Information Retrieval (ECIR 2018), Grenoble, France, March 26, 2018

Authors
Jorge, AM; Campos, R; Jatowt, A; Nunes, S

Publication
Text2Story@ECIR

Abstract

2018

First International Workshop on Narrative Extraction from Texts: Text2Story 2018

Authors
Jorge, AM; Campos, R; Jatowt, A; Nunes, S

Publication
ADVANCES IN INFORMATION RETRIEVAL (ECIR 2018)

Abstract

2018

Preface

Authors
Jorge, AM; Campos, R; Jatowt, A; Nunes, S

Publication
CEUR Workshop Proceedings

Abstract
