Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
About

About

Ricardo Campos is a Professor at the Universidade da Beira Interior (UBI) and lecturer at the Porto Business School (PBS). He is a senior researcher of LIAAD-INESC TEC, the Artificial Intelligence and Decision Support Lab of U. Porto, and a collaborator of Ci2.ipt, the Smart Cities Research Center of the Polytechnic Institute of Tomar. He is PhD in Computer Science by the University of Porto (U. Porto), being also a former student of the Universidade da Beira Interior (UBI). He has more than 10 years of experience in Information Retrieval (IR) and Natural Language Processing (NLP), period during which his research has been recognized with multiple awards in international conferences and scientific competitions. He is the leading author of the highly impactful YAKE! keyword extractor toolkit, of the Tell me Stories project and of the Arquivo Público, among other software. His current research focuses on developing methods concerned the process of narrative extraction from texts. He has participated in several research projects and is particularly interested in practical approaches regarding the relationship behind entities, events and temporal aspects, as a means to make sense of unstructured data. He is an editorial board member of the International Journal of Data Science and Analytics (Springer) and of the Information Processing and Management Journal (Elsevier), co-chaired international conferences and workshops, and is a regular member of the scientific committee of several international conferences. He is also a member of the Scientific Advisory Forum of the Portulan Clarin - Research Infrastructure for the Science and Technology of Language. For more info please click here.

Interest
Topics
Details

Details

  • Name

    Ricardo Campos
  • Role

    Senior Researcher
  • Since

    01st July 2012
002
Publications

2024

Indexing Portuguese NLP Resources with PT-Pump-Up

Authors
Almeida, R; Campos, R; Jorge, A; Nunes, S;

Publication
CoRR

Abstract

2024

Physio: An LLM-Based Physiotherapy Advisor

Authors
Almeida, R; Sousa, H; Cunha, LF; Guimaraes, N; Campos, R; Jorge, A;

Publication
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT V

Abstract
The capabilities of the most recent language models have increased the interest in integrating them into real-world applications. However, the fact that these models generate plausible, yet incorrect text poses a constraint when considering their use in several domains. Healthcare is a prime example of a domain where text-generative trustworthiness is a hard requirement to safeguard patient well-being. In this paper, we present Physio, a chat-based application for physical rehabilitation. Physio is capable of making an initial diagnosis while citing reliable health sources to support the information provided. Furthermore, drawing upon external knowledge databases, Physio can recommend rehabilitation exercises and over-the-counter medication for symptom relief. By combining these features, Physio can leverage the power of generative models for language processing while also conditioning its response on dependable and verifiable sources. A live demo of Physio is available at https://physio.inesctec.pt.

2024

The 7th International Workshop on Narrative Extraction from Texts: Text2Story 2024

Authors
Campos, R; Jorge, A; Jatowt, A; Bhatia, S; Litvak, M;

Publication
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT V

Abstract
The Text2Story Workshop series, dedicated to Narrative Extraction from Texts, has been running successfully since 2018. Over the past six years, significant progress, largely propelled by Transformers and Large Language Models, has advanced our understanding of natural language text. Nevertheless, the representation, analysis, generation, and comprehensive identification of the different elements that compose a narrative structure remains a challenging objective. In its seventh edition, the workshop strives to consolidate a common platform and a multidisciplinary community for discussing and addressing various issues related to narrative extraction tasks. In particular, we aim to bring to the forefront the challenges involved in understanding narrative structures and integrating their representation into established frameworks, as well as in modern architectures (e.g., transformers) and AI-powered language models (e.g., chatGPT) which are now common and form the backbone of almost every IR and NLP application. Text2Story encompasses sessions covering full research papers, work-in-progress, demos, resources, position and dissemination papers, along with keynote talks. Moreover, there is dedicated space for informal discussions on methods, challenges, and the future of research in this dynamic field.

2024

Pre-trained language models: What do they know?

Authors
Guimarães, N; Campos, R; Jorge, A;

Publication
WIREs Data. Mining. Knowl. Discov.

Abstract

2023

Public News Archive: A Searchable Sub-archive to Portuguese Past News Articles

Authors
Campos, R; Correia, D; Jatowt, A;

Publication
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT III

Abstract
Over the past fewdecades, the amount of information generated turned the Web into the largest knowledge infrastructure existing to date. Web archives have been at the forefront of data preservation, preventing the losses of significant data to humankind. Different snapshots of the web are saved everyday enabling users to surf the past web and to travel through this overtime. Despite these efforts, many people are not aware that the web is being preserved, often finding these infrastructures to be unattractive or difficult to use, when compared to common search engines. In this paper, we give a step towards making use of this preserved information to develop Public Archive an intuitive interface that enables end-users to search and analyze a large-scale of 67,242 past preserved news articles belonging to a Portuguese reference newspaper (Jornal Publico). The referred collection was obtained by scraping 10,976 versions of the homepage of the Jornal Publico preserved by the Portuguese web archive infrastructure (Arquivo.pt) during the time-period of 2010 to 2021. By doing this, we aim, not only to mark a stand in what respects to make use of this preserved information, but also to come up with an easy-to-follow solution, the Public Archive python package, which creates the roots to be used (with minor adaptations) by other news source providers interested in offering their readers access to past news articles.