José Orlando Pereira

Cookies Policy

The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More

Institution
Research
Research Domains
Artificial Intelligence

Bioengineering

Communications

Computer Science and Engineering
Photonics

Power and Energy Systems

Robotics

Systems Engineering and Management
RESEARCH CENTERS
Porto, Portugal

+351 222 094 000

info@inesctec.pt
Innovation
Innovation / Tec4

TEC4AGRO-FOOD

TEC4ENERGY

TEC4HEALTH

TEC4INDUSTRY

TEC4SEA

TECPARTNERSHIPS

Available Technologies
Porto, Portugal

+351 222 094 000

info@inesctec.pt
Laboratories
Research Laboratories

iilab
Communication
News

Events

Media

Newsletter
Porto, Portugal

+351 222 094 000

info@inesctec.pt
Work with us
Contacts

Home
People
José Orlando Pereira

Read Full presentation

I am a researcher at HASLab and professor at the U. Minho. My research focuses on dependable distributed systems. I am interested mainly in data management, including database replication and SQL processing over NoSQL systems, and in group communication, including consensus and gossip-based protocols for large-scale systems. I am also interested in tools for testing, evaluating, and monitoring dependable distributed systems. More information is available at my personal home page.

Read Full presentation

About

I am a researcher at HASLab and professor at the U. Minho. My research focuses on dependable distributed systems. I am interested mainly in data management, including database replication and SQL processing over NoSQL systems, and in group communication, including consensus and gossip-based protocols for large-scale systems. I am also interested in tools for testing, evaluating, and monitoring dependable distributed systems. More information is available at my personal home page.

Interest
Topics

Details

Name
José Orlando Pereira
Role
Research Coordinator
Since
01st November 2011

Nationality
Portugal
Centre
High-Assurance Software
Contacts
+351253604440
jose.o.pereira@inesctec.pt

011

Publications

View all Publications

2025

CRDV: Conflict-free Replicated Data Views

Authors
Faria, N; Pereira, J;

Publication
Proc. ACM Manag. Data

Abstract
There are now multiple proposals for Conflict-free Replicated Data Types (CRDTs) in SQL databases aimed at distributed systems. Some, such as ElectricSQL, provide only relational tables as convergent replicated maps, but this omits semantics that would be useful for merging updates. Others, such as Pg\_crdt, provide access to a rich library of encapsulated column types. However, this puts merge and query processing outside the scope of the query optimizer and restricts the ability of an administrator to influence access paths with materialization and indexes. Our proposal, CRDV, overcomes this challenge by using two layers implemented as SQL views: The first provides a replicated relational table from an update history, while the second implements varied and rich types on top of the replicated table. This allows the definition of merge semantics, or even entire new data types, in SQL itself, and enables global optimization of user queries together with merge operations. Therefore, it naturally extends the scope of query optimization and local transactions to operations on replicated data, can be used to reproduce the functionality of common CRDTs with simple SQL idioms, and results in better performance than alternatives.

CloseRead Abstract Read Full Publication

2025

Towards Adaptive Transactional Consistency for Georeplicated Datastores

Authors
Braga, R; Pereira, J; Coelho, F;

Publication
40TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING

Abstract
Developers of data-intensive georeplicated applications face a difficult decision when selecting a database system. As captured by the CAP theorem, CP systems such as Spanner provide strong consistency that greatly simplifies application development. AP systems such as AntidoteDB providing Transactional Causal Consistency (TCC), ensure availability in face of network partitions and isolate performance from wide-area round-trip times, but avoid lost-update anomalies only when values can be merged. Ideally, an application should be able to adapt to current data and network conditions by selecting which transactional consistency to use for each transaction. In this paper, we test the hypothesis that a georeplicated database system can be built at its core providing only TCC, hence, being AP, but allow an application to execute some transactions under Snapshot Isolation (SI), hence CP. Our main result is showing that this can be achieved even when all the interaction happens through the TCC database system, without additional communication channels between the participants. A preliminary experimental evaluation with a proof-of-concept implementation using AntidoteDB shows that this approach is feasible.

CloseRead Abstract

2024

Databases in Edge and Fog Environments: A Survey

Authors
Ferreira, LMM; Coelho, F; Pereira, J;

Publication
ACM COMPUTING SURVEYS

Abstract
While a significant number of databases are deployed in cloud environments, pushing part or all data storage and querying planes closer to their sources (i.e., to the edge) can provide advantages in latency, connectivity, privacy, energy, and scalability. This article dissects the advantages provided by databases in edge and fog environments by surveying application domains and discussing the key drivers for pushing database systems to the edge. At the same time, it also identifies the main challenges faced by developers in this new environment and analyzes the mechanisms employed to deal with them. By providing an overview of the current state of edge and fog databases, this survey provides valuable insights into future research directions.

CloseRead Abstract

2024

When Amnesia Strikes: Understanding and Reproducing Data Loss Bugs with Fault Injection

Authors
Ramos, M; Azevedo, J; Kingsbury, K; Pereira, J; Esteves, T; Macedo, R; Paulo, J;

Publication
PROCEEDINGS OF THE VLDB ENDOWMENT

Abstract
We present LAZYFS, a new fault injection tool that simplifies the debugging and reproduction of complex data durability bugs experienced by databases, key-value stores, and other data-centric systems in crashes. Our tool simulates persistence properties of POSIX file systems (e.g., operations ordering and atomicity) and enables users to inject lost and torn write faults with a precise and controlled approach. Further, it provides profiling information about the system's operations flow and persisted data, enabling users to better understand the root cause of errors. We use LAZYFS to study seven important systems: PostgreSQL, etcd, Zookeeper, Redis, LevelDB, PebblesDB, and Lightning Network. Our fault injection campaign shows that LAZYFS automates and facilitates the reproduction of five known bug reports containing manual and complex reproducibility steps. Further, it aids in understanding and reproducing seven ambiguous bugs reported by users. Finally, LAZYFS is used to find eight new bugs, which lead to data loss, corruption, and unavailability.

CloseRead Abstract

2024

TADA: A Toolkit for Approximate Distributed Agreement

Authors
da Conceiçao, EL; Alonso, AN; Oliveira, RC; Pereira, J;

Publication
SCIENCE OF COMPUTER PROGRAMMING

Abstract
TADA is a unique toolkit designed to foster the use and implementation of approximate distributed agreement primitives. Developed in Java, TADA provides ready-to-use implementations of several approximate agreement algorithms, as well as the tools to enable programmers/researchers to easily implement further protocols: A template that enables new protocol implementations to be created by simply changing specific functions; and high-level abstractions for communication and concurrency control. As an example, the toolkit includes a ready-to-use implementation for clock synchronisation between distributed processes. Further use cases can include sensor input stabilisation and distributed machine learning, or other instances of distributed agreement where network synchrony cannot be assumed, byzantine fault tolerance may be required and a bounded divergence in decision values can be tolerated.

CloseRead Abstract

Supervised
thesis

Supervised Thesis

View all Supervised Theses

2023

User-level software-defined storage data planes

Author
Ricardo Gonçalves Macedo

Institution
UM

2023

Distributed and Dependable SDS Control Plane for HPC

Author
Mariana Martins de Sá Miranda

Institution
UM

2022

Visualização de centro de dados

Author
Pedro Miguel da Costa Capa

Institution
UM

2022

Autonomous Optimization for a Transactional Middleware

Author
Susana Vitória Sá Silva Marques

Institution
UM

2022

Distributed Game

Author
Márcio da Silva Rocha

Institution
UM

View all Supervised Theses

José Orlando Pereira

About

Details

Name

Role

Since

Nationality

Centre

Contacts

PeT

ADAPQO

AURORA

T4CDTKC

CircThread

InteGrid

InterConnect

BeFlexible

CloudDBAppliance

AIDA

PAStor

CRDV: Conflict-free Replicated Data Views

Towards Adaptive Transactional Consistency for Georeplicated Datastores

Databases in Edge and Fog Environments: A Survey

When Amnesia Strikes: Understanding and Reproducing Data Loss Bugs with Fault Injection

TADA: A Toolkit for Approximate Distributed Agreement

User-level software-defined storage data planes

Distributed and Dependable SDS Control Plane for HPC

Visualização de centro de dados

Autonomous Optimization for a Transactional Middleware

Distributed Game