Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
Publications

Publications by José Orlando Pereira

2016

CoherentPaaS - A Coherent and Rich PaaS with a Common Programming Model

Authors
Jimenez, R; Patiño, M; Brondino, I; Vianello, V; Vilaça, R; Kolev, B; Valduriez, P; Pau, R; Hatzimanikatis, A; Spitadakis, V; Bouras, D; Panagiotakis, Y; Saloustros, G; Papagiannis, A; Férez, PG; Bilas, A; Zhang, Y; Kranas, P; Stamokostas, S; Moulos, V; Aisopos, F; Sabary, F; Cortesao, L; Regateiro, D; Pereira, J; Oliveira, R;

Publication
European Space project on Smart Systems, Big Data, Future Internet - Towards Serving the Grand Societal Challenges, Rome, Italy, April 21-28, 2016.

Abstract

2021

PAIO: A Software-Defined Storage Data Plane Framework

Authors
Macedo, R; Tanimura, Y; Haga, J; Chidambaram, V; Pereira, J; Paulo, J;

Publication
CoRR

Abstract

2020

Towards a Polyglot Data Access Layer for a Low-Code Application Development Platform

Authors
Alonso, AN; Abreu, J; Nunes, D; Vieira, A; Santos, L; Soares, T; Pereira, J;

Publication
CoRR

Abstract

2023

Taming Metadata-intensive HPC Jobs Through Dynamic, Application-agnostic QoS Control

Authors
Macedo, R; Miranda, M; Tanimura, Y; Haga, J; Ruhela, A; Harrell, SL; Evans, RT; Pereira, J; Paulo, J;

Publication
2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING, CCGRID

Abstract
Modern I/O applications that run on HPC infrastructures are increasingly becoming read and metadata intensive. However, having multiple applications submitting large amounts of metadata operations can easily saturate the shared parallel file system's metadata resources, leading to overall performance degradation and I/O unfairness. We present PADLL, an application and file system agnostic storage middleware that enables QoS control of data and metadata workflows in HPC storage systems. It adopts ideas from Software-Defined Storage, building data plane stages that mediate and rate limit POSIX requests submitted to the shared file system, and a control plane that holistically coordinates how all I/O workflows are handled. We demonstrate its performance and feasibility under multiple QoS policies using synthetic benchmarks, real-world applications, and traces collected from a production file system. Results show that PADLL can enforce complex storage QoS policies over concurrent metadata-aggressive jobs, ensuring fairness and prioritization.

2023

Towards MRAM Byte-Addressable Persistent Memory in Edge Database Systems

Authors
Ferreira, LM; Coelho, F; Pereira, JO;

Publication
Joint Proceedings of Workshops at the 49th International Conference on Very Large Data Bases (VLDB 2023), Vancouver, Canada, August 28 - September 1, 2023.

Abstract
There is a growing demand for persistent data in IoT, edge and similar resource-constrained devices. However, standard FLASH memory-based solutions present performance, energy, and reliability limitations in these applications. We propose MRAM persistent memory as an alternative to FLASH based storage. Preliminary experimental results show that its performance, power consumption, and reliability in typical database workloads is competitive for resource-constrained devices. This opens up new opportunities, as well as challenges, for small-scale database systems. MRAM is tested for its raw performance and applicability to key-value and relational database systems on resource-constrained devices. Improvements of as much as three orders of magnitude in write performance for key-value systems were observed in comparison to an alternative NAND FLASH based device. © 2023 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).

2023

LOOM: A Closed-Box Disaggregated Database System

Authors
Coelho, F; Alonso, AN; Ferreira, L; Pereira, J; Oliveira, R;

Publication
PROCEEDINGS OF12TH LATIN-AMERICAN SYMPOSIUM ON DEPENDABLE AND SECURE COMPUTING, LADC 2023

Abstract
Cloud native database systems provide highly available and scalable services as part of cloud platforms by transparently replicating and partitioning data across automatically managed resources. Some systems, such as Google Spanner, are designed and implemented from scratch. Others, such as Amazon Aurora, derive from traditional database systems for better compatibility but disaggregate storage to cloud services. Unfortunately, because they follow an open-box approach and fork the original code base, they are difficult to implement and maintain. We address this problem with Loom, a replicated and partitioned database system built on top of PostgreSQL that delegates durable storage to a distributed log native to the cloud. Unlike previous disaggregation proposals, Loom is a closed-box approach that uses the original server through existing interfaces to simplify implementation and improve robustness and maintainability. Experimental evaluation achieves 6x higher throughput and 5x lower response time than standard replication and competes with the state of the art in cloud and HPC hardware.

  • 20
  • 20