Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
Publications

Publications by HASLab

2021

The CoronaSurveys System for COVID-19 Incidence Data Collection and Processing

Authors
Baquero, C; Casari, P; Anta, AF; Garcia Garcia, A; Frey, D; Garcia Agundez, A; Georgiou, C; Girault, B; Ortega, A; Goessens, M; Hernandez Roig, HA; Nicolaou, N; Stavrakis, E; Ojo, O; Roberts, JC; Sanchez, I;

Publication
FRONTIERS IN COMPUTER SCIENCE

Abstract
CoronaSurveys is an ongoing interdisciplinary project developing a system to infer the incidence of COVID-19 around the world using anonymous open surveys. The surveys have been translated into 60 languages and are continuously collecting participant responses from any country in the world. The responses collected are pre-processed, organized, and stored in a version-controlled repository, which is publicly available to the scientific community. In addition, the CoronaSurveys team has devised several estimates computed on the basis of survey responses and other data, and makes them available on the project's website in the form of tables, as well as interactive plots and maps. In this paper, we describe the computational system developed for the CoronaSurveys project. The system includes multiple components and processes, including the web survey, the mobile apps, the cleaning and aggregation process of the survey responses, the process of storage and publication of the data, the processing of the data and the computation of estimates, and the visualization of the results. In this paper we describe the system architecture and the major challenges we faced in designing and deploying it.

2021

Efficient Replication via Timestamp Stability

Authors
Enes, V; Baquero, C; Gotsman, A; Sutra, P;

Publication
PROCEEDINGS OF THE SIXTEENTH EUROPEAN CONFERENCE ON COMPUTER SYSTEMS (EUROSYS '21)

Abstract
Modern web applications replicate their data across the globe and require strong consistency guarantees for their most critical data. These guarantees are usually provided via state-machine replication (SMR). Recent advances in SMR have focused on leaderless protocols, which improve the availability and performance of traditional Paxos-based solutions. We propose Tempo - a leaderless SMR protocol that, in comparison to prior solutions, achieves superior throughput and offers predictable performance even in contended workloads. To achieve these benefits, Tempo timestamps each application command and executes it only after the timestamp becomes stable, i.e., all commands with a lower timestamp are known. Both the timestamping and stability detection mechanisms are fully decentralized, thus obviating the need for a leader replica. Our protocol furthermore generalizes to partial replication settings, enabling scalability in highly parallel workloads. We evaluate the protocol in both real and simulated geo-distributed environments and demonstrate that it outperforms state-of-the-art alternatives.

2021

Seeking Out Camille, and Being Open to Others

Authors
Hill, RK; Baquero, C;

Publication
COMMUNICATIONS OF THE ACM

Abstract
Robin K. Hill on overcoming biases against alternative views, and Carlos Baquero on his search for the elusive Camille Nous.

2021

Estimating the COVID-19 Prevalence in Spain With Indirect Reporting via Open Surveys

Authors
Garcia Agundez, A; Ojo, O; Hernandez Roig, HA; Baquero, C; Frey, D; Georgiou, C; Goessens, M; Lillo, RE; Menezes, R; Nicolaou, N; Ortega, A; Stavrakis, E; Anta, AF;

Publication
FRONTIERS IN PUBLIC HEALTH

Abstract
During the initial phases of the COVID-19 pandemic, accurate tracking has proven unfeasible. Initial estimation methods pointed toward case numbers that were much higher than officially reported. In the CoronaSurveys project, we have been addressing this issue using open online surveys with indirect reporting. We compare our estimates with the results of a serology study for Spain, obtaining high correlations (R squared 0.89). In our view, these results strongly support the idea of using open surveys with indirect reporting as a method to broadly sense the progress of a pandemic.

2021

Estimating Active Cases of COVID-19

Authors
Álvarez, J; Baquero, C; Cabana, E; Champati, JP; Anta, AF; Frey, D; Agundez, AG; Georgiou, C; Goessens, M; Hernández, H; Lillo, RE; Menezes, R; Moreno, R; Nicolaou, N; Ojo, O; Ortega, A; Rufino, J; Stavrakis, E; Jeevan, G; Glorioso, C;

Publication
CoRR

Abstract
AbstractHaving accurate and timely data on active COVID-19 cases is challenging, since it depends on the availability of an appropriate infrastructure to perform tests and aggregate their results. In this paper, we consider a case to be active if it is infectious, and we propose methods to estimate the number of active infectious cases of COVID-19 from the official data (of confirmed cases and fatalities) and from public survey data. We show that the latter is a viable option in countries with reduced testing capacity or infrastructures.

2021

Efficient Replication via Timestamp Stability (Extended Version)

Authors
Enes, V; Baquero, C; Gotsman, A; Sutra, P;

Publication
CoRR

Abstract

  • 39
  • 251