Paula Brito

I am an Associate Professor at the School of Economics of the University of Porto, where I teach Statistics and Multivariate Data Analysis at undergraduate and postgraduate (Master, PhD) levels, and a member of the Artificial Intelligence and Decision Support Lab (LIAAD) of INESC TEC. I hold a doctorate in Applied Mathematics from the University of Paris Dauphine (1991). My current research focuses on the analysis of multidimensional complex data, known as symbolic data - data representing inherent variability, in the form of intervals or distributions - for which I develop statistical approaches and multivariate analysis methodologies. More generally, I am interested in multivariate data analysis, with a particular focus on clustering methods.

Details

Name: Paula Brito
Role: Research Coordinator
Since: 1 January 2008
Nationality: Portugal
Centre: Artificial Intelligence and Decision Support
Contacts: +351220402963, paula.brito@inesctec.pt

Publications

2025

Air Quality Data Analysis with Symbolic Principal Components

Authors
Loureiro, P; Oliveira, M; Brito, P; Oliveira, L

Publication
Springer Proceedings in Mathematics and Statistics

Abstract
Air pollution is a global challenge with deep implications in public health and environment. We examine air quality data from a monitoring station in Entrecampos, Lisbon, Portugal, using Symbolic Data Analysis. The dataset consists of hourly concentrations of nine pollutants during three years, which are logarithmically transformed and aggregated in intervals, taking the daily minimum and maximum values. The symbolic mean and variance are estimated for each variable through the method of moments, and the pairwise dependencies are captured using a bivariate copula. Symbolic principal component scores are obtained from the estimated covariance matrix and used to fit generalized extreme value distributions. Outlier maps, based on these distributions’ quantiles, are used to identify outlying observations. A comparative analysis with daily average-based outlier detection methods is conducted. The results show the relevance of Symbolic Data Analysis in revealing new insights into air quality. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

2025

Parametric models for distributional data

Authors
Brito, P; Silva, APD

Publication
Advances in Data Analysis and Classification

Abstract
We present parametric probabilistic models for numerical distributional variables. The proposed models are based on the representation of each distribution by a location measure and inter-quantile ranges, for given quantiles, thereby characterizing the underlying empirical distributions in a flexible way. Multivariate Normal distributions are assumed for the whole set of indicators, considering alternative structures of the variance-covariance matrix. For all cases, maximum likelihood estimators of the corresponding parameters are derived. This modelling allows for hypothesis testing and multivariate parametric analysis. The proposed framework is applied to Analysis of Variance and parametric Discriminant Analysis of distributional data. A simulation study examines the performance of the proposed models in classification problems under different data conditions. Applications to Internet traffic data and Portuguese official data illustrate the relevance of the proposed approach.

2024

Immigrant groups in Luxembourg's labour market: A symbolic data analysis approach

Authors
Silva, CC; Brito, P; Campos, P

Publication
Statistical Journal of the IAOS

Abstract
Luxembourg, known for its immigration history, attracts immigrants to work. This study analyses different immigrant groups in the labour market from 2014 to 2022 by using Labor Force Survey (LFS) data, Symbolic Data Analysis (SDA), and the Monitoring the Evolution of Clusters (MEC) framework. Based on the birthplace and length of residence in Luxembourg, in each year, microdata were aggregated into 21 symbolic objects. They were primarily described by 16 modal variables which are multi-valued variables with a frequency attached to each category. Moreover, clustering using complete linkage and the Chernoff's distance was applied. The Heuristic Identification of Noisy Variables (HINoV) suggested that with just six variables, objects may be grouped homogeneously. The MEC framework traced temporal relations and transitions between the clusters, revealing some movements across the different years. Results indicate that people from the European Union (EU) and Neighbouring countries have similar profiles while the Portuguese have opposite characteristics. The Luxembourgers are somewhere in between. Profiling people from non-EU countries was challenging. The data and methodology used make it easy to replicate the work in other nations, enabling comparison of results and monitoring to continue in the future.

2024

New skills in symbolic data analysis for official statistics

Authors
Verde, R; Batagelj, V; Brito, P; Silva, APD; Korenjak-Cerne, S; Dobša, J; Diday, E

Publication
Statistical Journal of the IAOS

Abstract
The paper draws attention to the use of Symbolic Data Analysis (SDA) in the field of Official Statistics. It is composed of three sections presenting three pilot techniques in the field of SDA. The three contributions range from a technique based on the notion of exactly unified summaries for the creation of symbolic objects, a model-based approach for interval data as an innovative parametric strategy in this context, and measures of similarity defined between a class and a collection of classes based on the frequency of the categories which characterize them. The paper shows the effectiveness of the proposed approaches as prototypes of numerous techniques developed within the SDA framework and opens to possible further developments.

2024

Special issue on "New methodologies in clustering and classification for complex and/or big data"

Authors
Brito, P; Cerioli, A; Garcia Escudero, LA; Saporta, G

Publication
Advances in Data Analysis and Classification
