Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
Publications

Publications by Paula Brito

1993

Learning Probabilistic Models by Conceptual Pyramidal Clustering

Authors
Diday, E; Brito, P; Mfoumoune, E;

Publication
Progress in Artificial Intelligence, 6th Portuguese Conference on Artificial Intelligence, EPIA '93, Porto, Portugal, October 6-8, 1993, Proceedings

Abstract
Symbolic objects (Diday (1987, 1992), Brito, Diday (1990), Brito (1991)) allow to model data on the form of descriptions by intension, thus generalizing the usual tabular model of data analysis. This modelisation allows to take into account variability within a set. The formalism of symbolic objects has some notions in common with VL1, proposed by Michalski (1980); however VL1 is mainly based on prepositional and predicate calculus, while the formalism of symbolic objects allows for an explicit interpretation within its framework, by considering the duality intension-extension. That is, given a set of observations, we consider the couple (symbolic object — extension in the given set). This results from the wish to keep a statistics point of view. The need to represent non-deterministic knowledge, that is, data for which the values for the different variables are assigned a weight, led to considering an extension of assertion objects to probabilist objects (Diday 1992). In this case, data are represented by probability distributions on the variables observation sets. The notions previously defined for assertion objects are the generalized to this new kind of symbolic objects. Other extensions can be found in Diday (1992). © Springer-Verlag Berlin Heidelberg 1993.

2012

Conceptual clustering with generalization by intervals [Classification Conceptuelle avec Généralisation par Intervalles]

Authors
Brito, P; Polaillon, G;

Publication
Revue des Nouvelles Technologies de l'Information

Abstract
This paper deals with hierarchical or pyramidal conceptual clustering methods, where each formed cluster corresponds to a concept, i.e., a pair (extent, intent).We consider data presenting real or interval-valued numerical values, ordered values and/or probability/frequency distributions on a set of categories. Concepts are obtained by a Galois connection with generalisation by intervals, which allows dealing with different variable types on a common framework. In the case of distribution data, the obtained concepts are more homogeneous and more easily interpretable than those obtained by using the maximum and minimum operators previously proposed. A measure of generality of a concept is defined similarly for all these variable types. An example illustrates the proposed method.

2003

Mining official data

Authors
Brito, P; Malerba, D;

Publication
Intelligent Data Analysis

Abstract

2011

Homogeneity and stability in conceptual analysis

Authors
Brito, P; Polaillon, G;

Publication
CEUR Workshop Proceedings

Abstract
This work comes within the field of data analysis using Galois lattices. We consider ordinal, numerical single or interval data as well as data that consist on frequency/probability distributions on a finite set of categories. Data are represented and dealt with on a common framework, by defining a generalization operator that determines intents by intervals. In the case of distribution data, the obtained concepts are more homogeneous and more easily interpretable than those obtained by using the maximum and minimum operators previously proposed. The number of obtained concepts being often rather large, and to limit the influence of atypical elements, we propose to identify stable concepts using interval distances in a cross validation-like approach.

2012

Divisive monothetic clustering for interval and histogram-valued data

Authors
Brito, P; Chavent, M;

Publication
ICPRAM 2012 - Proceedings of the 1st International Conference on Pattern Recognition Applications and Methods

Abstract
In this paper we propose a divisive top-down clustering method designed for interval and histogram-valued data. The method provides a hierarchy on a set of objects together with a monothetic characterization of each formed cluster. At each step, a cluster is split so as to minimize intra-cluster dispersion, which is measured using a distance suitable for the considered variable types. The criterion is minimized across the bipartitions induced by a set of binary questions. Since interval-valued variables may be considered a special case of histogram-valued variables, the method applies to data described by either kind of variables, or by variables of both types. An example illustrates the proposed approach.

2006

Symbolic and spatial data analysis: Mining complex data structures

Authors
Brito, P; Noirhomme Fraiture, M;

Publication
INTELLIGENT DATA ANALYSIS

Abstract

  • 8
  • 11