Normalizing Constrained Symbolic Data for Clustering

Marc Csernel, Francisco de Assis Tenório de Carvalho

In HDSDA 2013, vol. RNTI-E-25, pp.58-77

Résumé

Clustering is one of the most common operation in data analysis while constrained is not so common. We present here a clustering method in the framework of Symbolic Data Analysis (S.D.A) which allows to cluster Symbolic Data. Such data can be constrained relations between the variables, expressed by rules which express the domain knowledge. But such rules can induce a combinatorial increase of the computation time according to the number of rules. We present in this paper a way to cluster such data in a quadratic time. This method is based first on the decomposition of the data according to the rules, then we can apply to the data a clustering algorithm based on dissimilarities.

Aperçu Voir bibtex

Télécharger