Normalizing Constrained Symbolic Data for Clustering
Abstract
Clustering is one of the most common operation in data analysis
while constrained is not so common. We present here a clustering method in the
framework of Symbolic Data Analysis (S.D.A) which allows to cluster Symbolic
Data. Such data can be constrained relations between the variables, expressed by
rules which express the domain knowledge. But such rules can induce a combinatorial
increase of the computation time according to the number of rules. We
present in this paper a way to cluster such data in a quadratic time. This method
is based first on the decomposition of the data according to the rules, then we
can apply to the data a clustering algorithm based on dissimilarities.