GraphMDL : sélection de motifs de graphes avec le principe MDL
Abstract
Many graph pattern mining algorithms have been designed to identify recurring structures
in graphs. The main drawback of these approaches is that they often extract too many patterns
for human analysis. Recently, pattern mining methods using the Minimum Description Length
(MDL) principle have been proposed to select a characteristic subset of patterns from transactional, sequential and relational data. In this paper, we propose a MDL-based approach for
selecting a characteristic subset of patterns on labeled graphs. A key notion in this paper is
the introduction of ports to encode connections between pattern occurrences without any loss
of information. Experiments show that the number of patterns is drastically reduced, and the
selected patterns can have complex shapes.