Anonymiser des données multidimensionnelles à l'aide du coclustering

Françoise Fessant, Tarek Benkhelif, Fabrice Clérot

In EGC 2017, vol. RNTI-E-33, pp.153-164

Abstract

In tis paper we propose a methodology to anonymize multidimensional individual data. The goal is to be able to protect data against the reidentification risk. The proposed solution is based on a coclustering method. The coclustering is used to build an aggregated representation of the data, then the model is used to draw synthetic individual data. We show that these synthetic data preserve sufficient information to be used in place of the real data. Finally the protection against the reidentification risk is evaluated.

Preview See bibtex

Download