Calibrating Learning Models to Improve Automatic Detectors of Mislabeled Examples
Abstract
Mislabeled data is a pervasive issue that undermines the performance of machine-learning
models across various industries. Methods for detecting mislabeled instances usually involve
training a base machine-learning model and then probing it on every instance to obtain
a trust score indicating whether the provided label is genuine or incorrect. In this paper, we experiment with
the calibration of this base model. Our empirical findings show that employing calibration
methods improves the accuracy and robustness of mislabeled instance detection, providing a
practical and effective solution for industry applications.
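The detection pipeline described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes scikit-learn, uses Platt (sigmoid) scaling as the calibration method, and defines the trust score as the out-of-fold calibrated probability assigned to each instance's provided label.

```python
# Hypothetical sketch of calibrated mislabel detection (not the paper's code):
# calibrate a base classifier, then score each instance by the calibrated
# probability of its provided label; low-trust instances are flagged.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.calibration import CalibratedClassifierCV
from sklearn.model_selection import cross_val_predict

# Synthetic binary data with injected label noise
X, y = make_classification(n_samples=500, random_state=0)
rng = np.random.RandomState(0)
flipped = rng.choice(len(y), size=25, replace=False)
y_noisy = y.copy()
y_noisy[flipped] = 1 - y_noisy[flipped]

# Base model wrapped with sigmoid (Platt) calibration
base = CalibratedClassifierCV(LogisticRegression(), method="sigmoid", cv=3)

# Out-of-fold probabilities, so each instance is scored by a model
# that never trained on its (possibly wrong) label
proba = cross_val_predict(base, X, y_noisy, cv=5, method="predict_proba")

# Trust score: calibrated probability of the label the dataset provides
trust = proba[np.arange(len(y_noisy)), y_noisy]

# Flag the lowest-trust instances as suspected mislabels
suspects = np.argsort(trust)[:25]
```

Other calibration methods (e.g. isotonic regression via `method="isotonic"`) and other trust scores can be substituted in the same pipeline.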