Skip to Main content Skip to Navigation
Conference papers

Explaining Automated Data Cleaning with CLeanEX

Abstract : In this paper, we study the explainability of automated data cleaning pipelines and propose CLeanEX, a solution that can generate explanations for the pipelines automatically selected by an automated cleaning system, given it can provide its corresponding cleaning pipeline search space. We propose meaningful explanatory features that are used to describe the pipelines and generate predicate-based explanation rules. We compute quality indicators for these explanations and propose a multi-objective optimization algorithm to select the optimal set of explanations for user-defined objectives. Preliminary experiments show the need for multi-objective optimization for the generation of high-quality explanations that can be either intrinsic to the single selected cleaning pipeline or relative to the other pipelines that were not selected by the automated cleaning system. We also show that CLeanEX is a promising step towards generating automatically insightful explanations, while catering to the needs of the user alike.
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-03066026
Contributor : Laure Berti-Equille <>
Submitted on : Tuesday, December 15, 2020 - 9:32:23 AM
Last modification on : Monday, February 22, 2021 - 11:00:02 AM

File

XAI2021-CRV-berti-equille-comi...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-03066026, version 1

Citation

Laure Berti-Équille, Ugo Comignani. Explaining Automated Data Cleaning with CLeanEX. IJCAI-PRICAI 2020 Workshop on Explainable Artificial Intelligence (XAI), Jan 2021, Online, Japan. ⟨hal-03066026⟩

Share

Metrics

Record views

130

Files downloads

73