Data Quality and Explainable AI

descriptionPublicationkeyboard_double_arrow_right Article 03 May 2020 Belgium English Publisher:Association for Computing Machinery (ACM)Journal:Journal of Data and Information Quality, volume 12, pages 1-9 (issn: 1936-1955, eissn: 1936-1963,

Copyright policy )

Authors: Leopoldo E. Bertossi; Floris Geerts;

doi: 10.1145/3386687

handle: 10067/1715890151162165141

Data Quality and Explainable AI

- Summary
- Subjects
- Metrics

Abstract

In this work, we provide some insights and develop some ideas, with few technical details, about the role of explanations in Data Quality in the context of data-based machine learning models (ML). In this direction, there are, as expected, roles for causality, and explainable artificial intelligence . The latter area not only sheds light on the models, but also on the data that support model construction. There is also room for defining, identifying, and explaining errors in data, in particular, in ML, and also for suggesting repair actions. More generally, explanations can be used as a basis for defining dirty data in the context of ML, and measuring or quantifying them. We think dirtiness as relative to the ML task at hand, e.g., classification.

Country

Belgium

Related Organizations

Adolfo Ibáñez University
Chile
University of Antwerp
Belgium

Keywords

Computer. Automation

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	37
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%