Cardinality estimation with local deep learning models

descriptionPublicationkeyboard_double_arrow_right Article , Conference object 05 Jul 2019 Germany Publisher:ACMJournal:Proceedings of the Second International Workshop on Exploiting Artificial Intelligence Techniques for Data Management

Authors: Lucas Woltmann; Claudio Hartmann; Maik Thiele; Dirk Habich; Wolfgang Lehner;

doi: 10.1145/3329859.3329875

Cardinality estimation with local deep learning models

- Summary
- Subjects
- Metrics

Abstract

Cardinality estimation is a fundamental task in database query processing and optimization. Unfortunately, the accuracy of traditional estimation techniques is poor resulting in non-optimal query execution plans. With the recent expansion of machine learning into the field of data management, there is the general notion that data analysis, especially neural networks, can lead to better estimation accuracy. Up to now, all proposed neural network approaches for the cardinality estimation follow a global approach considering the whole database schema at once. These global models are prone to sparse data at training leading to misestimates for queries which were not represented in the sample space used for generating training queries. To overcome this issue, we introduce a novel local-oriented approach in this paper, therefore the local context is a specific sub-part of the schema. As we will show, this leads to better representation of data correlation and thus better estimation accuracy. Compared to global approaches, our novel approach achieves an improvement by two orders of magnitude in accuracy and by a factor of four in training time performance for local models.

Country

Germany

Related Organizations

TU Dresden
Germany

Keywords

ddc:004, Kardinalitätsschätzung, lokale Deep-Learning-Modelle, Verarbeitung und Optimierung von Datenbankabfragen, Cardinality Estimation , Local Deep Learning Models, database query processing and optimization, info:eu-repo/classification/ddc/004

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	58
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 1%

Found an issue? Give us feedback

58

Top 1%

Top 10%

Top 1%

Green

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Related to Research communities

EUTOPIA Open Research Portal