
doi: 10.1561/1900000082
This survey presents recent progress on using machine learning techniques to improve query optimizers in database systems. Centering around a generic paradigm of learned query optimizers, this survey covers several lines of effort on rebuilding or aiding important components in query optimizers (i.e., cardinality estimators, cost models, and plan enumerators) with machine learning. We introduce some important machine learning tools developed recently, which are useful for query optimization, and how they are adapted for sub-tasks of query optimization. This survey is for readers who are already familiar with query optimization and are eager to understand what machine learning techniques can be helpful and how to apply them with examples and necessary details, or for machine learning researchers who want to expand their research agendas to helping database systems with machine learning techniques. Some open research challenges are also discussed with the goal of making learned query optimizers truly applicable in production.
database tuning, reinforcement learning, OLAP, Research exposition (monographs, survey articles) pertaining to computer science, query processing, Database theory, Learning and adaptive systems in artificial intelligence, graphical models, deep learning, database design, data mining, Artificial neural networks and deep learning, query optimization
database tuning, reinforcement learning, OLAP, Research exposition (monographs, survey articles) pertaining to computer science, query processing, Database theory, Learning and adaptive systems in artificial intelligence, graphical models, deep learning, database design, data mining, Artificial neural networks and deep learning, query optimization
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 6 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
