Learning Heuristic Selection with Dynamic Algorithm Configuration

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 17 May 2021Embargo end date: 01 Jan 2020Publisher:Association for the Advancement of Artificial Intelligence (AAAI)Journal:Proceedings of the International Conference on Automated Planning and Scheduling, volume 31, pages 597-605 (issn: 2334-0835, eissn: 2334-0843,

Copyright policy )

Authors: David Speck 0001; André Biedenkapp; Frank Hutter; Robert Mattmüller; Marius Lindauer;

doi: 10.1609/icaps.v31i1.16008 , 10.48550/arxiv.2006.08246

arXiv: 2006.08246

Learning Heuristic Selection with Dynamic Algorithm Configuration

- Summary
- Subjects
- Related research
  (1)
- Metrics

Abstract

A key challenge in satisficing planning is to use multiple heuristics within one heuristic search. An aggregation of multiple heuristic estimates, for example by taking the maximum, has the disadvantage that bad estimates of a single heuristic can negatively affect the whole search. Since the performance of a heuristic varies from instance to instance, approaches such as algorithm selection can be successfully applied. In addition, alternating between multiple heuristics during the search makes it possible to use all heuristics equally and improve performance. However, all these approaches ignore the internal search dynamics of a planning system, which can help to select the most useful heuristics for the current expansion step. We show that dynamic algorithm configuration can be used for dynamic heuristic selection which takes into account the internal search dynamics of a planning system. Furthermore, we prove that this approach generalizes over existing approaches and that it can exponentially improve the performance of the heuristic search. To learn dynamic heuristic selection, we propose an approach based on reinforcement learning and show empirically that domain-wise learned policies, which take the internal search dynamics of a planning system into account, can exceed existing approaches.

Related Organizations

View all View all

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Machine Learning (cs.LG)

1 Research products, page 1 of 1

rl-plan software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	12
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%