descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 18 Jul 2021Embargo end date: 01 Jan 2018Publisher:IEEEJournal:2021 International Joint Conference on Neural Networks (IJCNN)Funded by:FCT | LA 2

Authors: Tasfi, Norman; Capretz, Miriam A M;

doi: 10.1109/ijcnn52387.2021.9533459 , 10.48550/arxiv.1812.11240

arXiv: http://arxiv.org/abs/1812.11240

Dynamic Planning Networks

- Summary
- Subjects
- Related research
  (1)
- Metrics

Abstract

We introduce Dynamic Planning Networks (DPN), a novel architecture for deep reinforcement learning, that combines model-based and model-free aspects for online planning. Our architecture learns to dynamically construct plans using a learned state-transition model by selecting and traversing between simulated states and actions to maximize information before acting. In contrast to model-free methods, model-based planning lets the agent efficiently test action hypotheses without performing costly trial-and-error in the environment. DPN learns to efficiently form plans by expanding a single action-conditional state transition at a time instead of exhaustively evaluating each action, reducing the required number of state-transitions during planning by up to 96%. We observe various emergent planning patterns used to solve environments, including classical search methods such as breadth-first and depth-first search. DPN shows improved data efficiency, performance, and generalization to new and unseen domains in comparison to several baselines.

Related Organizations

Keywords

FOS: Computer and information sciences, reinforcement learning, Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Neural and Evolutionary Computing, Machine Learning (stat.ML), Electrical and Computer Engineering, Machine Learning (cs.LG), Artificial Intelligence (cs.AI), deep neural networks, Statistics - Machine Learning, Computer Engineering, Neural and Evolutionary Computing (cs.NE), planning

1 Research products, page 1 of 1

treeqn software on GitHub
IsRelatedTo

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Average

Green

Fields of Science (7) View all

Fields of Science

Funded by

Dynamic Planning Networks

Dynamic Planning Networks

1 Research products, page 1 of 1

treeqn software on GitHub