Deep Generative Models for Decision-Making and Control

Name: Deep Generative Models for Decision-Making and Control
Creator: Michael Janner
Keywords: FOS: Computer and information sciences, Computer Science - Machine Learning, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Machine Learning (cs.LG)

arXiv.org e-Print Archive
Preprint, 2023
Data sources: arXiv.org e-Print Archive

https://dx.doi.org/10.48550/ar...
Article, 2023
License: arXiv Non-Exclusive Distribution
Data sources: Datacite

DBLP
Article
Data sources: DBLP

Publication: Article, Preprint
Date: 01 Jan 2023
Embargo end date: 01 Jan 2023
Publisher: arXiv
Journal: CoRR, volume abs/2306.08810
Funded by: FCT | D4

Authors: Michael Janner;

doi: 10.48550/arxiv.2306.08810

arXiv: 2306.08810

Abstract

Deep model-based reinforcement learning methods offer a conceptually simple approach to the decision-making and control problem: use learning for the purpose of estimating an approximate dynamics model, and offload the rest of the work to classical trajectory optimization. However, this combination has a number of empirical shortcomings, limiting the usefulness of model-based methods in practice. The dual purpose of this thesis is to study the reasons for these shortcomings and to propose solutions for the uncovered problems. Along the way, we highlight how inference techniques from the contemporary generative modeling toolbox, including beam search, classifier-guided sampling, and image inpainting, can be reinterpreted as viable planning strategies for reinforcement learning problems.

UC Berkeley PhD thesis; supersedes arXiv:2010.14496, arXiv:2106.02039, and arXiv:2205.09991
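
The abstract's claim that beam search can be reinterpreted as a planning strategy can be illustrated with a minimal sketch. This is not the thesis's implementation: the function `beam_search_plan`, the one-dimensional `toy_model` (standing in for a learned dynamics model), and `toy_reward` are hypothetical names chosen for illustration, and the sketch assumes a small discrete action set and a deterministic model.

```python
from typing import Callable, List, Sequence, Tuple


def beam_search_plan(
    state: float,
    actions: Sequence[float],
    model: Callable[[float, float], float],
    reward: Callable[[float, float], float],
    horizon: int,
    beam_width: int,
) -> Tuple[List[float], float]:
    """Search for a high-return action sequence by keeping only the
    `beam_width` best partial plans at each step."""
    # Each beam entry is (cumulative return, predicted state, actions so far).
    beams: List[Tuple[float, float, List[float]]] = [(0.0, state, [])]
    for _ in range(horizon):
        candidates = []
        for ret, s, plan in beams:
            for a in actions:
                s_next = model(s, a)  # predicted next state
                candidates.append((ret + reward(s_next, a), s_next, plan + [a]))
        # Keep the partial plans with the highest cumulative return.
        candidates.sort(key=lambda c: c[0], reverse=True)
        beams = candidates[:beam_width]
    best_return, _, best_plan = beams[0]
    return best_plan, best_return


# Hypothetical stand-in for a learned dynamics model: the action shifts the state.
def toy_model(s: float, a: float) -> float:
    return s + a


# Hypothetical reward: negative distance of the predicted state to a target of 3.0.
def toy_reward(s: float, a: float) -> float:
    return -abs(s - 3.0)


plan, ret = beam_search_plan(
    state=0.0,
    actions=[-1.0, 0.0, 1.0],
    model=toy_model,
    reward=toy_reward,
    horizon=4,
    beam_width=2,
)
print(plan, ret)
```

In this toy setting the planner moves toward the target and then holds position. A beam width of 1 reduces the search to greedy action selection, while a larger width trades computation for a broader search over candidate plans.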

Related Organizations

University of California System
United States
University of California, San Francisco
United States

6 Research products

- mujoco-py software on GitHub (IsRelatedTo)
- BCQ software on GitHub (IsRelatedTo)
- denoising-diffusion-pytorch software on GitHub (IsRelatedTo)
- midGPT software on GitHub (IsRelatedTo)
- d4rl software on GitHub (IsRelatedTo)
- CQL software on GitHub (IsRelatedTo)

Impact by BIP!

- Selected citations: 0. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Popularity: Average. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- Influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.