Rollout, Policy Iteration, and Distributed Reinforcement Learning

Rollout, Policy Iteration, and Distributed Reinforcement Learning
Author :
Publisher : Athena Scientific
Total Pages : 498
Release :
ISBN-10 : 9781886529076
ISBN-13 : 1886529078
Rating : 4/5 (078 Downloads)

Book Synopsis Rollout, Policy Iteration, and Distributed Reinforcement Learning by : Dimitri Bertsekas

Download or read book Rollout, Policy Iteration, and Distributed Reinforcement Learning written by Dimitri Bertsekas and published by Athena Scientific. This book was released on 2021-08-20 with total page 498 pages. Available in PDF, EPUB and Kindle. Book excerpt: The purpose of this book is to develop in greater depth some of the methods from the author's Reinforcement Learning and Optimal Control recently published textbook (Athena Scientific, 2019). In particular, we present new research, relating to systems involving multiple agents, partitioned architectures, and distributed asynchronous computation. We pay special attention to the contexts of dynamic programming/policy iteration and control theory/model predictive control. We also discuss in some detail the application of the methodology to challenging discrete/combinatorial optimization problems, such as routing, scheduling, assignment, and mixed integer programming, including the use of neural network approximations within these contexts. The book focuses on the fundamental idea of policy iteration, i.e., start from some policy, and successively generate one or more improved policies. If just one improved policy is generated, this is called rollout, which, based on broad and consistent computational experience, appears to be one of the most versatile and reliable of all reinforcement learning methods. In this book, rollout algorithms are developed for both discrete deterministic and stochastic DP problems, and the development of distributed implementations in both multiagent and multiprocessor settings, aiming to take advantage of parallelism. Approximate policy iteration is more ambitious than rollout, but it is a strictly off-line method, and it is generally far more computationally intensive. This motivates the use of parallel and distributed computation. One of the purposes of the monograph is to discuss distributed (possibly asynchronous) methods that relate to rollout and policy iteration, both in the context of an exact and an approximate implementation involving neural networks or other approximation architectures. Much of the new research is inspired by the remarkable AlphaZero chess program, where policy iteration, value and policy networks, approximate lookahead minimization, and parallel computation all play an important role.


Rollout, Policy Iteration, and Distributed Reinforcement Learning Related Books

Rollout, Policy Iteration, and Distributed Reinforcement Learning
Language: en
Pages: 498
Authors: Dimitri Bertsekas
Categories: Computers
Type: BOOK - Published: 2021-08-20 - Publisher: Athena Scientific

GET EBOOK

The purpose of this book is to develop in greater depth some of the methods from the author's Reinforcement Learning and Optimal Control recently published text
Reinforcement Learning and Optimal Control
Language: zh-CN
Pages: 373
Authors: Dimitri P. Bertsekas
Categories: Artificial intelligence
Type: BOOK - Published: 2020 - Publisher:

GET EBOOK

Reinforcement Learning, second edition
Language: en
Pages: 549
Authors: Richard S. Sutton
Categories: Computers
Type: BOOK - Published: 2018-11-13 - Publisher: MIT Press

GET EBOOK

The significantly expanded and updated new edition of a widely used text on reinforcement learning, one of the most active research areas in artificial intellig
Efficient Reinforcement Learning Using Gaussian Processes
Language: en
Pages: 226
Authors: Marc Peter Deisenroth
Categories: Electronic computers. Computer science
Type: BOOK - Published: 2010 - Publisher: KIT Scientific Publishing

GET EBOOK

This book examines Gaussian processes in both model-based reinforcement learning (RL) and inference in nonlinear dynamic systems.First, we introduce PILCO, a fu
Reinforcement Learning and Dynamic Programming Using Function Approximators
Language: en
Pages: 280
Authors: Lucian Busoniu
Categories: Computers
Type: BOOK - Published: 2017-07-28 - Publisher: CRC Press

GET EBOOK

From household appliances to applications in robotics, engineered systems involving complex dynamics can only be as effective as the algorithms that control the