Adaptive Representation for Policy Gradient

Adaptive Representation for Policy Gradient
Author :
Publisher :
Total Pages : 40
Release :
ISBN-10 : OCLC:918929732
ISBN-13 :
Rating : 4/5 ( Downloads)

Book Synopsis Adaptive Representation for Policy Gradient by : Ujjwal Das Gupta

Download or read book Adaptive Representation for Policy Gradient written by Ujjwal Das Gupta and published by . This book was released on 2015 with total page 40 pages. Available in PDF, EPUB and Kindle. Book excerpt: Much of the focus on finding good representations in reinforcement learning has been on learning complex non-linear predictors of value. Methods like policy gradient, that do not learn a value function and instead directly represent policy, often need fewer parameters to learn good policies. However, they typically employ a fixed parametric representation that may not be sufficient for complex domains. This thesis introduces two algorithms which can learn an adaptive representation of policy: the Policy Tree algorithm, which learns a decision tree over different instantiations of a base policy, and the Policy Conjunction algorithm, which adds conjunctive features to any base policy that uses a linear feature representation. In both of these algorithms, policy gradient is used to grow the representation in a way that enables the maximum local increase in the expected return of the policy. Experiments show that these algorithms can choose genuinely helpful splits or features, and significantly improve upon the commonly used linear Gibbs softmax policy, which is chosen as the base policy.


Adaptive Representation for Policy Gradient Related Books

Adaptive Representation for Policy Gradient
Language: en
Pages: 40
Authors: Ujjwal Das Gupta
Categories: Algorithms
Type: BOOK - Published: 2015 - Publisher:

DOWNLOAD EBOOK

Much of the focus on finding good representations in reinforcement learning has been on learning complex non-linear predictors of value. Methods like policy gra
Adaptive Representations for Reinforcement Learning
Language: en
Pages: 127
Authors: Simon Whiteson
Categories: Computers
Type: BOOK - Published: 2010-10-05 - Publisher: Springer Science & Business Media

DOWNLOAD EBOOK

This book presents new algorithms for reinforcement learning, a form of machine learning in which an autonomous agent seeks a control policy for a sequential de
The Logic of Adaptive Behavior
Language: en
Pages: 508
Authors: Martijn van Otterlo
Categories: Business & Economics
Type: BOOK - Published: 2009 - Publisher: IOS Press

DOWNLOAD EBOOK

Markov decision processes have become the de facto standard in modeling and solving sequential decision making problems under uncertainty. This book studies lif
Theoretical and Practical Advances in Computer-based Educational Measurement
Language: en
Pages: 399
Authors: Bernard P. Veldkamp
Categories: Education
Type: BOOK - Published: 2019-07-05 - Publisher: Springer

DOWNLOAD EBOOK

This open access book presents a large number of innovations in the world of operational testing. It brings together different but related areas and provides in
Adaptive Dynamic Programming: Single and Multiple Controllers
Language: en
Pages: 271
Authors: Ruizhuo Song
Categories: Technology & Engineering
Type: BOOK - Published: 2018-12-28 - Publisher: Springer

DOWNLOAD EBOOK

This book presents a class of novel optimal control methods and games schemes based on adaptive dynamic programming techniques. For systems with one control inp