Download: Proximal Reinforcement Learning%3a A New Theory Of Sequential Decision Making In Primal Dual Spaces by Sridhar Mahadevan

Proximal Reinforcement Learning%3a A New Theory Of Sequential Decision Making In Primal Dual Spaces by Sridhar Mahadevan

Read "Proximal Reinforcement Learning%3a A New Theory Of Sequential Decision Making In Primal Dual Spaces" by Sridhar Mahadevan through these free online access and download options.

Books Results

Available books for downloads and borrow from The internet Archive

1Proximal Reinforcement Learning: A New Theory Of Sequential Decision Making In Primal-Dual Spaces

By Sridhar Mahadevan, Bo Liu, Philip Thomas, Will Dabney, Steve Giguere, Nicholas Jacek, Ian Gemp and Ji Liu

In this paper, we set forth a new vision of reinforcement learning developed by us over the past few years, one that yields mathematically rigorous solutions to longstanding important questions that have remained unresolved: (i) how to design reliable, convergent, and robust reinforcement learning algorithms (ii) how to guarantee that reinforcement learning satisfies pre-specified "safety" guarantees, and remains in a stable region of the parameter space (iii) how to design "off-policy" temporal difference learning algorithms in a reliable and stable manner, and finally (iv) how to integrate the study of reinforcement learning into the rich theory of stochastic optimization. In this paper, we provide detailed answers to all these questions using the powerful framework of proximal operators. The key idea that emerges is the use of primal dual spaces connected through the use of a Legendre transform. This allows temporal difference updates to occur in dual spaces, allowing a variety of important technical advantages. The Legendre transform elegantly generalizes past algorithms for solving reinforcement learning problems, such as natural gradient methods, which we show relate closely to the previously unconnected framework of mirror descent methods. Equally importantly, proximal operator theory enables the systematic development of operator splitting methods that show how to safely and reliably decompose complex products of gradients that occur in recent variants of gradient-based temporal difference learning. This key technical innovation makes it possible to finally design "true" stochastic gradient methods for reinforcement learning. Finally, Legendre transforms enable a variety of other benefits, including modeling sparsity and domain geometry. Our work builds extensively on recent work on the convergence of saddle-point algorithms, and on the theory of monotone operators.

“Proximal Reinforcement Learning: A New Theory Of Sequential Decision Making In Primal-Dual Spaces” Metadata:

Title: ➤ Proximal Reinforcement Learning: A New Theory Of Sequential Decision Making In Primal-Dual Spaces
Authors: ➤ Sridhar MahadevanBo LiuPhilip ThomasWill DabneySteve GiguereNicholas JacekIan GempJi Liu

“Proximal Reinforcement Learning: A New Theory Of Sequential Decision Making In Primal-Dual Spaces” Subjects and Themes:

Subjects: Computing Research Repository - Learning

Edition Identifiers:

Internet Archive ID: arxiv-1405.6757

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 3.74 Mbs, the file-s for this book were downloaded 21 times, the file-s went public at Sat Jun 30 2018.

Available formats:
Archive BitTorrent - Metadata - Text PDF -

Online Marketplaces

Find Proximal Reinforcement Learning: A New Theory Of Sequential Decision Making In Primal-Dual Spaces at online marketplaces:

Amazon: Audiable, Kindle and printed editions.

Ebay: New & used books.

Top of Page

Buy “Proximal Reinforcement Learning%3a A New Theory Of Sequential Decision Making In Primal Dual Spaces” online:

Shop for “Proximal Reinforcement Learning%3a A New Theory Of Sequential Decision Making In Primal Dual Spaces” on popular online marketplaces.

Ebay: New and used books.

Downloads & Free Reading Options - Results

Proximal Reinforcement Learning%3a A New Theory Of Sequential Decision Making In Primal Dual Spaces by Sridhar Mahadevan

Search for Downloads

Books Results

Source: The Internet Archive

The internet Archive Search Results

1Proximal Reinforcement Learning: A New Theory Of Sequential Decision Making In Primal-Dual Spaces

By Sridhar Mahadevan, Bo Liu, Philip Thomas, Will Dabney, Steve Giguere, Nicholas Jacek, Ian Gemp and Ji Liu

“Proximal Reinforcement Learning: A New Theory Of Sequential Decision Making In Primal-Dual Spaces” Metadata:

“Proximal Reinforcement Learning: A New Theory Of Sequential Decision Making In Primal-Dual Spaces” Subjects and Themes:

Edition Identifiers:

Downloads Information:

Related Links:

Online Marketplaces

Buy “Proximal Reinforcement Learning%3a A New Theory Of Sequential Decision Making In Primal Dual Spaces” online: