Downloads & Free Reading Options - Results
On Convergence Of Value Iteration For A Class Of Total Cost Markov Decision Processes by Huizhen Yu
Read "On Convergence Of Value Iteration For A Class Of Total Cost Markov Decision Processes" by Huizhen Yu through these free online access and download options.
Books Results
Source: The Internet Archive
The internet Archive Search Results
Available books for downloads and borrow from The internet Archive
1On Convergence Of Value Iteration For A Class Of Total Cost Markov Decision Processes
By Huizhen Yu
We consider a general class of total cost Markov decision processes (MDP) in which the one-stage costs can have arbitrary signs, but the sum of the negative parts of the one-stage costs is finite for all policies and all initial states. We refer to this class as the General Convergence (GC for short) total cost model, and we study the convergence of value iteration for the GC model, in the Borel MDP framework with universally measurable policies. Our main results include: (i) convergence of value iteration when starting from certain functions above the optimal cost function; (ii) convergence of transfinite value iteration starting from zero, in the special case where the optimal cost function is nonnegative; and (iii) partial convergence of value iteration starting from zero, for a subset of initial states. These results extend several previously known results about the convergence of value iteration for either positive costs problems or GC total cost problems. In particular, the first result on convergence of value iteration from above extends a theorem of van der Wal for the GC model. The second result relates to Maitra and Sudderth's analysis of transfinite value iteration for the positive costs model. It suggests connections between the two total cost models when the optimal cost function is nonnegative, and it leads to additional results on the convergence of ordinary non-transfinite value iteration, with a suitably defined dynamic programming operator, for finite state or finite control GC problems. The third result on partial convergence of value iteration is motivated by Whittle's bridging condition for the positive costs model, and provides a novel extension of the bridging condition to the GC model, where there are no sign constraints on the costs.
“On Convergence Of Value Iteration For A Class Of Total Cost Markov Decision Processes” Metadata:
- Title: ➤ On Convergence Of Value Iteration For A Class Of Total Cost Markov Decision Processes
- Author: Huizhen Yu
“On Convergence Of Value Iteration For A Class Of Total Cost Markov Decision Processes” Subjects and Themes:
- Subjects: Mathematics - Optimization and Control
Edition Identifiers:
- Internet Archive ID: arxiv-1411.1459
Downloads Information:
The book is available for download in "texts" format, the size of the file-s is: 0.57 Mbs, the file-s for this book were downloaded 19 times, the file-s went public at Sat Jun 30 2018.
Available formats:
Archive BitTorrent - Metadata - Text PDF -
Related Links:
- Whefi.com: Download
- Whefi.com: Review - Coverage
- Internet Archive: Details
- Internet Archive Link: Downloads
Online Marketplaces
Find On Convergence Of Value Iteration For A Class Of Total Cost Markov Decision Processes at online marketplaces:
- Amazon: Audiable, Kindle and printed editions.
- Ebay: New & used books.
Buy “On Convergence Of Value Iteration For A Class Of Total Cost Markov Decision Processes” online:
Shop for “On Convergence Of Value Iteration For A Class Of Total Cost Markov Decision Processes” on popular online marketplaces.
- Ebay: New and used books.