Exploiting Structure to Efficiently Solve Large Scale Partially Observable Markov Decision Processes [microform]

Exploiting Structure to Efficiently Solve Large Scale Partially Observable Markov Decision Processes [microform]
Author :
Publisher : Library and Archives Canada = Bibliothèque et Archives Canada
Total Pages : 288
Release :
ISBN-10 : 0494027274
ISBN-13 : 9780494027271
Rating : 4/5 (271 Downloads)

Book Synopsis Exploiting Structure to Efficiently Solve Large Scale Partially Observable Markov Decision Processes [microform] by : Pascal Poupart

Download or read book Exploiting Structure to Efficiently Solve Large Scale Partially Observable Markov Decision Processes [microform] written by Pascal Poupart and published by Library and Archives Canada = Bibliothèque et Archives Canada. This book was released on 2005 with total page 288 pages. Available in PDF, EPUB and Kindle. Book excerpt: Partially observable Markov decision processes (POMDPs) provide a natural and principled framework to model a wide range of sequential decision making problems under uncertainty. To date, the use of POMDPs in real-world problems has been limited by the poor scalability of existing solution algorithms, which can only solve problems with up to ten thousand states. In fact, the complexity of finding an optimal policy for a finite-horizon discrete POMDP is PSPACE-complete. In practice, two important sources of intractability plague most solution algorithms: Large policy spaces and large state spaces. In practice, it is critical to simultaneously mitigate the impact of complex policy representations and large state spaces. Hence, this thesis describes three approaches that combine techniques capable of dealing with each source of intractability: VDC with BPI, VDC with Perseus (a randomized point-based value iteration algorithm by Spaan and Vlassis [136]), and state abstraction with Perseus. The scalability of those approaches is demonstrated on two problems with more than 33 million states: synthetic network management and a real-world system designed to assist elderly persons with cognitive deficiencies to carry out simple daily tasks such as hand-washing. This represents an important step towards the deployment of POMDP techniques in ever larger, real-world, sequential decision making problems. On the other hand, for many real-world POMDPs it is possible to define effective policies with simple rules of thumb. This suggests that we may be able to find small policies that are near optimal. This thesis first presents a Bounded Policy Iteration (BPI) algorithm to robustly find a good policy represented by a small finite state controller. Real-world POMDPs also tend to exhibit structural properties that can be exploited to mitigate the effect of large state spaces. To that effect, a value-directed compression (VDC) technique is also presented to reduce POMDP models to lower dimensional representations.


Exploiting Structure to Efficiently Solve Large Scale Partially Observable Markov Decision Processes [microform] Related Books

Exploiting Structure to Efficiently Solve Large Scale Partially Observable Markov Decision Processes [microform]
Language: en
Pages: 288
Authors: Pascal Poupart
Categories:
Type: BOOK - Published: 2005 - Publisher: Library and Archives Canada = Bibliothèque et Archives Canada

GET EBOOK

Partially observable Markov decision processes (POMDPs) provide a natural and principled framework to model a wide range of sequential decision making problems
Algorithms for Partially Observable Markov Decision Processes
Language: en
Pages: 354
Authors: Hsien-Te Cheng
Categories: Markov processes
Type: BOOK - Published: 1988 - Publisher:

GET EBOOK

Finite Memory Policies for Partially Observable Markov Decision Processes [microform]
Language: en
Pages:
Authors: Lusena, Christopher
Categories: Heuristic programming
Type: BOOK - Published: 2001 - Publisher: Ann Arbor, Mich. : University Microfilms International

GET EBOOK

Algorithms for Partially Observable Markov Decision Processes
Language: en
Pages: 354
Authors: Hsien-Te Cheng
Categories: Markov processes
Type: BOOK - Published: 1988 - Publisher:

GET EBOOK

Learning and Solving Partially Observable Markov Decision Processes
Language: en
Pages: 176
Authors: Guy Shani
Categories: Markov processes
Type: BOOK - Published: 2007 - Publisher:

GET EBOOK