On optimising taboo criteria in Markov decision processes
by Kartikeya Puranam; Michael Katehakis
International Journal of Applied Decision Sciences (IJADS), Vol. 7, No. 1, 2014

Abstract: The standard approach when using a Markov decision process to find an optimal policy is to assume a fixed profit or cost structure. However, in many applied problems it may be not possible to determine profits or costs associated with all states or actions. In such cases we propose the use of taboo first passage reward and taboo first passage time as objectives. In this paper, we investigate problems related to optimising aforementioned taboo measures and we provide two examples.

Online publication date: Sat, 28-Jun-2014

The full text of this article is only available to individual subscribers or to users at subscribing institutions.

 
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.

Pay per view:
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.

Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Applied Decision Sciences (IJADS):
Login with your Inderscience username and password:

    Username:        Password:         

Forgotten your password?


Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.

If you still need assistance, please email subs@inderscience.com