Abstract

We propose a new approach for solving a class of discrete decision making problems under uncertainty with positive cost. This issue concerns multiple and diverse fields such as engineering, economics, artificial intelligence, cognitive science and many others. Basically, an agent has to choose a single or series of actions from a set of options, without knowing for sure their consequences. Schematically, two main approaches have been followed: either the agent learns which option is the correct one to choose in a given situation by trial and error, or the agent already has some knowledge on the possible consequences of his decisions; this knowledge being generally expressed as a conditional probability distribution. In the latter case, several optimal or suboptimal methods have been proposed to exploit this uncertain knowledge in various contexts. In this work, we propose following a different approach, based on the geometric intuition of distance. More precisely, we define a goal independent quasimetric structure on the state space, taking into account both cost function and transition probability. We then compare precision and computation time with classical approaches.

Highlights

  • It’s Friday evening, and you are in a hurry to get home after a hard day’s work

  • You might be caught in a traffic jam in a completely unknown neighborhood. This problem that we propose to deal with in this paper can be viewed as sequential decision making, usually expressed as a Markovian Decision Process (MDP) [1,2,3,4] and its extension to Partially Observable cases (POMDP) [5,6]

  • First as we have shown, computation of the quasi-distance is ensured to converge even for infinite horizon while the standard Value Iteration algorithm is not

Read more

Summary

Introduction

It’s Friday evening, and you are in a hurry to get home after a hard day’s work. You can hail a taxi, but it’s costly and you’re worried about traffic jams, common at this time of day. You might go on foot, but it’s slow and tiring. The weather forecast predicted rain, and you forgot your umbrella. In the end you decide to take the subway, but you have to wait half an hour for the train at the connecting station due to a technical incident

Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.