Animals are able to cope with the noise, uncertainties, and complexity of the real world. Often even elementary living beings, equipped with very limited sensory organs, are able to reach regions favorable to their existence, using simple stochastic policies. In this paper we discuss a minimalistic stochastic behavioral rule, inspired from bacteria chemotaxis, which is able to increase the value of a specified evaluation function in a similar manner. In particular, we prove that, under opportune assumptions, the direction that is taken with maximum probability by an agent that follows this rule corresponds to the optimal direction. The rule does not require a specific agent dynamics, needs no memory for storing observed states, and works in generic n-dimensional spaces. It thus reveals itself interesting for the control of simple sensing robots as well.
Read full abstract