Abstract

Evidence from biological studies suggests that humans are able to predict the sensory consequences of their own actions [1]. Computational studies also demonstrate the advantage of systems that predict sensory consequences of actions over those that predict the value of actions alone [2]. But how could the ability to predict sensory consequences of actions have evolved? One solution suggested by [3] is that prediction mechanisms first evolved to deal with natural sources of delay. Delay is commonly considered to be a purely negative feature of real world systems; however, we argue that delay can actually encourage evolution of the prediction of sensory consequences. We hypothesize that increasing sensory delay to an evolving population of sensory-motor agents will increase reliance on internal prediction of sensory consequences. To test our hypothesis we evolved populations of artificial neural networks at a complex control task (i.e. pole balancing, see figure ​figure1)1) with varied neural conduction delay (Δt) between sensory neurons and input to the control network (see figure ​figure2),2), which estimates the long term cost of applying a specific action. For top fitness networks, hidden unit activations were recorded as well as the true consequent sensory state during several evaluation trials. Each sensory variable was associated with the hidden unit that the sensory variable was maximally correlated with. Taking the average of these correlation values provides a measure of how well an agent can predict the sensory consequences of actions. We expected to find that increasing sensory delay also increases the average correlation measure described above. Figure 1 Cart-Pole Balancing Figure 2 Control network structure The result of the experiment (summarized in figure ​figure3)3) show that with no delay successful agents use a range of strategies, however, as delay increases successful strategies are forced to rely more and more on prediction of the next state to compensate for sensory delay. This seems surprising when considering that under conditions of no delay it is considerably easier to predict the next state than conditions with increased delay. Figure 3 Absolute correlation between hidden until activations and variables of the state at time t+Δt as delay increases. Although the common conception of delay is negative, sensory delay can direct natural selection to favor individuals that are better able to predict the sensory consequences of their actions.

Highlights

  • Evidence from biological studies suggests that humans are able to predict the sensory consequences of their own actions [1]

  • Each sensory variable was associated with the hidden unit that the sensory variable was maximally correlated with

  • Taking the average of these correlation values provides a measure of how well an agent can predict the sensory consequences of actions

Read more

Summary

Introduction

Evidence from biological studies suggests that humans are able to predict the sensory consequences of their own actions [1]. To test our hypothesis we evolved populations of artificial neural networks at a complex control task (i.e. pole balancing, see figure 1) with varied neural conduction delay (Δt) between sensory neurons and input to the control network (see figure 2), which estimates the long term cost of applying a specific action.

Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.