Abstract

Generating various goal-oriented movements via the flexible muscle model of the musculoskeletal system as fast and accurately as possible is a pressing problem, which is also the basis of most human adaptive behaviors, such as reaching, catching, interception, and pointing. This article focuses on the adaptive motion generation of fast goal-oriented motion on the musculoskeletal system by implementing the speed-accuracy tradeoff (SAT) in a hierarchical motion learning framework. First, we introduce Fitts' Law into the modified basal ganglia circuit-inspired iterative decision-making model for achieving dynamic and adaptive decision making. Then, as a time constraint, the decision is decomposed into a series of supervised terms by the proposed striatal FSI-SPN interneuron circuit-inspired velocity modulator to implement the tradeoff smoothly on the musculoskeletal system. Finally, an improved policy gradient algorithm is suggested to generate the muscle excitations of the modulated motion via the proposed muscle co-contraction policy, which promotes general cooperation between flexor and extensor muscles. In experiments, a redundant musculoskeletal arm model is trained to perform the adaptive quick pointing movements. By combining the muscle co-contraction policy with SAT, our algorithm shows the most efficient training and the best performance in the adaptive motion generation among the other three popular reinforcement learning algorithms on the musculoskeletal model.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call