Abstract

The algorithm of policy optimization with learning behavior enhancement based on mapping network technology was proposed, aiming to address the issues of lack and sparsity of learning behavior data and weak generalization ability of the model in AI education. Based on the basic recommendation algorithm and the framework of rein- forcement learning, and model introduces the correlation mapping network to realize the transformation of strong and weak correlation, so as to optimize the input agent policy to improve the performance model of course recommendation. Experiment on MOOC da- tasets show that the proposed algorithm model has a stable improvement compared with the baseline models, and can effectively improve the accuracy of course recommendation.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call