Abstract

In this work, we focus on the development of an AI technology to support decision making for people in leadership positions while facing uncertain environments. We demonstrate an efficient approach based on a stochastic inverse reinforcement leaning (IRL) algorithm constructed by hybridizing the conventional Max-entropy IRL and mixture density network (MDN) for the prediction of transition probability. We took the case study of American football, a sports game with stochastic environment, since the number of yards gainable on the next offence in real American football is usually uncertain during strategy planning and decision making. The expert data for IRL are built using the American football 2017 season data in National Football League (NFL). The American football simulation environment was built by training MDN using the annual NFL data to generate the state transition probability for IRL. Under the framework of Max-Entropy IRL, optimal strategy was successfully obtained through a learnt reward function by trial-and-error communication with the MDN environment. To precisely evaluate the validity of the learnt policy, we have conducted a risk-return analysis and revealed that the trained IRL agent showed higher return and lower risk than the expert data, indicating that it is possible for the proposed IRL algorithm to learn superior policy than the one derived directly from the expert teaching data. Decision-making in an uncertain environment is a general issue, ranging from business operation to management. Our work presented here will likely serve as a general framework for optimal business operation and risk management and contribute especially to the portfolio’s optimization in the financial and energy trading market.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.