To enhance the collaborative intelligence of a machine, it is important for the machine to understand what behavior a human may adopt to interact with the machine when performing a task in shared control. In this study, an online behavior learning method is proposed for continuous-time linear human-in-the-loop shared control systems by using the system state data only. A two-player nonzero-sum linear quadratic dynamic game paradigm is used for modeling the control interaction between a human operator and an automation that actively compensates for human control action. In this game model, the cost function representing the human behavior is assumed to have an unknown weighting matrix. Here, we want to learn the human behavior or retrieve the weighting matrix by using the system state data only. Accordingly, a new adaptive inverse differential game (IDG) method, which integrates concurrent learning (CL) and linear matrix inequality (LMI) optimization, is proposed. First, a CL-based adaptive law and an interactive controller of the automation are developed to estimate the feedback gain matrix of the human online, and second, an LMI optimization problem is solved to determine the weighting matrix of the human cost function. Finally, simulation results on a cooperative shared control driver assistance system are provided to elucidate the feasibility of the developed method.
Read full abstract