Abstract
We study the optimality properties of maximum likelihood ratio estimation based Mean Field (Nash Certainty Equivalence) control laws in a leader-follower stochastic collective dynamics model. In this formulation the leaders track a convex combination of their centroid together with a certain reference trajectory which is unknown to the followers, and each follower reacts by tracking the centroid of the leaders. The followers use a maximum likelihood estimator (based on a fixed ratio sample of the population of the leaders' trajectories) to identify the member of a given finite class of models which is generating the reference trajectory of the leaders. Subject to reasonable conditions, it is shown that each adaptive follower identifies the true reference trajectory model in finite time with probability one as the leaders' population goes to infinity. It is also shown that the leaders' control laws possess an almost sure e-Nash equilibrium property with respect to all other leaders. In this paper we show that the system performance for the adaptive followers is almost surely e-optimal with respect to the leaders.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.