This paper proposes an online adaptive dynamic programming-based controller for the variable-speed wind turbine system, ensuring it tracks the maximum power point under uncertain conditions. This control scheme is composed of two components: the adaptive optimal control component and the high-order disturbance observer-based adaptive sliding mode control component. The adaptive optimal control part is designed using the online adaptive dynamic programming technique, which is one of the algorithms of the reinforcement learning method. This control component has the role of achieving the optimal characteristics for the nonlinear system. Likewise, the high-order disturbance observer is established to estimate system uncertainties and external disturbances. These approximated disturbances are then fed to the adaptive sliding mode control component for system disturbance compensation. As a result, the system, which is affected by the unknown disturbances, achieves robust optimal performance. In addition, the convergence of the adaptive laws and the stability of the overall system are ensured through the Lyapunov theory. Finally, comparative simulations are conducted to validate the advantages of the proposed control scheme in comparison to the existing methods.