Abstract. Accurate estimation of wind speed at wind turbine hub height is of significance for wind energy assessment and exploitation. Nevertheless, the traditional power law method (PLM) generally estimates the hub-height wind speed by assuming a constant exponent between surface and hub-height wind speed. This inevitably leads to significant uncertainties in estimating the wind speed profile especially under unstable conditions. To minimize the uncertainties, we here use a machine learning algorithm known as random forest (RF) to estimate the wind speed at hub heights such as at 120 m (WS120), 160 m (WS160), and 200 m (WS200). These heights go beyond the traditional wind mast limit of 100–120 m. The radar wind profiler and surface synoptic observations at the Qingdao station from May 2018 to August 2020 are used as key inputs to develop the RF model. A deep analysis of the RF model construction has been performed to ensure its applicability. Afterwards, the RF model and the PLM model are used to retrieve WS120, WS160, and WS200. The comparison analyses from both RF and PLM models are performed against radiosonde wind measurements. At 120 m, the RF model shows a relatively higher correlation coefficient R of 0.93 and a smaller RMSE of 1.09 m s−1, compared with the R of 0.89 and RMSE of 1.50 m s−1 for the PLM. Notably, the metrics used to determine the performance of the model decline sharply with height for the PLM model, as opposed to the stable variation for the RF model. This suggests the RF model exhibits advantages over the traditional PLM model. This is because the RF model considers well the factors such as surface friction and heat transfer. The diurnal and seasonal variations in WS120, WS160, and WS200 from RF are then analyzed. The hourly WS120 is large during daytime from 09:00 to 16:00 local solar time (LST) and reach a peak at 14:00 LST. The seasonal WS120 is large in spring and winter and is low in summer and autumn. The diurnal and seasonal variations in WS160 and WS200 are similar to those of WS120. Finally, we investigated the absolute percentage error (APE) of wind power density between the RF and PLM models at different heights. In the vertical direction, the APE is gradually increased as the height increases. Overall, the PLM algorithm has some limitations in estimating wind speed at hub height. The RF model, which combines more observations or auxiliary data, is more suitable for the hub-height wind speed estimation. These findings obtained here have great implications for development and utilization in the wind energy industry in the future.