AbstractVertical exchange between the atmospheric boundary layer (ABL) and free troposphere (FT) is a key link in coupling the earth's surface and upper atmosphere. This process is usually quantified by numerical simulations, while its reliability is not well assessed until now. Using space‐time intensified ABL observations, we evaluate the ABL‐FT air mass exchange flux derived from the Weather Research and Forecast (WRF) model. A six‐site sounding experiment is conducted in the North China Plain during the wintertime of 2019. The measured data is processed to provide enough information to derive the vertical exchange flux corresponding to the model‐based result, so that a systematic comparison is conducted. Three physical processes involved in ABL‐FT vertical exchange are quantitatively evaluated, that is, temporal variation of ABL height, advection across the inclined ABL top, and vertical motion at the ABL‐FT interface. Results show that the model‐based and observation‐based fluxes are generally agreed in temporal evolution (R = 0.67, p < 0.01), both characterized by 4–6 days periodicity and diurnal cycle. Their relative mean error was about 45% during the whole study period, mainly stemming from the vertical motion term and the advection crossing term. The model inaccuracy in representing these relevant processes at the ABL top is largely responsible for the discrepancy. Besides, the difference may also be attributed to the observational uncertainty (∼22%) that is caused by the measurement's difficulties in determining ABL spatial variation and acquiring vertical velocity. Through this study, the credibility and limitation of the WRF model in deriving ABL‐FT exchange flux are quantified.