Abstract Gravity waves (GWs) are among the key drivers of the meridional overturning circulation in the mesosphere and upper stratosphere. Their representation in climate models suffers from insufficient resolution and limited observational constraints on their parameterizations. This obscures assessments of middle atmospheric circulation changes in a changing climate. This study presents a comprehensive analysis of stratospheric GW activity above and downstream of the Andes from 1 to 15 August 2019, with special focus on GW representation ranging from an unprecedented kilometer-scale global forecast model (1.4 km ECMWF IFS), ground-based Rayleigh lidar (CORAL) observations, modern reanalysis (ERA5), to a coarse-resolution climate model (EMAC). Resolved vertical flux of zonal GW momentum (GWMF) is found to be stronger by a factor of at least 2–2.5 in IFS compared to ERA5. Compared to resolved GWMF in IFS, parameterizations in ERA5 and EMAC continue to inaccurately generate excessive GWMF poleward of 60°S, yielding prominent differences between resolved and parameterized GWMFs. A like-to-like validation of GW profiles in IFS and ERA5 reveals similar wave structures. Still, even at ∼1 km resolution, the resolved waves in IFS are weaker than those observed by lidar. Further, GWMF estimates across datasets reveal that temperature-based proxies, based on midfrequency approximations for linear GWs, overestimate GWMF due to simplifications and uncertainties in GW wavelength estimation from data. Overall, the analysis provides GWMF benchmarks for parameterization validation and calls for three-dimensional GW parameterizations, better upper-boundary treatment, and vertical resolution increases commensurate with increases in horizontal resolution in models, for a more realistic GW analysis. Significance Statement Gravity wave–induced momentum forcing forms a key component of the middle atmospheric circulation. However, complete knowledge of gravity waves, their atmospheric effects, and their long-term trends are obscured due to limited global observations, and the inability of current climate models to fully resolve them. This study combines a kilometer-scale forecast model, modern reanalysis, and a coarse-resolution climate model to first compare the resolved and parameterized momentum fluxes by gravity waves generated over the Andes, and then evaluate the fluxes using a state-of-the-art ground-based Rayleigh lidar. Our analysis reveals shortcomings in current model parameterizations of gravity waves in the middle atmosphere and highlights the sensitivity of the estimated flux to the formulation used.