Abstract. A multiscale modeling ensemble chain has been assembled as a first step towards an air quality analysis and forecasting (AQF) system for Latin America. Two global and three regional models were tested and compared in retrospective mode over a shared domain (120–28° W, 60° S–30° N) for the months of January and July 2015. The objective of this experiment was to understand their performance and characterize their errors. Observations from local air quality monitoring networks in Colombia, Chile, Brazil, Mexico, Ecuador and Peru were used for model evaluation. The models generally agreed with observations in large cities such as Mexico City and São Paulo, whereas representing smaller urban areas, such as Bogotá and Santiago, was more challenging. For instance, in Santiago during wintertime, the simulations showed large discrepancies with observations. No single model demonstrated superior performance over others or among pollutants and sites available. In general, ozone and NO2 exhibited the lowest bias and errors, especially in São Paulo and Mexico City. For SO2, the bias and error were close to 200 %, except for Bogotá. The ensemble, created from the median value of all models, was evaluated as well. In some cases, the ensemble outperformed the individual models and mitigated extreme over- or underestimation. However, more research is needed before concluding that the ensemble is the path for an AQF system in Latin America. This study identified certain limitations in the models and global emission inventories, which should be addressed with the involvement and experience of local researchers.