Abstract “Supermodeling” climate by allowing different models to assimilate data from one another in run time has been shown to give results superior to those of any one model and superior to any weighted average of model outputs. The only free parameters, connection strengths between corresponding variables in each pair of models, are determined using some form of machine learning. It is demonstrated that supermodeling succeeds because near critical states, interscale interactions are important but unresolved processes cannot be effectively represented diagnostically in any single parameterization scheme. In two examples, a pair of toy quasigeostrophic (QG) channel models of the midlatitudes and a pair of ECHAM5 models of the tropical Pacific atmosphere with a common ocean, supermodels dynamically combine parameterization schemes so as to capture criticality, associated critical structures, and the supporting scale interactions. The QG supermodeling scheme extends a previous configuration in which two such models synchronize with intermodel connections only between medium-scale components of the flow; here the connections are trained against a third “real” model. Intermittent blocking patterns characterize the critical behavior thus obtained, even where such patterns are missing in the constituent models. In the ECHAM-based climate supermodel, the corresponding critical structure is the single ITCZ pattern, a pattern that occurs in neither of the constituent models. For supermodels of both types, power spectra indicate enhanced interscale interactions in frequency or energy ranges of physical interest, in agreement with observed data, and supporting a generalized form of the self-organized criticality hypothesis. Significance Statement In a “supermodel” of Earth’s climate, alternative models (climate simulations), which differ in the way they represent processes on the smallest scales, are trained to exchange information as they run, adjusting to one another much as weather prediction models adjust to new observations. They form a consensus, capturing atmospheric behaviors that have eluded all the separate models. We demonstrate that simplified supermodels succeed, where no single approach can, by correctly representing critical phenomena involving sudden qualitative transitions, such as occur in El Niño events, that depend on interactions among atmospheric processes on many different scales in space and time. The correct reproduction of critical phenomena is vital both for predicting weather and for projecting the effects of climate change.