AbstractThe distinctive monsoon climate over East Asia, which is affected by the vast Eurasian continent and Pacific Ocean basin and the high-altitude Tibetan Plateau, provides arguably the best testbed for evaluating the competence of Earth system climate models. Here, a set of diagnostic metrics, consisting of 14 items and 7 variables, is specifically developed. This physically intuitive set of metrics focuses on the essential features of the East Asian summer monsoon (EASM) and East Asian winter monsoon (EAWM), and includes fields that depict the climatology, the major modes of variability, and unique characteristics of the EASM. The metrics are applied to multimodel historical simulations derived from 20 models that participated in phases 3 and 5 of the Coupled Model Intercomparison Project (CMIP3 and CMIP5, respectively), along with the newly developed Nanjing University of Information Science and Technology Earth System Model, version 3. The CMIP5 models show significant improvements over the CMIP3 models in terms of the simulated East Asian monsoon circulation systems on a regional scale, major modes of EAWM variability, the monsoon domain and precipitation intensity, and teleconnection associated with the heat source over the Philippine Sea. Clear deficiencies persist from CMIP3 to CMIP5 with respect to capturing the major modes of EASM variability, as well as the relationship between the EASM and ENSO during El Niño developing and decay phases. The possible origins that affect models’ performance are also discussed. The metrics provide a tool for evaluating the performance of Earth system climate models, and facilitating the assessment of past and projected future changes of the East Asian monsoon.