Abstract

Motivation. Climate models are computationally so expensive that each model is only run for a very selected set of assumptions. In policy making, this computational complexity makes it difficult to rapidly explore the comparative impact of climate policies, such as quantifying the projected difference of local climate impacts with a 30 vs. 45€ price on carbon (Lütjens et al., 2023). Recently however, machine learning (ML) models have been used to emulate climate models that can rapidly interpolate within existing climate dataset.Related Works. Several deep learning models have been developed to emulate the impact of greenhouse gas emissions onto climate variables such as temperature and precipitation. Currently, the foundation model ClimaX with O(100M-1B) parameters is considered the best performer according to the benchmark datasets, ClimateSet and ClimateBenchv1.0 (Kaltenborn et al., 2023; Nguyen et al., 2023; Watson-Parris et al., 2022).Results. We show that linear pattern scaling, a simple method with O(10K) parameters, is at least on par with the best models for some climate variables, as shown in Fig 1. In particular, the ClimateBenchv1.0 annually-averaged and locally-resolved surface temperatures, precipitation, and 90th percentile precipitation can be well estimated with linear pattern scaling. Our research resurfaces that temperature-dependent climate variables have a mostly linear relationship to cumulative CO2 emissions. As a next step, we will identify the complex climate emulation tasks that are not addressed by linear models and might benefit from deep learning research. To do so, we will plot the data complexity per climate variable and discuss the ML difficulties in multiple spatiotemporal scales, irreversible dynamics, and internal variability. We will conclude with a list of tasks that demand more advanced ML models.Conclusion. Most of the ML-based climate emulation efforts have focused on variables that can be well approximated by linear regression models. Our study reveals the solved and unsolved problems in climate emulation and provides guidance for future research directions.Data and Methods. We use the ClimateBenchv1.0 dataset and will show additional results on ClimateSet and a CMIP climate model that contains many ensemble members. Our model fits one linear regression to map cumulative CO2 emissions, co2(t), to globally- and annually-averaged surface temperature, tas(t). Our model then fits one linear regression model per grid cell to map tas(t) onto 2.5° local surface temperature. Our model is time-independent and uses only co2(t) as input. Our analysis will be available at github.com/blutjens/climate-emulator-tutorial References. Kaltenborn, J. et al., (2023). ClimateSet: A Large-Scale Climate Model Dataset for Machine Learning, in NeurIPS Datasets and Benchmarks Lütjens, B. (2023). Deep Learning Emulators for Accessible Climate Projections, Thesis, Massachusetts Institute of Technology. Nguyen, T. et al., (2023). ClimaX: A foundation model for weather and climate, in ICML Watson-Parris, D. et al. (2022). ClimateBenchv1.0: A Benchmark for Data-Driven Climate Projections, in JAMES

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.