Abstract

AbstractIn this study we investigate the ability of several different machine learning models to provide probabilistic predictions as to whether interplanetary shocks observed upstream of the Earth at L1 will lead to immediate (Sudden Commencements, SCs) or longer lasting magnetospheric activity (Storm Sudden Commencements, SSCs). Four models are tested including linear (Logistic Regression), nonlinear (Naive Bayes and Gaussian Process), and ensemble (Random Forest) models and are shown to provide skillful and reliable forecasts of SCs with Brier Skill Scores (BSSs) of ∼0.3 and ROC scores >0.8. The most powerful predictive parameter is found to be the range in the interplanetary magnetic field. The models also produce skillful forecasts of SSCs, though with less reliability than was found for SCs. The BSSs and ROC scores returned are ∼0.21 and 0.82, respectively. The most important parameter for these predictions was found to be the minimum observed BZ. The simple parameterization of the shock was tested by including additional features related to magnetospheric indices and their changes during shock impact, resulting in moderate increases in reliability. Several parameters, such as velocity and density, may be able to be more accurately predicted at a longer lead time, for example, from heliospheric imagery. When the input was limited to the velocity and density the models were found to perform well at forecasting SSCs, though with lower reliability than previously (BSSs ∼ 0.16, ROC Scores ∼ 0.8), Finally, the models were tested with hypothetical extreme data beyond current observations, showing dramatically different extrapolations.

Highlights

  • Space weather events are driven by the interaction between the solar wind and the magnetosphere-ionosphere system

  • We investigate the ability of machine learning methods to provide a probabilistic forecast as to whether an observed interplanetary shock will lead to an sudden commencements (SCs), and further whether this will be followed by a geomagnetic storm

  • We first test the ability of the models to provide a probabilistic forecast as to whether an SC will be observed on the ground, using information derived from data in the interval around the interplanetary shock

Read more

Summary

Key Points:

Four models are tested including Linear (Logistic Regression), Non-Linear Bayes and Gaussian Process) and Ensemble (Random Forest). The SC and SSC forecasts are skillful and reliable even when the input data are limited, strongly outperforming climatology. The different models provide distinct extrapolations to unseen parameter space, and require careful application to extreme events. This article has been accepted for publication and undergone full peer review but has not been through the copyediting, typesetting, pagination and proofreading process which may lead to differences between this version and the Version of Record.

Introduction
Data and Catalogs
Interplanetary Shocks and Parameters
Sudden Commencements
Cross-Comparison
Feature Selection
Models
Cross Validation
Baseline and metrics
Results
Forecasting SCs
Forecasting SSCs
Storm Sudden Commencements
Parameterization
Magnetospheric Information
Longer Lead Times
Extreme Events
Summary and Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call