The evaluation of algal bloom forecasting models typically relies on error metrics that summarize forecasting performance over the whole test set as a single number. Furthermore, comparison with simple baseline methods is often omitted. To address these shortcomings, we introduce a novel framework for Model performance Analysis and Visualization of time series forecasting (MAVts). MAVts incorporates novel algorithms for the automatic identification and visualization of time series periods of interest, over which forecasting models are evaluated and compared with simple baseline methods. Applying MAVts to algal bloom forecasting models built on sophisticated machine learning (ML) methods reveals that in 85% of experiments a single error metric is not enough, and that in only 12.5% of experiments does an ML model outperform all baselines on all metrics and periods of interest. Thus, MAVts emerges as a valuable tool for analyzing and comparing ML models, advancing environmental management and protection.