The key aim of this research project is to design and evaluate advanced machine learning models for increasing accuracy in rainfall forecasting over the USA. We intended to investigate nonlinear relationships typical of the atmospheric variables using state-of-the-art ML methods for more accurate rainfall predictions. For this research project on rainfall forecasting in the USA, we utilized an extensive dataset that comprises historical rainfall data collected from the National Oceanic and Atmospheric Administration (NOAA) and other meteorological agencies. The main dataset we use in this paper consists of daily rainfall measurements across various geographical locations of the USA, thus capturing the wide-ranging historical data necessary for both training and validation of the model. Besides measuring rainfall, we included meteorological data from sources such as NOAA's Global Historical Climatology Network and NASA's Modern-Era Retrospective Analysis for Research and Applications. These datasets further provided key variables that are known to affect rain, including temperature, humidity, wind speed, and atmospheric pressure. The performance metrics used in this research work for the models considered include accuracy, precision, recall, and F1 score. The above table shows that the Random Forest Classifier outperformed the other models, achieving perfect accuracy. That indicated that it rightly classified all the instances in the test set. The Logistic Regression and Support Vector Machine models gave a quite good performance by giving above average accuracy but had lower precision and recall for the rainfall prediction. Accurate rainfall forecasting has direct consequences on agriculture, greatly empowering farmers and agricultural planners to make more effective decisions regarding planting, harvesting, and crop management. The forecasts of rainfall are also of critical importance in disaster management regarding planning for flood emergencies. Moreover, precise forecasting of rainfall, particularly in sustainable water resources management, presents the most important data in planning for and conserving these resources.
Read full abstract