Abstract

Abstract. Cochlodinium polykrikoides (C. polykrikoides) is a phytoplankton that causes red tides every year in the middle of the South Sea of Korea. C. polykrikoides is a harmful Algae that has migratory ability and causes the fisheries damage over a long period of wide sea area if it causes red tide once. To minimize red tide damage, it is important to anticipate and prepare the red tide occurrence timing and location in advance. In this study, we predicted the occurrence of red tide of C. polykrikoides using machine learning techniques and compared the results of each algorithm. Logistic regression model, decision tree model, and multilayer neural network model were used for prediction of red tide occurrence. To produce the data set for model learning, we used the red tide occurrence map provided by the National Institute of Fisheries Science, the Local Data Assimilation and Prediction System (LDAPS) provided by the Korea Meteorological Agency, and the G1SST provided by the National Oceanic and Atmospheric Administration (NOAA). The feature vectors used for modeling consisted of 59 elements, which were made by using temperature, water temperature, precipitation, solar radiation, wind direction and wind speed. Only a very small number of red tide cases can be collected compared to the case of no red tide cases. Thus, an imbalance data problem arises in the data set. To overcome this imbalanced data problem, we used adding noise after oversampling to data of red tide occurrence to solve the difference of data between two classes.The data set is divided into 8 : 2 to prevent over-fitting and 80% is used as the learning data. The remaining 20% was used to evaluate the performance of each model. As a result of evaluating the prediction performance of each model, the multilayer neural network model showed the highest prediction accuracy.

Highlights

  • A red tide is a phenomenon that sea surface color changes by phytoplankton in a special environmental condition

  • In order to solve this problem, we propose a new red tide prediction model based on machine learning

  • Since the machine learning based model uses the correlation between data and natural phenomenon instead of causal relation, it is possible to develop the model without knowing the causal relation

Read more

Summary

INTRODUCTION

A red tide is a phenomenon that sea surface color changes by phytoplankton in a special environmental condition. In Korea 67 species have been reported to cause of red tide occurrence(Kim et al, 2005; Kim et al, 1998, Yoon, 2012). Spraying yellow clay had problem that losing economic cost(Purchasing, storage and management costs) and ineffective after wide area blooming. In order to develop a model, a causal relation must be derived by analyzing a large amount of data. We can not develop a model for a natural phenomenon that causal relation can not derived. In order to solve this problem, we propose a new red tide prediction model based on machine learning. Since the machine learning based model uses the correlation between data and natural phenomenon instead of causal relation, it is possible to develop the model without knowing the causal relation

DATA AND METHOD
AND DISCUSSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call