Abstract
The delta-generalized additive model (Delta_GAM) is commonly used for analyzing zero-inflated continuous data, and has been widely applied in egg production methods (EPMs). It consists of two GAMs: one with a binomial distribution to estimate the probability of non-zero values, and the other with a log-normal distribution (Delta_LN model) or a gamma distribution (Delta_LG model) to model the continuous non-zero values. However, the rather restrictive distribution assumptions are not fulfilled for egg production data. In this study, we modified the Delta_GAMs using two machine learning techniques: random forest (Delta_RF) and support vector machines (Delta_SVM). We applied the tenfold cross-validation procedure to compare the performance of these four models using root mean square error (RMSE) and the EPM survey data of small yellow croaker Larimichthys polyactis, mullet Liza haematocheilus and gizzard shad Konosirus punctatus from Haizhou Bay, China. Both the Delta_RF and Delta_SVM models showed superior performance to that of the Delta_LN and Delta_LG models. Predicted spatial and temporal distributions varied among the models, although predictive performance varied little. The annual egg production was predicted and estimated with large uncertainty. We propose that machine learning techniques such as RFs and SVMs be used to model zero-inflated continuous data from EPM surveys, which tend to provide a more reliable estimation of annual egg production (AEP).
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.