Abstract

Abstract: Machine learning is a technique of optimizing a performance criterion using example data and past experience. Data in machine learning plays a key role, and machine leaning tools are used to discover and learn knowledge from the datasets stored. The purpose of this research is to build a model that can predict the determinant factors for crop production status using machine learning techniques as a means of visualizing the data. In order to conduct this research supervised machine learning techniques were employed. For the purpose of this research, the datasets were collected from selected region agricultural offices. The data sets used for the training and testing of the predictive model is 10,000 instances with 41 regular attributes. As a result, for identifying the determinant factors Rapid Miner machine learning tool was used. In order to find the best predictive modeling technique different experiments were conducted using Random Forest, Decision tree, Naive Bays and ID3 predictive models. To validate the predictive performance of the selected models split and cross validation testing methods was used. As the findings of this research shows that, Random Forest and decision tree models were performed the highest accuracy and precision than others. Therefore, the Random Forest predictive modeling have been used to predict the determinate factors form small and large datasets.

Highlights

  • Machine learning is a technique of optimizing a performance criterion using example data and past experience [1]

  • As Machine learning is a process of self-improvement using the system itself, and computer programs can automatically improve performance with the accumulation of experience

  • There is a large amount of data accumulation in different industries such as telecommunications, financial institutions, and research institutions, so far there are problems and needs of applying machine learning methods to train the data and to enable the machine can predict new values from the existing large datasets

Read more

Summary

Introduction

Machine learning is a technique of optimizing a performance criterion using example data and past experience [1]. Data in machine learning plays an indispensable role, and the learning algorithm is used to discover and learn knowledge or properties from the data. As Machine learning is a process of self-improvement using the system itself, and computer programs can automatically improve performance with the accumulation of experience. It is proposed for many specific learning tasks, so that computers can extract features from many data and discover hidden rules [2]. This research attempts to explore the problems on the existing agricultural data and applying machine learning techniques as a predictive model.

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call