Implementing Classification in Weka and R

Parteek Bhatia

doi:10.1017/9781108635592.007

Abstract

Chapter Objectives

✓ To demonstrate the use of the decision tree
✓ To apply the decision tree on a sample dataset
✓ To implement a decision tree process using Weka and R

Building a Decision Tree Classifier in Weka

In this chapter, we will learn how Weka's decision tree feature helps to classify unknown samples of a dataset based on their attribute values. When Weka's decision tree is applied to an unknown sample, it assigns the sample to one of several classes, such as Class A, Class B or Class C, as shown in Figure 6.1.

For example, suppose we want to predict the class of an unknown flower from the length and width of its Sepal and Petal. The first step would be to measure the Sepal length and width and the Petal length and width of the unknown flower and compare these dimensions to the values of the samples in our dataset of known species. The decision tree algorithm of Weka will help in creating decision rules to predict the class of the unknown flower automatically, as shown in Figure 6.2.

As Figure 6.2 shows, the dimensions of the unknown flower are matched against the rules generated by the decision tree. First, the rules are checked to determine whether the sample belongs to the Setosa class; if it does, the sample is classified as Setosa. If not, the sample is checked against the conditions of the Virginica class. If it matches them, it is labeled Virginica; otherwise it is labeled Versicolor.

It is important to note that it would not be simple to create these rules from the values of a single attribute, as shown in Table 6.1. For the same Sepal width, the flower may be Setosa, Versicolor or Virginica, so Sepal width alone cannot tell us which species an unknown flower belongs to. Thus, the decision tree must make its prediction based on all four flower dimensions.
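The rule matching described above can be sketched in a few lines of plain Python. This is only an illustration of the Setosa, then Virginica, otherwise Versicolor cascade: the two thresholds and the three sample measurements are assumptions chosen for the example, not the rules of Figure 6.2 or the rows of Table 6.1, and the `Flower` type and `classify` function are hypothetical names.

```python
from typing import NamedTuple


class Flower(NamedTuple):
    """The four measurements of one flower, in centimetres."""
    sepal_length: float
    sepal_width: float
    petal_length: float
    petal_width: float


# Hypothetical thresholds, assumed for illustration only.
# A tree trained in Weka or R would learn its own split points.
SETOSA_MAX_PETAL_LENGTH = 2.5
VIRGINICA_MIN_PETAL_WIDTH = 1.7


def classify(flower: Flower) -> str:
    """Apply the rules in order: Setosa first, then Virginica, else Versicolor."""
    if flower.petal_length < SETOSA_MAX_PETAL_LENGTH:
        return "setosa"
    if flower.petal_width > VIRGINICA_MIN_PETAL_WIDTH:
        return "virginica"
    return "versicolor"


# Three illustrative flowers that share the same Sepal width (3.0),
# so Sepal width alone cannot separate them.
samples = [
    Flower(4.9, 3.0, 1.4, 0.2),
    Flower(5.9, 3.0, 4.2, 1.5),
    Flower(7.1, 3.0, 5.9, 2.1),
]

labels = [classify(f) for f in samples]
for flower, label in zip(samples, labels):
    print(flower, "->", label)
```

All three flowers have an identical Sepal width, yet the cascade separates them because it consults the Petal measurements, which is the point the text makes about relying on more than one attribute.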
Due to such overlaps, the decision tree cannot predict the class of a flower with 100% accuracy; it can only determine the likelihood that an unknown sample belongs to a particular class. In real situations, the decision tree algorithm works on the basis of probability.
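One common way to read "works on the basis of probability" is that a leaf of the tree holds training samples of more than one class, and the class proportions in that leaf serve as the predicted likelihoods. The sketch below assumes this interpretation; the leaf contents (nine Versicolor, one Virginica) are made-up counts and `leaf_probabilities` is a hypothetical helper, not a Weka or R function.

```python
from collections import Counter


def leaf_probabilities(leaf_labels):
    """Return the share of each class among the training samples in one leaf."""
    counts = Counter(leaf_labels)
    total = sum(counts.values())
    return {label: n / total for label, n in counts.items()}


# Hypothetical leaf: made-up counts, assumed for illustration only.
leaf_labels = ["versicolor"] * 9 + ["virginica"] * 1

probs = leaf_probabilities(leaf_labels)

# The predicted class is the most likely one in the leaf.
predicted = max(probs, key=probs.get)

print(probs)
print("predicted class:", predicted)
```

An unknown flower that falls into this leaf would be labeled Versicolor, with the understanding that roughly one in ten training flowers in the same region was Virginica.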

