Abstract

Many machine learning algorithms have been used to classify pixels in Landsat imagery. The maximum likelihood classifier is the widely-accepted classifier. Non-parametric methods of classification include neural networks and decision trees. In this research work, we implemented decision trees using the C4.5 algorithm to classify pixels of a scene from Juneau, Alaska area obtained with Landsat 8, Operation Land Imager (OLI). One of the concerns with decision trees is that they are often over fitted with training set data, which yields less accuracy in classifying unknown data. To study the effect of overfitting, we have considered noisy training set data and built decision trees using randomly-selected training samples with variable sample sizes. One of the ways to overcome the overfitting problem is pruning a decision tree. We have generated pruned trees with data sets of various sizes and compared the accuracy obtained with pruned trees to the accuracy obtained with full decision trees. Furthermore, we extracted knowledge regarding classification rules from the pruned tree. To validate the rules, we built a fuzzy inference system (FIS) and reclassified the dataset. In designing the FIS, we used threshold values obtained from extracted rules to define input membership functions and used the extracted rules as the rule-base. The classification results obtained from decision trees and the FIS are evaluated using the overall accuracy obtained from the confusion matrix.

Highlights

  • Many pixel-based classification and clustering algorithms have been developed to analyze Landsat images

  • The support vector machine (SVM) algorithm is appealing for Landsat data analysis because of its ability to successfully handle small datasets, often producing higher classification accuracy than traditional methods [2]

  • We considered the problem of overfitting the decision tree

Read more

Summary

Introduction

Many pixel-based classification and clustering algorithms have been developed to analyze Landsat images. These include the minimum distance classifier, maximum likelihood classifier (MLC), and non-parametric techniques such as the support vector machine (SVM), decision tree (DT), ensemble of decision trees, multi-layered perceptron model, fuzzy inference system, and fuzzy neural networks. Huang et al [6] have used the SVM algorithm to classify pixels in remotely sensed images. They have shown that for most training cases slightly higher accuracies were achieved when the model was trained with a randomly selected fixed number of samples for each class. Moumtrakis et al [8] have provided a review of usage of SVM in remote sensing

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.