Abstract

Ensemble classifiers are being widely used for the classification of spectroscopic data. In this regard, the random forest (RF) ensemble has been successfully applied in an array of applications, and has proven to be robust in handling high dimensional data. More recently, several variants of the traditional RF algorithm including rotation forest (rotF) and oblique random forest (oRF) have been applied to classifying high dimensional data. In this study we compare the traditional RF, rotF, and oRF (using three different splitting rules, i.e., ridge regression, partial least squares, and support vector machine) for the classification of healthy and infected Pinus radiata seedlings using high dimensional spectroscopic data. We further test the robustness of these five ensemble classifiers to reduced spectral resolution by spectral resampling (binning) of the original spectral bands. The results showed that the three oblique random forest ensembles outperformed both the traditional RF and rotF ensembles. Additionally, the rotF ensemble proved to be the least robust of the five ensembles tested. Spectral resampling of the original bands provided mixed results. Nevertheless, the results demonstrate that using spectral resampled bands is a promising approach to classifying asymptomatic stress in Pinus radiata seedlings.

Highlights

  • Hyperspectral data is characterized by a large number of contiguous bands, ranging from the visible through to the shortwave infrared portion of the electromagnetic spectrum [1]

  • To better understand the difference in behavior of the random forest (RF) and oblique random forest (oRF) models, we examined the topology of the decision boundary learned by each ensemble classifier (Figure 3)

  • This study aimed to evaluate the performance of various ensemble classifiers for the analysis of high dimensional spectral data

Read more

Summary

Introduction

Hyperspectral data is characterized by a large number of contiguous bands, ranging from the visible through to the shortwave infrared portion of the electromagnetic spectrum [1]. For the analysis of plant stress, the high spectral resolution allows for the detection and quantification of a plant’s physiological response to stress [2] This physiological response is exhibited as subtle variations in a plant’s spectral response, providing the basis for developing stress detection models [3,4]. Hyperspectral data subsequently provides the opportunity to readily monitor pest and disease stress in agricultural crops and forestry, as demonstrated by [3,4,5,6] and others. The visible-near infrared (VNIR) spectrum has been useful for the detection of stress in agricultural crops. The VNIR and shortwave infra-red (SWIR) range was utilized by [3] for Sensors 2016, 16, 1918; doi:10.3390/s16111918 www.mdpi.com/journal/sensors

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call