The application of non-parametric techniques to solve classification problems in complex data sets in veterinary epidemiology – An example

Katharina D.C Stärk,Dirk U Pfeiffer

doi:10.1016/s1088-467x(99)00003-7

Abstract

Statistical classification problems are very common in veterinary epidemiology. Traditionally, parametric techniques such as logistic regression or discriminant analysis are used to analyse data sets that contain several classes of observations. However, characteristics of the data set such as high dimensionality, multicollinearity and non-homogeneity can make a data set unsuitable for parametric techniques. In this article, classification tree algorithms (ID3, C4.5, CHAID, CART) and artificial neural networks are suggested as non-parametric alternatives. Their application is illustrated using a field data set containing pig farms with 3 levels of respiratory disease prevalence. The performance of non-parametric classification algorithms is compared with results from multinomial logistic regression. None of the algorithms was significantly better than the others. The proportions of correctly classified farms were between 84% and 96%. However, the data set was small (86 observations), which created technical problems when using the artificial neural networks and multinomial logistic regression. The choice of statistical technique should therefore be based on the objectives of the study and the data set under consideration. Classification trees are well-suited for exploratory data analysis. They are easy to apply and worth considering.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

The application of non-parametric techniques to solve classification problems in complex data sets in veterinary epidemiology – An example

Abstract

Talk to us

Similar Papers

More From: Intelligent Data Analysis

Lead the way for us

Journal: Intelligent Data Analysis	Publication Date: May 1, 1999
Citations: 29

Similar Papers

Machine learning-based receiver operating characteristic (ROC) curves for crisp and fuzzy classification of DNA microarrays in cancer research
Leif E Peterson ... Matthew A Coleman
International Journal of Approximate Reasoning | VOL. 47
Leif E Peterson, et. al.Leif E Peterson ... Matthew A Coleman
11 Apr 2007
International Journal of Approximate Reasoning | VOL. 47

Identification of sedimentary facies with well logs: an indirect approach with multinomial logistic regression and artificial neural network
Jinliang Zhang ... Longlong Liu
Arabian Journal of Geosciences | VOL. 10
Jinliang Zhang, et. al.Jinliang Zhang ... Longlong Liu
01 Jun 2017
Arabian Journal of Geosciences | VOL. 10

Application of Classification Tree and Neural Network Algorithms to the Identification of Serological Liver Marker Profiles for the Diagnosis of Hepatocellular Carcinoma
Terence Chuen-Wai Poon ... Stephen King-Wah Ho
Oncology | VOL. 61
Terence Chuen-Wai Poon, et. al.Terence Chuen-Wai Poon ... Stephen King-Wah Ho
01 Nov 2001
Oncology | VOL. 61

Assessing Behavioral Patterns of Motorcyclists Based on Traffic Control Device at City Intersections by Classification Tree Algorithm

-

01 Apr 2018
01 Apr 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The application of non-parametric techniques to solve classification problems in complex data sets in veterinary epidemiology – An example

Abstract

Talk to us

Similar Papers

More From: Intelligent Data Analysis