Targeting: Logistic Regression, Special Cases and Extensions

Helmut Schaeben

doi:10.3390/ijgi3041387

Abstract

Logistic regression is a classical linear model for logit-transformed conditional probabilities of a binary target variable. It recovers the true conditional probabilities if the joint distribution of predictors and the target is of log-linear form. Weights-of-evidence is an ordinary logistic regression with parameters equal to the differences of the weights of evidence if all predictor variables are discrete and conditionally independent given the target variable. The hypothesis of conditional independence can be tested in terms of log-linear models. If the assumption of conditional independence is violated, the application of weights-of-evidence does not only corrupt the predicted conditional probabilities, but also their rank transform. Logistic regression models, including the interaction terms, can account for the lack of conditional independence, appropriate interaction terms compensate exactly for violations of conditional independence. Multilayer artificial neural nets may be seen as nested regression-like models, with some sigmoidal activation function. Most often, the logistic function is used as the activation function. If the net topology, i.e., its control, is sufficiently versatile to mimic interaction terms, artificial neural nets are able to account for violations of conditional independence and yield very similar results. Weights-of-evidence cannot reasonably include interaction terms; subsequent modifications of the weights, as often suggested, cannot emulate the effect of interaction terms.

Highlights

The objective of potential modeling or targeting [1] is to identify locations, i.e., pixels or voxels, for which the probability of an event spatially referenced in this way, e.g., a well-defined type of ore mineralization, is relatively maximum, i.e., is larger than in neighbor pixels or voxels
Lacking conditional independence can be exactly compensated for by corresponding interaction terms included in the logistic regression model, and the resulting logistic regression model with interaction terms is optimum for continuous predictor variables if the joint distribution of the target variable and the predictor variables is of a log-linear form
Targeting or potential modeling applies regression or regression-like models to estimate the conditional probability of a target variable given predictor variables

Summary

Introduction

The objective of potential modeling or targeting [1] is to identify locations, i.e., pixels or voxels, for which the probability of an event spatially referenced in this way, e.g., a well-defined type of ore mineralization, is relatively maximum, i.e., is larger than in neighbor pixels or voxels. Conceptual models of ore deposits have been compiled by [2] They may be read as factor models (in the sense of mathematical statistics), and a proper factor model may be turned into a regression-type model when using the factors as spatially-referenced predictors, which are favorable to or prohibitive of the target event. The pixels or voxels initially provide the physical support of the predictors and the target and will be assigned the predicted conditional probability and the associated estimation errors, respectively. If the spatial resolution provided by the pixels or voxels is poor with respect to the area or volume of the actual physical support of the predictors or target, the numerical results of any kind of mathematical method of targeting are rather an artifact of the inappropriate spatial resolution. Potential modeling applies the assumption of independently identically distributed random variables Their distribution does not depend on the location.

The Modeling Assumption of Conditional Independence

Logistic Regression

Weights-of-Evidence

Testing Conditional Independence

Artificial Neural Nets

Balancing

Numerical Complexity of Logistic Regression

Examples

Dataset RANKIT Revisited

Dataset DFQR

Conclusions

Findings

Derivation of Weights-of-Evidence in Elementary Terms

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: ISPRS International Journal of Geo-Information	Publication Date: Dec 11, 2014
Citations: 25	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Targeting: Logistic Regression, Special Cases and Extensions

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: ISPRS International Journal of Geo-Information

Lead the way for us

Similar Papers

Potential modeling: conditional independence matters
Helmut Schaeben
GEM - International Journal on Geomathematics | VOL. 5
Helmut SchaebenHelmut Schaeben
20 Mar 2014
GEM - International Journal on Geomathematics | VOL. 5

A Mathematical View of Weights-of-Evidence, Conditional Independence, and Logistic Regression in Terms of Markov Random Fields
Helmut Schaeben
Mathematical Geosciences | VOL. 46
Helmut SchaebenHelmut Schaeben
15 Jan 2014
Mathematical Geosciences | VOL. 46

Testing the conditional independence and monotonicity assumptions of item response theory
Paul R Rosenbaum
Psychometrika | VOL. 49
Paul R RosenbaumPaul R Rosenbaum
01 Sep 1984
Psychometrika | VOL. 49

The quest for conditional independence in prospectivity modeling: weights-of-evidence, boost weights-of-evidence, and logistic regression
Helmut Schaeben ... Georg Semmler
Frontiers of Earth Science | VOL. 10
Helmut Schaeben, et. al.Helmut Schaeben ... Georg Semmler
17 May 2016
Frontiers of Earth Science | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Targeting: Logistic Regression, Special Cases and Extensions

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: ISPRS International Journal of Geo-Information