Smoothing Categorical Data

Arno Siebes,René Kersten

doi:10.1007/978-3-642-33460-3_8

Abstract

AbstractGlobal models of a dataset reflect not only the large scale structure of the data distribution, they also reflect small(er) scale structure. Hence, if one wants to see the large scale structure, one should somehow subtract this smaller scale structure from the model.While for some kinds of model – such as boosted classifiers – it is easy to see the “important” components, for many kind of models this is far harder, if at all possible. In such cases one might try an implicit approach: simplify the data distribution without changing the large scale structure. That is, one might first smooth the local structure out of the dataset. Then induce a new model from this smoothed dataset. This new model should now reflect the large scale structure of the original dataset. In this paper we propose such a smoothing for categorical data and for one particular type of models, viz., code tables.By experiments we show that our approach preserves the large scale structure of a dataset well. That is, the smoothed dataset is simpler while the original and smoothed datasets share the same large scale structure.KeywordsLocal StructureLarge Scale StructureOriginal DatasetMinimal SupportPattern MiningThese keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Smoothing Categorical Data

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Large Scale Structures in Rayleigh-Bénard Convection at High Rayleigh Numbers
T Hartlep ... A Tilgner
Physical Review Letters | VOL. 91
T Hartlep, et. al.T Hartlep ... A Tilgner
04 Aug 2003
Physical Review Letters | VOL. 91

Dynamics of alpha helix formation in the CSAW model
J Lei ... K Huang
The European Physical Journal E | VOL. 27
J Lei, et. al.J Lei ... K Huang
30 Sep 2008
The European Physical Journal E | VOL. 27

Monte Carlo Simulation of two-dimensional Kolmogorov flow
Jun Zhang ... Jing Fan
-
Jun Zhang, et. al.Jun Zhang ... Jing Fan
01 Jan 2010
01 Jan 2010

The Formation of Halos via Mergers. The Organized and Organizing Dynamics of Mergers
P. J. Quinn ... W. H. Zurek
-
P. J. Quinn, et. al.P. J. Quinn ... W. H. Zurek
01 Jan 1990
01 Jan 1990

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Smoothing Categorical Data

Abstract

Talk to us

Similar Papers