Optimal techniques for class-dependent attribute discretization

N Bryson,A Joseph

doi:10.1057/palgrave.jors.2601174

Abstract

Preprocessing of raw data has been shown to improve performance of knowledge discovery processes. Discretization of quantitative attributes is a key component of preprocessing and has the potential to greatly impact the efficiency of the process and the quality of its outcomes. In attribute discretization, the value domain of an attribute is partitioned into a finite set of intervals so that the attribute can be described using a small number of discrete representations. Discretization therefore involves two decisions, on the number of intervals and the placement of interval boundaries. Previous approaches for quantitative attribute discretization have used heuristic algorithms to identify partitions of the attribute value domain. Therefore, these approaches cannot be guaranteed to provide the optimal solution for the given discretization criterion and number of intervals. In this paper, we use linear programming (LP) methods to formulate the attribute discretization problem. The LP formulation allows the discretization criterion and the number of intervals to be integral considerations of the problem. We conduct experiments and identify optimal solutions for various discretization criteria and numbers of intervals.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Optimal techniques for class-dependent attribute discretization

Abstract

Talk to us

Similar Papers

More From: Journal of the Operational Research Society

Lead the way for us

Journal: Journal of the Operational Research Society	Publication Date: Oct 1, 2001
Citations: 6

Similar Papers

HIGH FIDELITY INTERVAL ASSIGNMENT
Scott A Mitchell
International Journal of Computational Geometry & Applications | VOL. 10
Scott A MitchellScott A Mitchell
01 Aug 2000
International Journal of Computational Geometry & Applications | VOL. 10

Whole Life Cost Comparisons Based upon the Year of Required Protection
Harold J Schleef
The Journal of Risk and Insurance | VOL. 56
Harold J SchleefHarold J Schleef
01 Mar 1989
The Journal of Risk and Insurance | VOL. 56

On the Effectiveness of Discretizing Quantitative Attributes in Linear Classifiers
Nayyar A Zaidi ... Yang Du
IEEE Access | VOL. 8
Nayyar A Zaidi, et. al.Nayyar A Zaidi ... Yang Du
01 Jan 2020
IEEE Access | VOL. 8

An Innovative Formulation Tightening Approach for Job-Shop Scheduling
Bing Yan ... Peter B. Luh
IEEE Transactions on Automation Science and Engineering | VOL. 19
Bing Yan, et. al.Bing Yan ... Peter B. Luh
01 Jul 2022
IEEE Transactions on Automation Science and Engineering | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimal techniques for class-dependent attribute discretization

Abstract

Talk to us

Similar Papers

More From: Journal of the Operational Research Society