Liver Disorders Dataset Research Articles

Discretization is one of the commonly used data preprocessing technique to improve the efficiency of the knowledge extraction process on clinical data. Generally, clinical data contains numeric attributes with continuous values. Data discretization simplifies the original data by transforming continuous data attribute values into a finite set of intervals. Although discretization is capable of handling continuous attributes on clinical data, there are cases where discretization is not an appropriate technique for handling continuous attributes. There are instances where attribute values are vague, imprecise and have multiple distributions with different classes, which challenges the process of mining in clinical data. Hence, there is a need for fuzzy discretization to pre-process the clinical data before mining. The aim of this study is to derive fuzzy discretization from crisp-interval discretization using geometric approach for constructing fuzzy sets, where overlapping region between the fuzzy sets is represented as geometric area. This study comprises of three steps: First, non-overlapping fuzzy sets are constructed using intervals generated from crisp-interval discretization. Second, area of overlapping between the fuzzy sets is computed based on the geometric approach and an average area of overlapping is estimated. Third, fuzzy sets are redesigned based on the estimated average area of overlapping. Fuzzy discretizations for three, five and seven intervals have been examined using Pima Indian Diabetes dataset (PID) and Bupa Liver Disorder dataset (BLD) taken from the University of California Irvine machine learning repository. The variation in performance of crisp and fuzzy discretization methods is measured using six classification approaches namely, tree based approach, probabilistic induction based approach, rule-based approach, network learning approach, kernel-based approach and distance-based approach and a rule-based fuzzy inference system. The results show that the classification accuracy remains stable with less deviation across different classifiers with varying intervals.

Read full abstract

Background and objectivesRule-based classification is a typical data mining task that is being used in several medical diagnosis and decision support systems. The rules stored in the rule base have an impact on classification efficiency. Rule sets that are extracted with data mining tools and techniques are optimized using heuristic or meta-heuristic approaches in order to improve the quality of the rule base. In this work, a meta-heuristic approach called Wind-driven Swarm Optimization (WSO) is used. The uniqueness of this work lies in the biological inspiration that underlies the algorithm. MethodsWSO uses Jval, a new metric, to evaluate the efficiency of a rule-based classifier. Rules are extracted from decision trees. WSO is used to obtain different permutations and combinations of rules whereby the optimal ruleset that satisfies the requirement of the developer is used for predicting the test data. The performance of various extensions of decision trees, namely, RIPPER, PART, FURIA and Decision Tables are analyzed. The efficiency of WSO is also compared with the traditional Particle Swarm Optimization. ResultsExperiments were carried out with six benchmark medical datasets. The traditional C4.5 algorithm yields 62.89% accuracy with 43 rules for liver disorders dataset where as WSO yields 64.60% with 19 rules. For Heart disease dataset, C4.5 is 68.64% accurate with 98 rules where as WSO is 77.8% accurate with 34 rules. The normalized standard deviation for accuracy of PSO and WSO are 0.5921 and 0.5846 respectively. ConclusionWSO provides accurate and concise rulesets. PSO yields results similar to that of WSO but the novelty of WSO lies in its biological motivation and it is customization for rule base optimization. The trade-off between the prediction accuracy and the size of the rule base is optimized during the design and development of rule-based clinical decision support system. The efficiency of a decision support system relies on the content of the rule base and classification accuracy.

Read full abstract

Liver Disorders Dataset Research Articles

Articles published on Liver Disorders Dataset

A non-linear optimization based robust attribute weighting model for the two-class classification problems.

CAD System for Liver Diseases using Histological and Imaging features

A new similarity-based classifier with Dombi aggregative operators

Neural network parameters optimization with genetic algorithm to improve liver disease estimation

A novel machine learning technique for computer-aided diagnosis

Improved XGBoost model based on genetic algorithm

Improved XGBoost model based on genetic algorithm

Bio-inspired weighed quantum particle swarm optimization and smooth support vector machine ensembles for identification of abnormalities in medical data

An intelligent quality-based approach to fusing multi-source possibilistic information

Comparative Performance Analysis of Various Classifiers for Cloud E-Health Users

Statistical Comparison of Classification Algorithms for Medical Datasets

Fuzzy Discretization based Classification of Medical Data

RBF Neural Network (RBFNN) using Density Based Clustering for Liver Disorder Dataset

DBSCAN BASED SEED INITIALIZATION OF K-MEANS ALGORITHM

Generalized interaction LASSO based on alternating direction method of multipliers for liver disease classification

A neuron model with synaptic nonlinearities in a dendritic tree for liver disorders

Analysis of Data Mining Techniques for Healthcare Decision Support System Using Liver Disorder Dataset

A Swarm Optimization approach for clinical knowledge mining

Reverse Sequential Covering Algorithm for Medical Data Mining

Reverse Sequential Covering Algorithm for Medical Data Mining

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Liver Disorders Dataset Research Articles

Articles published on Liver Disorders Dataset

A non-linear optimization based robust attribute weighting model for the two-class classification problems.

CAD System for Liver Diseases using Histological and Imaging features

A new similarity-based classifier with Dombi aggregative operators

Neural network parameters optimization with genetic algorithm to improve liver disease estimation

A novel machine learning technique for computer-aided diagnosis

Improved XGBoost model based on genetic algorithm

Improved XGBoost model based on genetic algorithm

Bio-inspired weighed quantum particle swarm optimization and smooth support vector machine ensembles for identification of abnormalities in medical data

An intelligent quality-based approach to fusing multi-source possibilistic information

Comparative Performance Analysis of Various Classifiers for Cloud E-Health Users

Statistical Comparison of Classification Algorithms for Medical Datasets

Fuzzy Discretization based Classification of Medical Data

RBF Neural Network (RBFNN) using Density Based Clustering for Liver Disorder Dataset

DBSCAN BASED SEED INITIALIZATION OF K-MEANS ALGORITHM

Generalized interaction LASSO based on alternating direction method of multipliers for liver disease classification

A neuron model with synaptic nonlinearities in a dendritic tree for liver disorders

Analysis of Data Mining Techniques for Healthcare Decision Support System Using Liver Disorder Dataset

A Swarm Optimization approach for clinical knowledge mining

Reverse Sequential Covering Algorithm for Medical Data Mining

Reverse Sequential Covering Algorithm for Medical Data Mining