Abstract

Random <span>forest is an ensemble algorithm for machine learning. In decision trees, the splitting criteria is built on the prediction of the nodal points and formation of rules by Gini index and Information Gain. Gini index is a measure of inequality. Gini index does not take into consideration the structural changes in the dataset, and inaccurate data can distort the validity of the gini-coefficient. For data with the same feature but different outcomes, the gini-coefficient remained the same. The proposed method for attribute selection measure takes into consideration that there may be structural changes in the dataset overtime and it adapts to such expected changes and maintain the accuracy of the algorithm avoiding under-fitting and over-fitting. A dataset on myocardial infarctions was taken for the study and the results were promising.</span>

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call