Abstract

Random subspace decision forests are commonly used machine learning methods in a wide range of application domains. How to set the random subspace dimensionality ds in decision forests is a considerable issue that impacts classification quality and efficiency, especially for high dimensional cases. To obtain effective and efficient decision forests that are generally suitable for various classification cases, this paper proposes a novel framework, named Efficient Random Subspace decision forest (ERS). A Half-Range Discrete Uniform distribution-based Varied Dimensionality setting (HRDUVD) method is provided for determining the random subspace dimensionality, and the ERS is formed based on the HRDUVD method. In more detail, a simple discrete uniform distribution in a specific range is employed to set with a given probability the number of randomly selected features for each tree in random subspace decision forests. The HRDUVD method removes the hesitation which appropriate ds value one should preset for different datasets, while also achieving adequate classification performance along with a relatively short running time. Therefore, setting ds using the discrete uniform distribution is a highly useful strategy for the proposed ERS.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.