Abstract

This paper proposes a novel algorithm built on a reinforcement learning approach as an alternative to standard hyperparameter tuning methods for improving the feature selection process to obtain better classification performance. Various techniques have been proposed to handle the problem, and among them are metaheuristics, such as Particle Swarm Optimization (PSO). However, a new problem arises as learning algorithms must often be tuned in terms of their hyperparameters. We need to solve this problem separately, and vary of methods can be used for this matter. The hyperparameter optimization for the learning process of PSO can be seen as a sequence decision-making process. Hence, we can solve it with the multi-armed bandit problem algorithms.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.