Abstract

Batch mode active learning, in which a batch of samples is selected and labeled simultaneously, is challenging because the selected samples must be both informative and diverse. We propose a novel batch mode active learning method that balances informativeness and representativeness using multi-set clustering. The method uses a sequential active learner to preserve informativeness: the learner ranks the unlabeled samples, and this ranking is used to construct multiple informative sets for the subsequent clusterings. K-means clustering is then used to reduce redundancy among these samples and to improve representativeness. Finally, the batch that minimizes the expected predictive variance over all the data is chosen as the optimal batch. Our experimental results on a large number of benchmark datasets demonstrate excellent performance of the proposed method in comparison with current state-of-the-art batch mode active learning approaches.
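
The pipeline described above (rank, build several informative sets, cluster each set, keep the batch with the lowest variance) can be illustrated with a minimal sketch. This is not the authors' implementation, and several pieces are assumptions made only for illustration: the informativeness `scores` are taken as given (standing in for the sequential active learner), the informative sets are simply the top-m ranked samples for a few hypothetical values of m, k-means is a plain Lloyd iteration, and the expected predictive variance is approximated by the summed predictive variance of a ridge-regularized linear model. All function and parameter names are hypothetical.

```python
import numpy as np

def kmeans(X, k, n_iter=20, seed=0):
    """Plain Lloyd's k-means; returns the k cluster centers."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)].copy()
    for _ in range(n_iter):
        # Squared distance from every point to every center.
        d = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d.argmin(axis=1)
        for j in range(k):
            members = X[labels == j]
            if len(members):  # leave an empty cluster's center unchanged
                centers[j] = members.mean(axis=0)
    return centers

def representatives(X, cand, centers):
    """For each center, pick the nearest candidate not already chosen."""
    chosen = []
    for c in centers:
        d = ((X[cand] - c) ** 2).sum(axis=1)
        for i in np.argsort(d):
            if cand[i] not in chosen:
                chosen.append(cand[i])
                break
    return np.array(chosen)

def total_variance(X, train_idx, ridge):
    """Assumed proxy: sum over all x of x^T (X_S^T X_S + ridge*I)^{-1} x."""
    Xs = X[train_idx]
    A = Xs.T @ Xs + ridge * np.eye(X.shape[1])
    return float(np.sum(X * np.linalg.solve(A, X.T).T))

def select_batch(X, labeled_idx, scores, batch_size, set_sizes, ridge=1e-3):
    """Return (batch indices, variance proxy) for the best candidate batch."""
    unlabeled = np.setdiff1d(np.arange(len(X)), labeled_idx)
    # Rank unlabeled samples, most informative first.
    ranked = unlabeled[np.argsort(-scores[unlabeled])]
    best_batch, best_var = None, np.inf
    for m in set_sizes:
        cand = ranked[:m]  # one informative set: the top-m samples
        if len(cand) < batch_size:
            continue
        centers = kmeans(X[cand], batch_size)
        batch = representatives(X, cand, centers)
        var = total_variance(X, np.concatenate([labeled_idx, batch]), ridge)
        if var < best_var:
            best_batch, best_var = batch, var
    return best_batch, best_var
```

Calling `select_batch(X, labeled_idx, scores, batch_size=5, set_sizes=[10, 20, 40])` returns five distinct unlabeled indices drawn from one of the three informative sets. Smaller sets favor informativeness, larger sets leave more room for diversity, and the variance proxy arbitrates between them.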
