Abstract

Multi-label optimal margin distribution machine (mlODM) is an efficient algorithm for multi-label classification. Although it can achieve great generalization performance, it is inefficient for large-scale datasets due to the huge number of label pairs. Motivated by its sparse solution, in this paper, we propose a two-stage gap safe screening rule for accelerating mlODM, termed as TSSR. First, a sequential safe screening rule (SSSR) based on gap is designed to screen out part of redundant label pairs prior to training, which reduces the scale of mlODM. Compared with the previous DVI rule, our method ensures absolute safety without destroying efficiency. In the second stage, to further speed up the solving process, a dynamic safe screening rule (DSSR) is embedded into the solving algorithm DCDM when training the simplified mlODM. More importantly, the feasible solution generated in the first stage can promote the efficiency of DSSR. To the best of our knowledge, this is the first attempt to create a hybrid screening rule for multi-label model. Our TSSR can greatly reduce the cost and achieve exactly the same accuracy. Experimental results on seven multi-label benchmark datasets and two real-world learning problems including movie genres classification and hypoglycemic drugs prediction verify the superiority of the proposed methods.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.