Abstract

Abundant data with limited labeling are a widespread bottleneck in multilabel learning. Active learning (AL) is an effective solution to gradually enhance model robustness, however how to effectively extend instance selection criteria to multilabel case remains challenging. Considering the label specificity and label correlation, a granular batch mode-based ranking active model for the multilabel (GBRAML) is proposed. Taking a bottom-up view, three granulation operators are successively constructed to formulate three granular structures. In low-level granulation operator, auxiliary label is introduced to enhance the informativeness and representativeness of each label. The contribution of labels to the usefulness of instances is incorporated with pair-wise label correlation, and is considered in the middle-level granulation operator. The labeling priority is determined by ranking the scorings coming from high-level granulation operator. To alleviate the impact of skewed label correlation, we take a three-way strategy on fitness of representative label correlation, thus a three-way GBRAML model (TGBRAML) is devised. Extensive experiments on six multilabel benchmark demonstrate GBRAML gains 5.4% and 210.1% improvement on MicroF1 and Average Precision over state-of-the-art batch mode multilabel active learning. The effectiveness of three-way decisions in multilabel AL is also verified.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.