Abstract

ABSTRACTFeatures relevant to a thematic class, that is, class-specific features are beneficial to thematic information extraction. However, existing class-specific feature selection methods require abundant labelled samples, while sample labelling is always labour intensive and time consuming. Therefore, it is necessary to select class-specific features with insufficient labelled objects. In this paper, we raise this problem as semi-supervised class-specific feature selection and propose a new two-stage method. First, a weight matrix fully integrates local geometrical structure and discriminative information. Second, the weight matrix is incorporated into a -norm minimization optimization problem of data reconstruction to objectively measure the effectiveness of features for a thematic class. Different from the explicit binarization in the label vector, the new method only implicitly employs binarization in the weight matrix. With area under receiver-operating characteristic curve, class-specific features result in an increase from 3% and 4% on average for Bayes and linear support vector machine, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.