Abstract
In classification problem, single classifier may not fully catch the dataset's information. Thus, an ensemble method based on Support Vector Machine (SVM) is proposed in this paper for image scene classification. First, Scale Invariant Feature Transform (SIFT) is used to extract the features of the images, and the SIFT features are clustered to form a visual vocabulary. Then, the SIFT features of each image are compared with this visual vocabulary to calculate the appearance frequencies of the visual words, which consist of the Bag-of-Words (BOW) model descriptions of the image. Probabilistic Latent Semantic Analysis (PLSA) is used to exploit the latent semantic features on the basis of the BOW model, and SVM classifier is then trained by these latent semantic features. These processes repeat N times and N different SVM classifiers are trained. Finally, they are used to classify the testing images, and the ensemble of the N different classification results are calculated as the final result. Experiments show that our method can be effectively applied to the scene classification problem, and the accuracy could be improved with a certain degree of robustness.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.