Thousands Of Classes Research Articles

Recently, there has been a lot of success in the development of effective binary classifiers. Although many statistical classification techniques have natural multiclass extensions, some, such as the support vector machines, do not. The existing techniques for mapping multiclass problems onto a set of simpler binary classification problems run into serious efficiency problems when there are hundreds or even thousands of classes, and these are the scenarios where this paper's contributions shine. We introduce the concept of correlation and joint probability of base binary learners. We learn these properties during the training stage, group the binary leaner's based on their independence and, with a Bayesian approach, combine the results to predict the class of a new instance. Finally, we also discuss two additional strategies: one to reduce the number of required base learners in the multiclass classification, and another to find new base learners that might best complement the existing set. We use these two new procedures iteratively to complement the initial solution and improve the overall performance. This paper has two goals: finding the most discriminative binary classifiers to solve a multiclass problem and keeping up the efficiency, i.e., small number of base learners. We validate and compare the method with a diverse set of methods of the literature in several public available datasets that range from small (10 to 26 classes) to large multiclass problems (1000 classes) always using simple reproducible scenarios.

Read full abstract

We benchmark several SVM objective functions for large-scale image classification. We consider one-versus-rest, multiclass, ranking, and weighted approximate ranking SVMs. A comparison of online and batch methods for optimizing the objectives shows that online methods perform as well as batch methods in terms of classification accuracy, but with a significant gain in training speed. Using stochastic gradient descent, we can scale the training to millions of images and thousands of classes. Our experimental evaluation shows that ranking-based algorithms do not outperform the one-versus-rest strategy when a large number of training examples are used. Furthermore, the gap in accuracy between the different algorithms shrinks as the dimension of the features increases. We also show that learning through cross-validation the optimal rebalancing of positive and negative examples can result in a significant improvement for the one-versus-rest strategy. Finally, early stopping can be used as an effective regularization strategy when training with online algorithms. Following these "good practices," we were able to improve the state of the art on a large subset of 10K classes and 9M images of ImageNet from 16.7 percent Top-1 accuracy to 19.1 percent.

Read full abstract

Thousands Of Classes Research Articles

Articles published on Thousands Of Classes

Sparse Output Coding for Scalable Visual Recognition

Hierarchical Bayesian Inference and Recursive Regularization for Large-Scale Classification

A Meta-Top-Down Method for Large-Scale Hierarchical Classification

Research on Java Development Kit Based on Complex Networks

Multiclass from binary: expanding one-versus-all, one-versus-one and ECOC-based approaches.

Parallel multiclass stochastic gradient descent algorithms for classifying million images with very-high-dimensional signatures into thousands classes

Good Practice in Large-Scale Learning for Image Classification

Maxi-Min discriminant analysis via online learning

Cross-Validation Optimization for Large Scale Structured Classification Kernel Methods

An improved handwritten Chinese character recognition system using support vector machine

Fast SVM training algorithm with decomposition on very large data sets

METASYNTHETIC APPROACH FOR HANDWRITTEN CHINESE CHARACTER RECOGNITION

Experimental comparison of coarse-grained concepts in UML, OML, and TOS

Foundation of the taxonomic object system

Pattern Classification with Compact Distribution Maps

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Thousands Of Classes Research Articles

Articles published on Thousands Of Classes

Sparse Output Coding for Scalable Visual Recognition

Hierarchical Bayesian Inference and Recursive Regularization for Large-Scale Classification

A Meta-Top-Down Method for Large-Scale Hierarchical Classification

Research on Java Development Kit Based on Complex Networks

Multiclass from binary: expanding one-versus-all, one-versus-one and ECOC-based approaches.

Parallel multiclass stochastic gradient descent algorithms for classifying million images with very-high-dimensional signatures into thousands classes

Good Practice in Large-Scale Learning for Image Classification

Maxi-Min discriminant analysis via online learning

Cross-Validation Optimization for Large Scale Structured Classification Kernel Methods

An improved handwritten Chinese character recognition system using support vector machine

Fast SVM training algorithm with decomposition on very large data sets

METASYNTHETIC APPROACH FOR HANDWRITTEN CHINESE CHARACTER RECOGNITION

Experimental comparison of coarse-grained concepts in UML, OML, and TOS

Foundation of the taxonomic object system

Pattern Classification with Compact Distribution Maps