Large Number Of Object Classes Research Articles

In this paper, a structured max-margin learning algorithm is developed to train a large number of inter-related classifiers more effectively for multilabel image annotation. First, to leverage multilabel images for classifier training, each multilabel image is partitioned into a set of image instances (image regions or image patches), and an automatic instance label identification algorithm is developed to assign the multiple labels (which are given at the image level) to the most relevant image instances. A K-way min-max cut algorithm is developed for automatic instance clustering and kernel weight determination, where multiple base kernels are combined to address the issue of huge intra-concept visual diversity more effectively. Second, a visual concept network is constructed to characterize the inter-concept visual similarity contexts more precisely in the high-dimensional multimodal feature space. The visual concept network is used to determine the inter-related learning tasks directly in the feature space rather than in the label space, because the feature space is the common space for classifier training and image classification. Third, a parallel computing platform is developed to achieve more effective learning of a large number of inter-related classifiers over the visual concept network. A structured max-margin learning algorithm is developed by incorporating the visual concept network, max-margin Markov networks, and multitask learning to address the issue of huge inter-concept visual similarity more effectively. By leveraging the inter-concept visual similarity contexts for inter-related classifier training, our structured max-margin learning algorithm can significantly enhance the discrimination power of the inter-related classifiers. Our experiments have also obtained very positive results for a large number of object classes and image concepts.
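
As a rough illustration of what combining multiple base kernels can look like, the sketch below forms a convex combination of Gaussian (RBF) base kernels. It is not the paper's method: the abstract states that the kernel weights come from the K-way min-max cut algorithm, whereas here the weights are simply passed in. The function names, the choice of RBF kernels, and the use of one bandwidth per base kernel are all assumptions made for the example.

```python
import math


def rbf_kernel(x, y, gamma):
    """Gaussian (RBF) base kernel between two feature vectors."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


def combined_kernel(x, y, gammas, weights):
    """Convex combination of RBF base kernels: sum_m w_m * k_m(x, y).

    Assumption: weights are non-negative and sum to 1. In the paper they
    would be determined by the clustering step; here they are given.
    """
    if len(gammas) != len(weights):
        raise ValueError("need one weight per base kernel")
    if any(w < 0 for w in weights) or not math.isclose(sum(weights), 1.0):
        raise ValueError("weights must be non-negative and sum to 1")
    return sum(w * rbf_kernel(x, y, g) for g, w in zip(gammas, weights))


if __name__ == "__main__":
    # Hypothetical 3-dimensional features for two image instances.
    a = [0.2, 0.5, 0.1]
    b = [0.3, 0.4, 0.0]
    print(combined_kernel(a, b, gammas=[0.5, 2.0], weights=[0.7, 0.3]))
```

Because each base kernel is positive semidefinite and the weights are non-negative, the combination is itself a valid kernel, which is why a weighted sum is a common starting point for multiple-kernel methods.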


The ability of practical recognition systems to recognize a large number of objects is constrained by a variety of factors, including the choice of feature extraction technique, the quality of the images, and the complexity and variability of the underlying objects and of the collected data. Given a feature extraction technique that generates templates of objects from data, and given the resolution of the original images, the remaining factors can be attributed to distortions due to a recognition channel. We define the recognition channel as the environment that transforms reference templates of objects in a database into templates submitted for recognition. If the templates in an object database are generated to be statistically independent and the noise in a query template is statistically independent of the templates in the database, then the ability of the recognition channel to recognize a large number of object classes can be characterized by a number called the recognition capacity. In this paper, we evaluate the empirical recognition capacity of PCA-based object recognition systems. The encoded data (templates) and the additive noise in query templates are modeled as Gaussian distributed with zero mean and estimated variances. We analyze both the case of a single encoded image and the case of multiple correlated encoded images. For the latter case, we propose a model that depends on orientation and elevation angle (pose). The fit of the proposed models is judged using statistical goodness-of-fit tests. We define the recognition rate as the ratio R = log(M)/n, where M is the number of objects to recognize and n is the length of the PCA templates. The empirical capacity of PCA-based recognition systems is numerically evaluated. The empirical random coding exponent is also numerically evaluated and plotted as a function of the recognition rate. With these results, given a value of the recognition capacity and the length of the templates (assumed to be large), we can predict the number of distinct object classes that can be stored in an object library and identified with a probability of error close to zero.
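
The relation R = log(M)/n can be turned around to estimate how many object classes fit under a given capacity. The sketch below is a minimal illustration under stated assumptions, not the paper's evaluation procedure. It assumes natural logarithms (the abstract does not state the base), so the largest M with R at or below capacity C is floor(exp(n·C)). The helper `gaussian_capacity` uses the standard textbook formula for independent Gaussian components with additive Gaussian noise; whether the paper computes capacity exactly this way is an assumption. All function names are hypothetical.

```python
import math


def recognition_rate(num_objects, template_length):
    """R = log(M) / n, using natural logarithms (assumed base)."""
    return math.log(num_objects) / template_length


def max_objects(capacity, template_length):
    """Largest M whose rate log(M)/n does not exceed the capacity C."""
    return math.floor(math.exp(capacity * template_length))


def gaussian_capacity(signal_vars, noise_vars):
    """Per-component capacity in nats for independent Gaussian components.

    Textbook formula (an assumption here, not taken from the abstract):
    C = (1/n) * sum_i 0.5 * ln(1 + signal_var_i / noise_var_i).
    """
    if len(signal_vars) != len(noise_vars) or not signal_vars:
        raise ValueError("need matching, non-empty variance lists")
    total = sum(0.5 * math.log(1.0 + s / v)
                for s, v in zip(signal_vars, noise_vars))
    return total / len(signal_vars)


if __name__ == "__main__":
    # Hypothetical variances for a 4-component PCA template.
    c = gaussian_capacity([4.0, 2.0, 1.0, 0.5], [1.0, 1.0, 1.0, 1.0])
    m = max_objects(c, template_length=4)
    print(c, m, recognition_rate(m, 4))
```

The count grows exponentially in the template length n, which is why the abstract's prediction is stated for large n: the capacity bound describes asymptotic behavior rather than a guarantee for short templates.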


Articles published on Large Number Of Object Classes

Decoding Brain Signals from Rapid-Event EEG for Visual Analysis Using Deep Learning.

Embedding Visual Hierarchy with Deep Networks for Large-Scale Visual Recognition.

Integrating multi-level deep learning and concept ontology for large-scale visual recognition

Contextual Co-occurrence Information for Object Representation and Categorization

Cost-sensitive learning of hierarchical tree classifiers for large-scale image classification and novel category detection

Training inter-related classifiers for automatic image classification and annotation

Contextually guided semantic labeling and search for three-dimensional point clouds

Multi-task multi-label multiple instance learning

Structured Max-Margin Learning for Inter-Related Classifier Training and Multilabel Image Annotation

Empirical Capacity of a Recognition Channel for Single- and Multipose Object Recognition Under the Constraint of PCA Encoding
