Low-level Feature Space Research Articles

Most existing zero-shot learning approaches exploit transfer learning via an intermediate semantic representation shared between an annotated auxiliary dataset and a target dataset with different classes and no annotation. A projection from a low-level feature space to the semantic representation space is learned from the auxiliary dataset and applied without adaptation to the target dataset. In this paper we identify two inherent limitations of these approaches. First, because the auxiliary and target datasets have disjoint and potentially unrelated classes, the projection functions learned from the auxiliary dataset/domain are biased when applied directly to the target dataset/domain. We call this the projection domain shift problem and propose a novel framework, transductive multi-view embedding, to solve it. The second limitation is the prototype sparsity problem, which refers to the fact that, for each target class, only a single prototype is available for zero-shot learning given a semantic representation. To overcome this problem, a novel heterogeneous multi-view hypergraph label propagation method is formulated for zero-shot learning in the transductive embedding space. It effectively exploits the complementary information offered by different semantic representations and takes advantage of the manifold structures of multiple representation spaces in a coherent manner. We demonstrate through extensive experiments that the proposed approach (1) rectifies the projection shift between the auxiliary and target domains, (2) exploits the complementarity of multiple semantic representations, (3) significantly outperforms existing methods for both zero-shot and N-shot recognition on three image and video benchmark datasets, and (4) enables novel cross-view annotation tasks.
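The conventional pipeline that this abstract criticises (learn a projection on the auxiliary data, apply it unchanged to the target data, then match against one prototype per target class) can be sketched as follows. This is only an illustrative baseline, not the transductive multi-view method the paper proposes. The ridge-regression projection, the synthetic data, and every name (`fit_projection`, `zero_shot_predict`, `A`, `aux_protos`, `tgt_protos`) are assumptions made for the example.

```python
import numpy as np

def fit_projection(X_aux, S_aux, reg=1e-2):
    """Ridge-regress semantic vectors (n, k) on low-level features (n, d)."""
    d = X_aux.shape[1]
    return np.linalg.solve(X_aux.T @ X_aux + reg * np.eye(d), X_aux.T @ S_aux)

def zero_shot_predict(X_tgt, W, prototypes):
    """Label each target sample with its nearest class prototype."""
    Z = X_tgt @ W  # project features into the semantic space, shape (m, k)
    dists = ((Z[:, None, :] - prototypes[None, :, :]) ** 2).sum(axis=2)
    return dists.argmin(axis=1)

# Synthetic toy data. Assumption: features are a noisy linear image of the
# semantic vectors, generated by the same hypothetical map A in both domains.
rng = np.random.default_rng(0)
d, k = 10, 4
A = rng.normal(size=(k, d))
aux_protos = rng.normal(size=(8, k))   # one semantic vector per auxiliary class
tgt_protos = rng.normal(size=(3, k))   # one prototype per unseen target class

aux_labels = np.repeat(np.arange(8), 50)
tgt_labels = np.repeat(np.arange(3), 30)
X_aux = aux_protos[aux_labels] @ A + 0.01 * rng.normal(size=(400, d))
X_tgt = tgt_protos[tgt_labels] @ A + 0.01 * rng.normal(size=(90, d))

# Learn on the auxiliary classes only, then apply without adaptation.
W = fit_projection(X_aux, aux_protos[aux_labels])
pred = zero_shot_predict(X_tgt, W, tgt_protos)
accuracy = float((pred == tgt_labels).mean())
print(f"zero-shot accuracy on toy data: {accuracy:.2f}")
```

In this toy setting both domains share the same feature-generating map, so the learned projection transfers cleanly. The projection domain shift described in the abstract arises precisely when that assumption fails on real data, and the single row per class in `tgt_protos` is what the abstract calls prototype sparsity.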


In some real-world applications, such as information retrieval and data classification, we are often confronted with the situation that the same semantic concept can be expressed using different views with similar information. Obtaining Semantically Consistent Patterns (SCP) for cross-view data, which embed the complementary information from different views, is therefore of great importance for those applications. However, the heterogeneity among cross-view representations poses a significant challenge to mining the SCP. In this paper, we propose a general framework to discover the SCP for cross-view data. Specifically, to build a feature-isomorphic space among different views, a novel Isomorphic Relevant Redundant Transformation (IRRT) is first proposed. The IRRT linearly maps multiple heterogeneous low-level feature spaces to a high-dimensional redundant feature-isomorphic one, which we call the mid-level space. Thus, much more complementary information from different views can be captured. Furthermore, to mine the semantic consistency among the isomorphic representations in the mid-level space, we propose a new Correlation-based Joint Feature Learning (CJFL) model to extract a unique high-level semantic subspace shared across the feature-isomorphic data. Consequently, the SCP for cross-view data can be obtained. Comprehensive experiments on three data sets demonstrate the advantages of our framework in classification and retrieval.
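The general idea of linearly mapping two heterogeneous feature spaces into a shared, correlated subspace can be illustrated with classical regularized canonical correlation analysis (CCA). CCA is used here purely as a stand-in: it is neither the IRRT nor the CJFL model from the abstract, and the function name `cca`, the regularization value, and the synthetic two-view data are assumptions made for this sketch.

```python
import numpy as np

def cca(X, Y, k, reg=1e-3):
    """Regularized CCA: linear maps Wx, Wy and the top-k canonical correlations."""
    X = X - X.mean(axis=0)
    Y = Y - Y.mean(axis=0)
    n = X.shape[0]
    Cxx = X.T @ X / n + reg * np.eye(X.shape[1])
    Cyy = Y.T @ Y / n + reg * np.eye(Y.shape[1])
    Cxy = X.T @ Y / n
    # Whiten each view with its Cholesky factor, then take the SVD of the
    # whitened cross-covariance; singular values are the canonical correlations.
    Lx = np.linalg.cholesky(Cxx)
    Ly = np.linalg.cholesky(Cyy)
    M = np.linalg.solve(Lx, Cxy) @ np.linalg.inv(Ly).T
    U, s, Vt = np.linalg.svd(M)
    Wx = np.linalg.solve(Lx.T, U[:, :k])
    Wy = np.linalg.solve(Ly.T, Vt.T[:, :k])
    return Wx, Wy, s[:k]

# Two synthetic views of the same 2-D latent concept, with different
# dimensionalities (5 and 8) to mimic heterogeneous low-level features.
rng = np.random.default_rng(0)
latent = rng.normal(size=(500, 2))
view_a = latent @ rng.normal(size=(2, 5)) + 0.05 * rng.normal(size=(500, 5))
view_b = latent @ rng.normal(size=(2, 8)) + 0.05 * rng.normal(size=(500, 8))

Wx, Wy, corrs = cca(view_a, view_b, k=2)
shared_a = (view_a - view_a.mean(axis=0)) @ Wx   # view A in the shared subspace
shared_b = (view_b - view_b.mean(axis=0)) @ Wy   # view B in the shared subspace
print("top canonical correlations:", np.round(corrs, 3))
```

After projection, `shared_a` and `shared_b` have the same dimensionality and can be compared directly, which is the practical benefit a shared subspace offers for cross-view retrieval and classification.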



Articles published on Low-level Feature Space

Transductive multi-view zero-shot learning.

Self-Growing RBF Neural Network Approach for Semantic Image Retrieval

Mining Semantically Consistent Patterns for Cross-View Data

Unsupervised Multi-Spectral Satellite Image Segmentation Combining Modified Mean-Shift and a New Minimum Spanning Tree Based Clustering Technique

HBIR: Hypercube-Based Image Retrieval

Multilabel classification with meta-level features in a learning-to-rank framework

Hybrid Query Refinement

FUZZY MODE SIMILARITY MEASURE BASED ON MEASURE OF ILLUMINATION INSTABILITY

Measuring Concept Similarities in Multimedia Ontologies: Analysis and Evaluations

S-IRAS

Exploring statistical correlations for image retrieval

Optical associative processor for general linear transformations

