Object Retrieval Research Articles

Object retrieval plays an increasingly important role in video surveillance, digital marketing, e-commerce, and other applications. It faces challenges such as large-scale datasets, imbalanced data, viewpoint variation, cluttered backgrounds, and fine-grained details (attributes). This paper proposes a model that integrates object ontology, a local multitask deep neural network (local MDNN), and an imbalanced data solver to exploit the strengths and overcome the shortcomings of deep learning models, improving the performance of large-scale object retrieval from the coarse-grained level (categories) to the fine-grained level (attributes). The proposed coarse-to-fine object retrieval (CFOR) system is designed to be robust to the challenges listed above. To the best of our knowledge, the main novelty of the CFOR system is the mutual support of object ontology, a local MDNN, and an imbalanced data solver in a unified system. Object ontology exploits inner-group correlations to improve category classification and attribute classification, and it guides the training flow and the retrieval flow to save computational cost in the training stage and the retrieval stage, respectively, on large-scale datasets. The local MDNN links object ontology to the raw data, and the imbalanced data solver, based on the Matthews correlation coefficient (MCC), addresses data imbalance, which improves the quality of the object ontology realization without adjusting the network architecture or augmenting the data. To evaluate the CFOR system, we experimented on the DeepFashion dataset. The results show that the local MDNN framework, based on the pretrained NASNet architecture, achieves better performance (14.2% higher recall rate) than single-task learning (STL) in the attribute learning task, and that the model with the imbalanced data solver achieves better performance (5.14% higher recall rate for attributes with fewer data) than models that do not account for imbalance. Moreover, MAP@30 is around 0.815 in retrieval, on average over 35 imbalanced fashion attributes.
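
The abstract does not describe how the MCC-based solver works internally, so the following is only a minimal sketch of the metric itself, showing why MCC is a common choice for imbalanced attributes. The function names `confusion_counts` and `mcc`, and the toy labels, are assumptions made for illustration and do not come from the paper.

```python
import math

def confusion_counts(y_true, y_pred):
    """Count (tp, fp, fn, tn) for binary labels given as 0/1 sequences."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    return tp, fp, fn, tn

def mcc(tp, fp, fn, tn):
    """Matthews correlation coefficient in [-1, 1].

    Returns 0.0 when the denominator is zero (a common convention,
    chosen here as an assumption).
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom

# Hypothetical rare attribute: 5 positives among 100 samples.
y_true = [1] * 5 + [0] * 95
always_negative = [0] * 100  # a trivial classifier that never predicts the attribute

tp, fp, fn, tn = confusion_counts(y_true, always_negative)
accuracy = (tp + tn) / len(y_true)   # high, despite missing every positive
score = mcc(tp, fp, fn, tn)          # zero: no correlation with the labels
```

Accuracy rewards the trivial classifier because negatives dominate, whereas MCC uses all four cells of the confusion matrix and stays at zero, which is the property that makes it useful as a signal when attribute data are imbalanced.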

This paper proposes a method for hyper-clique graph (HCG) generation. An HCG can be considered an extension of classical graphs and hyper-graphs in which each node is replaced with a clique (a set of neighboring nodes in a specific feature space) and each hyper-edge linking multiple nodes is replaced with a hyper-edge linking multiple cliques. In addition, we propose an HCG matching method that preserves global and local structures. Specifically, we embed the clique relations of arbitrary orders in a high-order similarity tensor in a recursive manner. Then, we formulate the objective function of HCG matching with respect to two latent variables: the latent clique structure information in the original graph and the similarity measure of clique sets from pairwise HCGs. Since the objective function is not jointly convex with respect to both latent variables, we decompose it into two consecutive measurements for optimization: 1) a clique-to-clique similarity measurement that preserves local unary and pairwise correspondences and 2) a graph-to-graph similarity measurement that preserves global clique-to-clique correspondence. We adopt affinity-preserving reweighted random walks to optimize the objective function. We extensively evaluate the HCG matching performance on multiple applications: 1) we evaluate the robustness of HCG with respect to the deformation noise, the number of outliers, and the edge density on synthetic data and explore the effects of both the clique order and hyper-edge order on performance; 2) we explore HCG matching for feature point matching on multiple image data sets (CMU house sequence, Caltech+MSRC, and Car+Motor); and 3) we explore HCG matching for multi-view object retrieval using popular data sets (MV-RED and NTU), which is a much more challenging task since multi-view objects contain significant variations of illumination, viewpoint, and so on. A comparison against state-of-the-art methods demonstrates the superior performance of the proposed method.
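
To give a feel for the optimizer named in this abstract, here is a simplified sketch of reweighted random walks on ordinary pairwise graphs. It is not the HCG method: there are no cliques or high-order tensors, the absorbing node of the original random-walk formulation is replaced by a plain renormalization, and the helper names (`build_affinity`, `sinkhorn`, `reweighted_random_walk`) and parameter values (`sigma`, `alpha`, `beta`) are assumptions chosen for illustration.

```python
import numpy as np

def build_affinity(d1, d2, sigma=0.1):
    """Pairwise affinity between candidate matches (i, j) and (k, l).

    Candidate (i, j) has flat index i * n2 + j. Affinity is high when the
    distance d1[i, k] in graph 1 agrees with d2[j, l] in graph 2.
    Conflicting candidates (same i or same j) get zero affinity.
    """
    n1, n2 = len(d1), len(d2)
    W = np.zeros((n1 * n2, n1 * n2))
    for i in range(n1):
        for j in range(n2):
            for k in range(n1):
                for l in range(n2):
                    if i != k and j != l:
                        diff = d1[i, k] - d2[j, l]
                        W[i * n2 + j, k * n2 + l] = np.exp(-diff ** 2 / sigma)
    return W

def sinkhorn(m, iters=50):
    """Alternate row and column normalization (approximate bistochastic)."""
    m = m.copy()
    for _ in range(iters):
        m /= m.sum(axis=1, keepdims=True)
        m /= m.sum(axis=0, keepdims=True)
    return m

def reweighted_random_walk(W, n1, n2, alpha=0.2, beta=30.0, iters=100, tol=1e-9):
    """Return an (n1, n2) soft assignment matrix."""
    # Affinity-preserving normalization: divide by the maximum degree
    # rather than normalizing each row separately.
    P = W / W.sum(axis=1).max()
    x = np.full(n1 * n2, 1.0 / (n1 * n2))
    for _ in range(iters):
        walk = P.T @ x
        # Reweighting jump: inflate strong candidates, then push the result
        # toward a one-to-one assignment with Sinkhorn normalization.
        jump = sinkhorn(np.exp(beta * walk / walk.max()).reshape(n1, n2)).ravel()
        jump /= jump.sum()
        x_new = alpha * walk + (1.0 - alpha) * jump
        x_new /= x_new.sum()  # simplification standing in for the absorbing node
        converged = np.abs(x_new - x).sum() < tol
        x = x_new
        if converged:
            break
    return x.reshape(n1, n2)

def pairwise_distances(points):
    return np.linalg.norm(points[:, None, :] - points[None, :, :], axis=-1)

# Toy usage: match three points against a permuted copy of themselves.
p = np.array([[0.0, 0.0], [1.0, 0.0], [3.0, 0.0]])
perm = np.array([2, 0, 1])
q = p[perm]
W = build_affinity(pairwise_distances(p), pairwise_distances(q))
soft = reweighted_random_walk(W, len(p), len(q))
match = soft.argmax(axis=1)  # for each point in p, the index of its match in q
```

The design point the sketch illustrates is the interplay of two steps: the walk spreads confidence along mutually consistent candidate matches, and the reweighting jump injects the one-to-one constraint at every iteration instead of only at the end.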

Articles published on Object Retrieval

Large-Scale Coarse-to-Fine Object Retrieval Ontology and Deep Local Multitask Learning.

DeepCCFV: Camera Constraint-Free Multi-View Convolutional Neural Network for 3D Object Retrieval

Angular Triplet-Center Loss for Multi-View 3D Shape Retrieval

Blinded Evaluation of Endoscopic Skill and Instructability After Implementation of an Endoscopic Simulation Experience.

Multi-view-based siamese convolutional neural network for 3D object retrieval

Detection and Content Retrieval of Object in an Image using YOLO

Co-weighting semantic convolutional features for object retrieval

Phase retrieval exact solution based on structured window modulation without direct reference waves

End-to-end semantic-aware object retrieval based on region-wise attention

Hyper-Clique Graph Matching and Applications

Effect of the average number of reference speckles in speckle imaging using off-axis speckle holography.

An improved phase-coding method for absolute phase retrieval based on the path-following algorithm

Infants plan prehension while pivoting.

Chronic phencyclidine treatment impairs spatial working memory in rhesus monkeys.

3D Object Retrieval Based on Multi-View Latent Variable Model

Improving Object Retrieval Quality by Integration of Similarity Propagation and Query Expansion

Graph-based particular object discovery

End-to-End Visual Domain Adaptation Network for Cross-Domain 3D CPS Data Retrieval

Accidental diagnosis of a foreign body embedded in maxillary anterior tooth

Multiple foreign body ingestion in pica patient
