Image Descriptors Research Articles

As two fundamental representation modalities of 3D objects, 3D point clouds and multi-view 2D images record shape information from different domains of geometric structures and visual appearances. In the current deep learning era, remarkable progress in processing such two data modalities has been achieved through respectively customizing compatible 3D and 2D network architectures. However, unlike multi-view image-based 2D visual modeling paradigms, which have shown leading performance in several common 3D shape recognition benchmarks, point cloud-based 3D geometric modeling paradigms are still highly limited by insufficient learning capacity due to the difficulty of extracting discriminative features from irregular geometric signals. In this article, we explore the possibility of boosting deep 3D point cloud encoders by transferring visual knowledge extracted from deep 2D image encoders under a standard teacher-student distillation workflow. Generally, we propose PointMCD, a unified multi-view cross-modal distillation architecture, including a pretrained deep image encoder as the teacher and a deep point encoder as the student. To perform heterogeneous feature alignment between 2D visual and 3D geometric domains, we further investigate visibility-aware feature projection (VAFP), by which point-wise embeddings are reasonably aggregated into view-specific geometric descriptors. By pair-wisely aligning multi-view visual and geometric descriptors, we can obtain more powerful deep point encoders without exhausting and complicated network modification. Experiments on 3D shape classification, part segmentation, and unsupervised learning strongly validate the effectiveness of our method. <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">The code and data will be publicly available at</i> <uri xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">https://github.com/keeganhk/PointMCD</uri> .

The categorization of texture images requires the identification and extraction of meaningful keypoints, a crucial step in ensuring the precise representation of textured images. The literature has introduced numerous descriptors in order to detect and capture both local and global texture characteristics. These descriptors vary in their effectiveness depending on the specific application. However, it is generally accepted that they complement each other by compensating for their strengths and weaknesses. Therefore, many approaches focused on combining different types of image descriptors generating robust features that are invariant to image transformations. These solutions mostly focused on one way to deal with information and faced more problems as they need human intervention and a large set of data. The acknowledged benefits of combining multiple types of descriptors are accompanied by challenges specially. These challenges arise due to differences in properties, such as locality and sparsity, as well as the heterogeneity exhibited by generated features. To address this issue comprehensively, we present an approach that benefits from genetic programming techniques to generate and combine two distinct texture classifiers. It incorporates histograms of oriented gradients and local binary patterns descriptors, which capture different textures. To effectively fuse the results of both classifiers, the proposed approach employs a late fusion process along with data augmentation methods while using a limited amount of training data. To evaluate the performance of the proposed approach in texture image classification tasks, we have conducted extensive experiments on six challenging datasets encompassing various variations. We have also investigated its performance in a cross dataset problem where the model has been trained on instances of a dataset before being tested on samples of another dataset. The obtained results clearly demonstrate that the proposed approach surpasses other relevant low-level approaches, as well as existing GP-based and CNN methods specifically designed for describing and classifying textures. Thanks to its ability to simultaneously leverage multiple descriptors, the suggested solution shows a high potential for real-world applications, particularly in handling various image changes with robustness.

Image Descriptors Research Articles

Related Topics

Articles published on Image Descriptors

Computer multimedia aided design and hand-drawn effect analysis based on grid resource sharing cooperative algorithm

CSPFormer: A cross-spatial pyramid transformer for visual place recognition

Semisupervised Vector Quantization in Visual SLAM Using HGCN

Unobtrusive Cognitive Assessment in Smart-Homes: Leveraging Visual Encoding and Synthetic Movement Traces Data Mining.

Efficient and Accurate Image Classification via Spatial Pyramid Matching and SURF Sparse Coding

Acoustic and visual geometry descriptor for multi-modal emotion recognition fromvideos

A Training-Free, Lightweight Global Image Descriptor for Long-Term Visual Place Recognition Toward Autonomous Vehicles

Robust zero‐watermarking algorithm based on discrete wavelet transform and daisy descriptors for encrypted medical image

A Partitioned CAM Architecture with FPGA Acceleration for Binary Descriptor Matching

Discriminative Embedded Oriented Local Pattern (D-EOLP): a new feature based image descriptor

PointMCD: Boosting Deep Point Cloud Encoders Via Multi-View Cross-Modal Distillation for 3D Shape Recognition

Wear mechanisms and severity level classification in iron ore transfer chute linings by propagating regional labels coded as embedding deep learning vectors

Data Augmentation for Genetic Programming-Driven Late Merging of HOG and Uniform LBP Features for Texture Classification

Loop Closure Detection Based on Compressed ConvNet Features in Dynamic Environments

Depth as attention to learn image representations for visual localization, using monocular images

A robust and adaptable high-precision method for matching flipped SAR images based on an oriented descriptor

Spatial Pyramid Attention Enhanced Visual Descriptors for Landmark Retrieval

An emotion recognition system: bridging the gap between humans-machines interaction

LMFD: lightweight multi-feature descriptors for image stitching

Efficient Analysis of Large-Size Bio-Signals Based on Orthogonal Generalized Laguerre Moments of Fractional Orders and Schwarz–Rutishauser Algorithm

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Image Descriptors Research Articles

Related Topics

Articles published on Image Descriptors

Computer multimedia aided design and hand-drawn effect analysis based on grid resource sharing cooperative algorithm

CSPFormer: A cross-spatial pyramid transformer for visual place recognition

Semisupervised Vector Quantization in Visual SLAM Using HGCN

Unobtrusive Cognitive Assessment in Smart-Homes: Leveraging Visual Encoding and Synthetic Movement Traces Data Mining.

Efficient and Accurate Image Classification via Spatial Pyramid Matching and SURF Sparse Coding

Acoustic and visual geometry descriptor for multi-modal emotion recognition fromvideos

A Training-Free, Lightweight Global Image Descriptor for Long-Term Visual Place Recognition Toward Autonomous Vehicles

Robust zero‐watermarking algorithm based on discrete wavelet transform and daisy descriptors for encrypted medical image

A Partitioned CAM Architecture with FPGA Acceleration for Binary Descriptor Matching

Discriminative Embedded Oriented Local Pattern (D-EOLP): a new feature based image descriptor

PointMCD: Boosting Deep Point Cloud Encoders Via Multi-View Cross-Modal Distillation for 3D Shape Recognition

Wear mechanisms and severity level classification in iron ore transfer chute linings by propagating regional labels coded as embedding deep learning vectors

Data Augmentation for Genetic Programming-Driven Late Merging of HOG and Uniform LBP Features for Texture Classification

Loop Closure Detection Based on Compressed ConvNet Features in Dynamic Environments

Depth as attention to learn image representations for visual localization, using monocular images

A robust and adaptable high-precision method for matching flipped SAR images based on an oriented descriptor

Spatial Pyramid Attention Enhanced Visual Descriptors for Landmark Retrieval

An emotion recognition system: bridging the gap between humans-machines interaction

LMFD: lightweight multi-feature descriptors for image stitching

Efficient Analysis of Large-Size Bio-Signals Based on Orthogonal Generalized Laguerre Moments of Fractional Orders and Schwarz–Rutishauser Algorithm