Large Language Models (LLMs), based on the transformer architecture, have demonstrated remarkable capabilities in natural language processing tasks, enabling machines to generate human-like text and engage in meaningful dialogues. However, the exponential increase in model parameters has led to limitations in inference speed and energy efficiency. Compute-in-memory (CIM) technology offers a promising solution for accelerating AI inference by performing analog computations directly within memory, potentially reducing latency and power consumption. While CIM has been successfully applied to accelerate Convolutional Neural Networks (CNNs), the matrix–matrix multiplication (MatMul) operations inherent in the scaled dot-product attention of the transformer present unique challenges for direct CIM implementation. In this work, we propose InMemQK, a compute-in-memory-based attention accelerator that focuses on optimizing MatMul operations through software-hardware co-design. At the software level, InMemQK employs product quantization (PQ) to eliminate data dependencies. At the hardware level, InMemQK integrates energy-efficient time-domain MAC macros for ADC-free computation. Experimental results show that InMemQK achieves 13.2×–13.9× lower power consumption than existing CIM-based accelerators.
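To make the PQ idea concrete, the following is a minimal NumPy sketch of how product quantization can replace the Q·Kᵀ MatMul in attention with codebook lookups: key subvectors are encoded as centroid indices, and each query then sums precomputed query-centroid dot products instead of performing full-length MACs. The subvector count, codebook size, and k-means training loop here are illustrative assumptions for exposition, not InMemQK's actual configuration or hardware mapping.

```python
# Sketch: approximating the attention Q @ K.T MatMul with product quantization (PQ).
# All hyperparameters (n_sub, n_cent, iters) are illustrative assumptions.
import numpy as np

def train_codebooks(K, n_sub, n_cent, iters=20, seed=0):
    """Learn one k-means codebook per subspace from the key matrix K."""
    rng = np.random.default_rng(seed)
    n, d = K.shape
    d_sub = d // n_sub
    codebooks = []
    for s in range(n_sub):
        X = K[:, s * d_sub:(s + 1) * d_sub]                  # keys restricted to subspace s
        C = X[rng.choice(n, n_cent, replace=False)].copy()   # init centroids from samples
        for _ in range(iters):
            # assign each key subvector to its nearest centroid, then update centroids
            assign = np.argmin(((X[:, None, :] - C[None]) ** 2).sum(-1), axis=1)
            for c in range(n_cent):
                if np.any(assign == c):
                    C[c] = X[assign == c].mean(axis=0)
        codebooks.append(C)
    return codebooks

def encode_keys(K, codebooks):
    """Replace every key subvector by the index of its nearest centroid."""
    n_sub = len(codebooks)
    d_sub = K.shape[1] // n_sub
    codes = np.empty((K.shape[0], n_sub), dtype=np.int32)
    for s, C in enumerate(codebooks):
        X = K[:, s * d_sub:(s + 1) * d_sub]
        codes[:, s] = np.argmin(((X[:, None, :] - C[None]) ** 2).sum(-1), axis=1)
    return codes

def pq_scores(Q, codes, codebooks):
    """Approximate Q @ K.T: build per-subspace lookup tables of query-centroid
    dot products, then accumulate table gathers instead of elementwise MACs."""
    n_sub = len(codebooks)
    d_sub = Q.shape[1] // n_sub
    scores = np.zeros((Q.shape[0], codes.shape[0]))
    for s, C in enumerate(codebooks):
        lut = Q[:, s * d_sub:(s + 1) * d_sub] @ C.T          # (n_queries, n_cent) table
        scores += lut[:, codes[:, s]]                        # gather per key, sum over subspaces
    return scores

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    Q, K = rng.standard_normal((8, 64)), rng.standard_normal((128, 64))
    cbs = train_codebooks(K, n_sub=8, n_cent=16)
    approx = pq_scores(Q, encode_keys(K, cbs), cbs)
    print("mean |error| vs exact Q @ K.T:", np.abs(approx - Q @ K.T).mean())
```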