QSIM: A Quantum-inspired hierarchical semantic interaction model for text classification

Hui Gao,Peng Zhang,Jing Zhang,Chang Yang

doi:10.1016/j.neucom.2024.128658

Abstract

Semantic interaction modeling is a fundamental technology in natural language understanding that guides models to extract deep semantic information from text. Currently, the attention mechanism is one of the most effective techniques in semantic interaction modeling, which learns word-level attention representation by measuring the relevance between different words. However, the attention mechanism is limited to word-level semantic interaction, it cannot meet the needs of fine-grained interactive information for some text classification tasks. In recent years, quantum-inspired language modeling methods have successfully constructed quantized representations of language systems in Hilbert spaces, which use density matrices to achieve fine-grained semantic interaction modeling.This paper presents a Quantum-inspired hierarchical Semantic Interaction Model (QSIM), which follows the sememe-word-sentence language construction principle and utilizes quantum entanglement theory to capture hierarchical semantic interaction information in Hilbert space. Our work builds on the idea of the attention mechanism and extends it. Specifically, we explore the original semantic space from a quantum theory perspective and derive the core semantic space using the Schmidt decomposition technique, where: (1) Sememe is represented as the unit vector in the two-dimensional minimum semantic space; (2) Word is represented as reduced density matrices in the core semantic space, where Schmidt coefficients quantify sememe-level semantic interaction. Compared to density matrices, reduced density matrices capture fine-grained semantic interaction information with lower computational cost; (3) Sentence is represented as quantum superposition states of words, and the degree of word-level semantic interaction is measured using entanglement entropy.To evaluate the model’s performance, we conducted experiments on 15 text classification datasets. The experimental results demonstrate that our model is superior to classical neural network models and traditional quantum-inspired language models. Furthermore, the experiment also confirms two distinct advantages of QISM: (1) flexibility, as it can be integrated into various mainstream neural network text classification architectures; and (2) practicability, as it alleviates the problem of parameter growth inherent in density matrix calculation in quantum language model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

QSIM: A Quantum-inspired hierarchical semantic interaction model for text classification

Abstract

Talk to us

Similar Papers

More From: Neurocomputing

Lead the way for us

Similar Papers

Attention-induced semantic and boundary interaction network for camouflaged object detection
Qiao Zhang ... Hongbo Bi
Computer Vision and Image Understanding | VOL. 233
Qiao Zhang, et. al.Qiao Zhang ... Hongbo Bi
12 May 2023
Computer Vision and Image Understanding | VOL. 233

Image Caption Generation Using Contextual Information Fusion With Bi-LSTM-s
Huawei Zhang ... Jing Lian
IEEE Access | VOL. 11
Huawei Zhang, et. al.Huawei Zhang ... Jing Lian
01 Jan 2023
IEEE Access | VOL. 11

Multi-Channel Text Classification Model Based on ERNIE
Dongxue Bao ... Lila Hong
-
Dongxue Bao, et. al.Dongxue Bao ... Lila Hong
17 Nov 2022
17 Nov 2022

Research On Text Classification Based On Deep Neural Network
Deageon Kim
International Journal of Communication Networks and Information Security (IJCNIS) | VOL. 14
Deageon KimDeageon Kim
31 Dec 2022
International Journal of Communication Networks and Information Security (IJCNIS) | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

QSIM: A Quantum-inspired hierarchical semantic interaction model for text classification

Abstract

Talk to us

Similar Papers

More From: Neurocomputing