SRU-based Multi-angle Enhanced Network for Semantic Text Similarity Calculation of Big Data Language Model

Jing Huang,Keyu Ma

doi:10.4018/ijitsa.319039

Abstract

As a fundamental problem of natural language processing (NLP), the calculation of semantic text similarity plays a crucial role in a variety of big data application situations. In the process of text similarity modeling, however, owing to the complexity and ambiguity of Chinese semantics, effectively capturing the semantic interaction characteristics of Chinese text only from a single angle is impossible. This study proposes a deep learning-based computational model for semantic text similarity called SRU-based multi-angle enhanced network (SMAEN). Specifically, the authors firstly combine character-grained embeddings and word-granularity embeddings obtained from the pre-trained model to represent text. The text is encoded using a bidirectional simple recurrent unit (Bi-SRU) network, and the local text similarity is represented using a soft-aligned attention technique. In addition, the authors integrate Bi-SRU with an improved convolutional neural network (CNN) for global similarity modeling to capture semantic, time, and spatial characteristics of short text interaction. Finally, they employ a pooling layer to aggregate the calculation results into a fixed-length vector and a multi-layer perceptual (MLP) classifier to make a determination. Experimental results on Chinese public datasets LCQMC and PAWS-X show that the proposed method fully captures semantic interaction features from multiple angles and achieves advanced performance. This method can produce better matching results and enhance the accuracy of large data analysis. It is applicable to numerous scenarios involving large data, such as information retrieval and recommendation systems.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SRU-based Multi-angle Enhanced Network for Semantic Text Similarity Calculation of Big Data Language Model

Abstract

Talk to us

Similar Papers

More From: International Journal of Information Technologies and Systems Approach

Lead the way for us

Journal: International Journal of Information Technologies and Systems Approach	Publication Date: Mar 3, 2023
License type: CC BY 3.0

Similar Papers

The 2019 n2c2/OHNLP Track on Clinical Semantic Textual Similarity: Overview.
Yanshan Wang ... Hongfang Liu
JMIR medical informatics | VOL. 8
Yanshan Wang, et. al.Yanshan Wang ... Hongfang Liu
27 Nov 2020
The 2019 n2c2/OHNLP Track on Clinical Semantic Textual Similarity: Overview.
Yanshan Wang ... Hongfang Liu

Подходы к оценке семантического сходства текстов в многоязычном пространстве
Aleksey Klokov ... Michael Charnine
-
Aleksey Klokov, et. al.Aleksey Klokov ... Michael Charnine
23 Nov 2020
23 Nov 2020

Semantic Textual Similarity Methods, Tools, and Applications: A Survey
Goutam Majumder ... Alexander Gelbukh
Computación y Sistemas | VOL. 20
Goutam Majumder, et. al.Goutam Majumder ... Alexander Gelbukh
26 Dec 2016
Computación y Sistemas | VOL. 20

Semantic textual similarity between sentences using bilingual word semantics
Masaki Aono ... Md Shajalal
Progress in Artificial Intelligence | VOL. 8
Masaki Aono, et. al.Masaki Aono ... Md Shajalal
09 Mar 2019
Progress in Artificial Intelligence | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SRU-based Multi-angle Enhanced Network for Semantic Text Similarity Calculation of Big Data Language Model

Abstract

Talk to us

Similar Papers

More From: International Journal of Information Technologies and Systems Approach