Text Matching and Categorization: Mining Implicit Semantic Knowledge from Tree-Shape Structures

Lin Guo, Lin Yue, Tao Peng, Wanli Zuo

doi:10.1155/2015/723469

Abstract

The diversity of large-scale semistructured data makes extracting implicit semantic information enormously difficult. This paper proposes an automatic, unsupervised method of text categorization in which tree-shaped structures represent semantic knowledge and implicit information is explored by mining hidden structures, without cumbersome lexical analysis. Mining implicit frequent structures in trees can discover both direct and indirect semantic relations, which largely enhances the accuracy of matching and classifying texts. The experimental results show that the proposed algorithm remarkably reduces the time and effort spent on training and classification and outperforms established competitors in correctness and effectiveness.

Highlights

The rapid development of social networks brings explosive growth in users as well as dramatic changes in the services provided
Zaki and Aggarwal [4] propose a structural rule-based classifier for semistructured data, called XMiner, which can mine both parent-child and ancestor-descendant frequent branches and handles structured or semistructured data well; its shortcoming is the lack of semantic information in text representation
Semantic similarity assessment [7, 8] can be exploited to improve the accuracy of current information retrieval techniques [9], to automatically annotate documents [10, 11], to protect privacy [12, 13], to match web services [14], and to resolve problems based on knowledge reuse [15]

Summary

Introduction

The rapid development of social networks brings explosive growth in users as well as dramatic changes in the services provided. Semantic similarity assessment [7, 8] can be exploited to improve the accuracy of current information retrieval techniques [9], to automatically annotate documents [10, 11], to protect privacy [12, 13], to match web services [14], and to resolve problems based on knowledge reuse [15]. The proposed method can mine implicit semantic information without cumbersome lexical analysis: links express semantic knowledge, and pointers record a traversal sequence that describes how strongly each node contributes to expressing a text. The method extracts semantic information by creating trees and calculates the similarities of coexisting hidden structures to measure the similarities of texts. Its contributions include generating semantic trees by combining pointers with a fixed traversal strategy, with subtrees used as addenda structures, and discovering implicit knowledge by analyzing semantic trees and mining coexisting hidden structures.
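The idea of comparing texts through coexisting tree structures can be illustrated with a minimal sketch. It is not the paper's algorithm: the `Node` class, the two helper functions, the Jaccard-style score, and the toy trees are all assumptions introduced here for illustration. The sketch counts label pairs that are directly linked (parent-child) or indirectly linked (ancestor-descendant) in each tree, then scores two trees by how many pairs they share.

```python
from __future__ import annotations

from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Node:
    """A labeled tree node (hypothetical stand-in for a semantic tree node)."""
    label: str
    children: list[Node] = field(default_factory=list)


def relation_pairs(root: Node, direct_only: bool = False) -> Counter:
    """Count (upper label, lower label) pairs in the tree.

    direct_only=True keeps parent-child pairs only; otherwise every
    ancestor-descendant pair is counted, which includes indirect relations.
    """
    pairs: Counter = Counter()

    def walk(node: Node, ancestors: list[str]) -> None:
        # With direct_only, only the nearest ancestor (the parent) is paired.
        uppers = ancestors[-1:] if direct_only else ancestors
        for upper in uppers:
            pairs[(upper, node.label)] += 1
        for child in node.children:
            walk(child, ancestors + [node.label])

    walk(root, [])
    return pairs


def structure_similarity(a: Node, b: Node, direct_only: bool = False) -> float:
    """Jaccard-style overlap of the two trees' pair multisets (an assumed score)."""
    pa = relation_pairs(a, direct_only)
    pb = relation_pairs(b, direct_only)
    shared = sum((pa & pb).values())  # multiset intersection
    total = sum((pa | pb).values())   # multiset union
    return shared / total if total else 0.0


# Two toy trees that place "goal" under different intermediate nodes.
tree_a = Node("sports", [Node("football", [Node("goal")]), Node("match")])
tree_b = Node("sports", [Node("match", [Node("goal")])])

print(structure_similarity(tree_a, tree_b, direct_only=True))  # 0.25
print(structure_similarity(tree_a, tree_b))                    # 0.4
```

In this toy case the two trees share only one direct pair, (sports, match), but also share the indirect pair (sports, goal), so including ancestor-descendant relations raises the score from 0.25 to 0.4. This mirrors, in a simplified way, the claim that indirect relations carry matching information that direct links alone miss.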

Representation of Semantic Information

Mining Implicit Frequent Structures

Scoring Tactics

Experiment

Conclusion


Journal: Mathematical Problems in Engineering	Publication Date: Jan 1, 2015
Citations: 22	License type: CC BY 3.0


