Text Clustering via Term Semantic Units

Liping Jing,Jian Yu,Houkuan Huang,Jiali Yun

doi:10.1109/wi-iat.2010.23

Abstract

How best to represent text data is an important problem in text mining tasks including information retrieval, clustering, classification and etc.. In this paper, we proposed a compact document representation with term semantic units which are identified from the implicit and explicit semantic information. Among it, the implicit semantic information is extracted from syntactic content via statistical methods such as latent semantic indexing and information bottleneck. The explicit semantic information is mined from the external semantic resource (Wikipedia). The proposed compact representation model can map a document collection in a low-dimension space (term semantic units which are much less than the number of all unique terms). Experimental results on real data sets have shown that the compact representation efficiently improve the performance of text clustering.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Text Clustering via Term Semantic Units

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Joint Entity and Relation Extraction Network with Enhanced Explicit and Implicit Semantic Information
Huiyan Wu ... Jun Huang
Applied Sciences | VOL. 12
Huiyan Wu, et. al.Huiyan Wu ... Jun Huang
19 Jun 2022
Applied Sciences | VOL. 12

Combining Explicit and Implicit Semantic Similarity Information for Word Embeddings
Shi Yin ... Xiaoping Chen
-
Shi Yin, et. al.Shi Yin ... Xiaoping Chen
12 Mar 2018
12 Mar 2018

Imitation Learning-Based Implicit Semantic-Aware Communication Networks: Multi-Layer Representation and Collaborative Reasoning
Yong Xiao ... Guangming Shi
IEEE Journal on Selected Areas in Communications | VOL. 41
Yong Xiao, et. al.Yong Xiao ... Guangming Shi
01 Mar 2023
IEEE Journal on Selected Areas in Communications | VOL. 41

Deep Semantic Network Representation
Xuexiong Luo ... Jia Wu
-
Xuexiong Luo, et. al.Xuexiong Luo ... Jia Wu
01 Nov 2020
01 Nov 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Text Clustering via Term Semantic Units

Abstract

Talk to us

Similar Papers