Unsupervised Concept Hierarchy Learning: A Topic Modeling Guided Approach

V.S Anoop, S Asharaf, P Deepak

doi:10.1016/j.procs.2016.06.086

Open Access

https://doi.org/10.1016/j.procs.2016.06.086

Abstract

This paper proposes an efficient and scalable method, guided by a topic modeling process, for concept extraction and concept hierarchy learning from a large unstructured text corpus. The method derives “concepts” from statistically discovered “topics” and then learns a hierarchy of those concepts by exploiting a subsumption relation between them. An advantage of the proposed method is that the entire process falls under the unsupervised learning paradigm, so no domain-specific training corpus is needed. Given a massive collection of text documents, the method maps topics to concepts using lightweight statistical and linguistic processes and then probabilistically learns the subsumption hierarchy. Extensive experiments with large text corpora, such as the BBC News dataset and the Reuters News corpus, show that the proposed method outperforms some existing methods for concept extraction, and that efficient concept hierarchy learning is possible if the overall task is guided by a topic modeling process.
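
The abstract does not spell out how the subsumption relation is computed, so the following is only a minimal sketch under stated assumptions rather than the authors' procedure. It assumes the concepts have already been extracted, and it uses a common document co-occurrence heuristic: a concept x is taken to subsume a concept y when P(x | y) meets a threshold while P(y | x) stays below 1. The function name `subsumption_pairs`, the threshold of 0.8, and the toy documents are all hypothetical choices made for illustration.

```python
from itertools import permutations


def subsumption_pairs(docs, concepts, threshold=0.8):
    """Return (parent, child) pairs where parent is estimated to subsume child.

    Assumption (not taken from the paper): parent subsumes child when
    P(parent | child) >= threshold and P(child | parent) < 1, with both
    probabilities estimated from document co-occurrence counts.
    """
    # Map each concept to the set of document indices that mention it.
    doc_sets = {
        c: {i for i, text in enumerate(docs) if c in text.lower()}
        for c in concepts
    }
    pairs = []
    for parent, child in permutations(concepts, 2):
        # Skip concepts that never occur, to avoid division by zero.
        if not doc_sets[parent] or not doc_sets[child]:
            continue
        both = doc_sets[parent] & doc_sets[child]
        p_parent_given_child = len(both) / len(doc_sets[child])
        p_child_given_parent = len(both) / len(doc_sets[parent])
        if p_parent_given_child >= threshold and p_child_given_parent < 1.0:
            pairs.append((parent, child))
    return pairs


# Toy corpus, invented for illustration only.
docs = [
    "The economy grew as inflation slowed.",
    "Inflation worries weigh on the economy.",
    "The economy added jobs last month.",
    "Unemployment fell while the economy expanded.",
]
concepts = ["economy", "inflation", "unemployment"]
hierarchy = subsumption_pairs(docs, concepts)
print(hierarchy)
```

On this toy corpus, "economy" appears in every document that mentions "inflation" or "unemployment" but not the other way round, so it becomes the parent of both. In a pipeline like the one the abstract describes, the concept list would come from the topic modeling step instead of being written by hand.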

Journal: Procedia Computer Science
Publication Date: Jan 1, 2016
Citations: 22
License type: cc-by-nc-nd
