Automatically Generating a Concept Hierarchy with Graphs

Pucktada Treeratpituk,C Lee Giles,Madian Khabsa

doi:10.1145/2756406.2756967

Automatically Generating a Concept Hierarchy with Graphs

Pucktada Treeratpituk, C Lee Giles + Show 1 more

https://doi.org/10.1145/2756406.2756967

Copy DOI

Publication Date: Jun 21, 2015

Affiliation: Ministry of Science and Technology Thailand, Pennsylvania State University

#Large Text Corpus #Graph Partitioning Algorithm + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We propose a novel graph-based approach for constructing concept hierarchy from a large text corpus. Our algorithm incorporates both statistical co-occurrences and lexical similarity in optimizing the structure of the taxonomy. To automatically generate topic-dependent taxonomies from a large text corpus, we first extracts topical terms and their relationships from the corpus. The algorithm then constructs a weighted graph representing topics and their associations. A graph partitioning algorithm is then used to recursively partition the topic graph into a taxonomy. For evaluation, we apply our approach to articles, primarily computer science, in the CiteSeerX digital library and search engine.

Full Text