Abstract

We present heterogeneous networks as a way to unify lexical networks with relational data. We build a unified ACL Anthology network, tying together the citation, author collaboration, and term-cooccurence networks with affiliation and venue relations. This representation proves to be convenient and allows problems such as name disambiguation, topic modeling, and the measurement of scientific impact to be easily solved using only this network and off-the-shelf graph algorithms.

Highlights

  • Graph-based methods have been used to great effect in NLP, on problems such as word sense disambiguation (Mihalcea, 2005), summarization (Erkan and Radev, 2004), and dependency parsing (McDonald et al, 2005)

  • Throughout this paper, we will use the data from the ACL Anthology Network (AAN) (Bird et al, 2008; Radev et al, 2013), which contains additional metadata relationships not found in the ACL Anthology, as a typical heterogeneous network

  • We present a heterogeneous network treatment of the ACL Anthology Network and demonstrate several applications of it

Read more

Summary

Introduction

Graph-based methods have been used to great effect in NLP, on problems such as word sense disambiguation (Mihalcea, 2005), summarization (Erkan and Radev, 2004), and dependency parsing (McDonald et al, 2005). In this paper we will demonstrate heterogeneous networks, networks with multiple different types of nodes and edges, along with several applications of them. The applications in this paper are not presented so much as robust attempts to out-perform the current state-of-the-art, but rather attempts at being competitive against top methods with little effort beyond the construction of the heterogeneous network. Throughout this paper, we will use the data from the ACL Anthology Network (AAN) (Bird et al, 2008; Radev et al, 2013), which contains additional metadata relationships not found in the ACL Anthology, as a typical heterogeneous network. The results in this paper should be generally applicable to other heterogeneous networks

Heterogeneous AAN schema
Scientific Impact Measurement
Creating a synthetic AAN
Measuring impact on the synthetic AAN
Top-ranked entities according to heterogeneous network PageRank
Entity impact evolution
Name Disambiguation
Results
Random walk topic model
Topics from word graph clustering
Entity-topic association
Topic Model Evaluation
Topic Coherence
Extrinsic Evaluation
Related Work
Conclusion and Future Directions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.