Abstract

Finding relevant experts in specified areas is often crucial for a wide range of applications in both academia and industry. Given a user query and a large body of academic knowledge (e.g., academic papers), expert finding aims to find and rank the experts in that body of knowledge who are most relevant to the query. Existing studies mainly focus on embedding-based solutions that (1) measure the textual semantic similarity of academic papers to a given query through document representation models and (2) extract the top-$n$ experts with the greatest similarities. Beyond the implicit textual semantics of papers, however, the papers' explicit relationships (e.g., co-authorship, citation, and same-topic relationships) in a heterogeneous academic graph (e.g., DBLP) are critical for document representation, insofar as they help improve expert finding quality. Despite their importance, these explicit relationships have generally been ignored in the literature. In this paper, we study academic expert finding on heterogeneous graphs by considering the explicit relationships alongside the implicit textual semantics of papers in one representation model. Specifically, we first define the $(k,\mathcal{P})$-core to denote a cohesive community of papers that are closely connected via a meta-path $\mathcal{P}$ (where $\mathcal{P}$ indicates the different relationships of papers). We then propose an offline $(k,\mathcal{P})$-core based document embedding model to capture papers' various explicit relationships for representation. Moreover, using the papers' embeddings, we present an online threshold algorithm (TA)-based method that efficiently returns the top-$n$ experts via a carefully designed proximity graph-based index (PG-Index). We extend our approach to support multiple relationships simultaneously for representation. Extensive experiments over real-world datasets demonstrate the effectiveness and efficiency of our approach.
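
As a rough illustration of the $(k,\mathcal{P})$-core idea, the following minimal sketch links two papers whenever they share an author (assuming the meta-path Paper-Author-Paper) and then repeatedly peels away papers with fewer than $k$ such neighbors. The abstract does not give the formal definition, so the degree-based peeling, the function names `meta_path_neighbors` and `k_core_papers`, and the toy data are all assumptions made for illustration; this is not the paper's algorithm, and the sketch returns every surviving paper rather than a single connected community.

```python
from collections import defaultdict
from itertools import combinations


def meta_path_neighbors(paper_authors):
    """Link two papers when they share an author (assumed meta-path Paper-Author-Paper)."""
    by_author = defaultdict(set)
    for paper, authors in paper_authors.items():
        for author in authors:
            by_author[author].add(paper)
    neighbors = {paper: set() for paper in paper_authors}
    for papers in by_author.values():
        for p, q in combinations(sorted(papers), 2):
            neighbors[p].add(q)
            neighbors[q].add(p)
    return neighbors


def k_core_papers(neighbors, k):
    """Peel papers with fewer than k meta-path neighbors until none remain."""
    alive = set(neighbors)
    degree = {p: len(neighbors[p]) for p in alive}
    queue = [p for p in alive if degree[p] < k]
    while queue:
        p = queue.pop()
        if p not in alive:
            continue  # already peeled via an earlier queue entry
        alive.discard(p)
        for q in neighbors[p]:
            if q in alive:
                degree[q] -= 1
                if degree[q] < k:
                    queue.append(q)
    return alive


# Hypothetical toy data: paper id -> set of author names.
paper_authors = {
    "p1": {"alice", "bob"},
    "p2": {"alice", "bob"},
    "p3": {"alice", "carol"},
    "p4": {"carol"},
    "p5": {"dave"},
}
core = k_core_papers(meta_path_neighbors(paper_authors), k=2)
print(sorted(core))  # ['p1', 'p2', 'p3']
```

Here `p4` has only one co-authorship neighbor and `p5` has none, so both are peeled for $k=2$, leaving the three papers that are tightly linked through shared authors. Other relationships named in the abstract (citation, same topic) would correspond to different meta-paths and hence different neighbor sets.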
