Abstract
Recent years have witnessed the boom of heterogeneous information network (HIN), which contains different types of nodes and relations. Complex structure and rich semantics are unique features of HIN. Meta-path, the sequence of object types and relations connecting them, has been widely used to mine this semantic information in HIN. Link prediction is an important data mining task to predict the potential links among nodes, which are required in many applications, e.g., filling missing links. The contemporary link prediction is usually based on simple HIN whose schema is bipartite or star schema. In these works, the meta-paths should be predefined or enumerated. However, in many real networked data, it is hard to describe their network structures with simple schema. For example, the RDF-formatted Knowledge Graph which includes tens of thousands types of objects and links is a kind of schema-rich HIN. In this kind of schema-rich HIN, it is impossible to enumerate meta-paths so that the contemporary work is invalid. In this paper, we study link prediction in schema-rich HIN and propose a novel method named Link Prediction with automatic meta Path (LiPaP). The LiPaP designs an algorithm called automatic meta-path generation to automatically extract meta-paths from schema-rich HIN in the approximate order of relevance and adopt a supervised method with likelihood function to learn the weights of extracted meta-paths. Extensive experiments on real knowledge database, Yago, demonstrate that LiPaP is an effective, steady and efficient approach.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: International Journal of Data Science and Analytics
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.