Diting: An Author Disambiguation Method Based on Network Representation Learning

Liwen Peng,Yongquan Fu,Siqi Shen,Dongsheng Li,Jun Xu,Adele Lu Jia

doi:10.1109/access.2019.2942477

Abstract

It is important to disambiguate names among persons in many scenarios. In this work, we propose an unsupervised method Diting and a semi-supervised method Diting++ for author disambiguation. In Diting, we learn a low-dimensional vector to represent each paper in networks, which are formed by connecting papers with multiple types of relationship (such as co-author). During representation learning, we focus on maximizing the gap between positive edges and negative edges. Further, we propose a clustering algorithm which associates papers to their real-life authors. To make full use of the authorship information, which is easy to obtain from the authors' homepages, we design Diting++ to improve the performance for name disambiguation. Diting++ uses the authorship information listed on the authors' homepages to construct label networks and uses a network representation learning method to learn paper representations based on label networks and other networks. Further, Diting++ uses a semi-supervised clustering method to partition learned paper representations into disjoint groups. Each group belongs to a distinct author. By making use of the label information, the clustering method partitions papers written by the same author in the same group, whereas papers written by different authors locate in different groups. Through extensive experiments, we show that our methods are significantly better than the state-of-the-art author disambiguation methods.

Highlights

We focus on author disambiguation that associates documents to different persons who share an identical name
We propose a novel network representation learning method for author disambiguation, which models multiple types of paper relationships to paper representations
We find that our unsupervised method Diting can obtain at least 5.7% better Marco-F1 result than the other author disambiguation methods, and our semi-supervised method Diting++ can obtain at least 10.9% better Marco-F1 result

Summary

Introduction

When we search for documents about one particular author in the field of literature search, we may get many results (e.g., papers, web pages) containing the author’s name. Even those documents share the same name we search for, they can be different peoples. A search query for the name ‘‘Mark Newman’’ could obtain a physicist who works in the University of Michigan, a computer scientist who works in the same university, and so on. Apart from these, the ambiguous name problem appears in many other fields, such as law enforcement and bibliometrics science.

Objectives

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2019
Citations: 5	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Diting: An Author Disambiguation Method Based on Network Representation Learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

SBiNE: Signed Bipartite Network Embedding
Youwen Zhang ... Dengcheng Yan
-
Youwen Zhang, et. al.Youwen Zhang ... Dengcheng Yan
01 Jan 2020
01 Jan 2020

Research on Learning Method of Attribute Network Representation for Edge Node Discovery
Yiming Wang ... Yimei Zhang
-
Yiming Wang, et. al.Yiming Wang ... Yimei Zhang
01 Jul 2021
01 Jul 2021

Network representation learning via improved random walk with restart
Yanan Zhang ... Zhili Zhao
Knowledge-Based Systems | VOL. 263
Yanan Zhang, et. al.Yanan Zhang ... Zhili Zhao
13 Jan 2023
Knowledge-Based Systems | VOL. 263

A Network-embedding Based Method for Author Disambiguation
Jun Xu ... Dongsheng Li
-
Jun Xu, et. al.Jun Xu ... Dongsheng Li
17 Oct 2018
17 Oct 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Diting: An Author Disambiguation Method Based on Network Representation Learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access