Swash: A collective personal name matching framework

Mohsen Raeesi,Masoud Asadpour,Azadeh Shakery

doi:10.1016/j.eswa.2019.113115

Abstract

Having a unique personal identifier is a prerequisite to run person-centric analytical queries and data mining tasks, such as fraud detection, expert finding, and credit scoring. Personal names are the most commonly used identifier of individuals in datasets; however, the name of a person may not be unique across the dataset's records, especially where data are integrated from various sources. Intelligent systems utilize name matching methods to identify different name representations of persons. The performance of previous name matching methods is inadequate since they solely consider name similarities and ignore dissimilarities. Unavailability of Part of Name (PON, e.g., first name and last name) is an important limitation of dissimilarity consideration. To address this issue, this paper proposes an unsupervised personal name matching framework, namely Swash. This framework can model the information gatherable from a name dataset into a layered Heterogeneous Information Network, which facilitates control over the learning process. Swash predicts PON tags using a self-trainable algorithm and then collectively clusters the name vertices on the network. Evaluations on three public bibliographic datasets (i.e., CiteSeer, ArXiv, and DBLP) recognize the significance of the proposed framework. The results showed that Swash outperformed F1 of the state-of-the-art method up to 38%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Swash: A collective personal name matching framework

Abstract

Talk to us

Similar Papers

More From: Expert Systems With Applications

Lead the way for us

Journal: Expert Systems With Applications	Publication Date: Dec 2, 2019
Citations: 1

Similar Papers

Estimating person-based injury incidence: accuracy of an algorithm to identify readmissions from hospital discharge data
Gabrielle Davie ... Dave Barson
Injury Prevention | VOL. 17
Gabrielle Davie, et. al.Gabrielle Davie ... Dave Barson
27 Jul 2011
Injury Prevention | VOL. 17

Name-Ethnicity Classification and Ethnicity-Sensitive Name Matching
Pucktada Treeratpituk ... C. Lee Giles
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 26
Pucktada Treeratpituk, et. al.Pucktada Treeratpituk ... C. Lee Giles
20 Sep 2021
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 26

About the Role of a Unique Personal Identifier in Collaboration Across Organizational Boundaries
...
Arbor-ciencia Pensamiento Y Cultura | VOL. -
, et. al. ...
01 Jan 2015
Arbor-ciencia Pensamiento Y Cultura | VOL. -

Membership Detection Using Cooperative Data Mining Algorithms
Calvin Newport ... Yiqing Ren
-
Calvin Newport, et. al.Calvin Newport ... Yiqing Ren
28 Apr 2014
28 Apr 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Swash: A collective personal name matching framework

Abstract

Talk to us

Similar Papers

More From: Expert Systems With Applications