Abstract

The phenomenon of person name ambiguity is widespread on web pages in that one name may be used by different people. It is important to uniquely identify the given person on the web. In this paper, the method Baidu-PND is proposed by the authors. It is an unsupervised name disambiguation method based on Baidu Encyclopedia. We extract three features including background knowledge, contextual feature and Related-Set of the characters from the online Baidu Encyclopedia. The weights of the features are studied by logistic regression algorithm. Then we make a linear fusion of the features. The maximum combined value is selected as the correct person on web pages. Experiments are conducted to measure the performance of Baidu-PND, which show that the performance is higher than we expected, validating its feasibility and effectiveness for person name disambiguation on web pages. And, Baidu-PND is a new method for knowledge mining based on Baidu Encyclopedia.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call