Abstract

The research of distinction of name ambiguity in the field of information retrieval could enhance searching effect. Therefore, it plays an important role to mine the data of name ambiguity in order to obtain useful knowledge. In this paper, we focus on the problem of traditional evaluation and ranking method used in the clustering. Traditional evaluation and ranking method ignores the association among the subinformation and simply considers that pieces of subinformation are mutual independent. We present an effective data mining method framework based on the case study and association analysis. The method framework is evaluated on the dataset of name ambiguity from the database of CDBLP. The dataset includes the information of coauthor name, workplace, publication, years and ranking of the author of papers. The experimental results show that one piece of main sub-information assisted by some minors could form a stronger rule very useful for the distinction of name ambiguity. Also some combinations of pieces of minor sub-information could produce a stronger rule. The association rules selected by the experiment could be easily explained and commonsensible. Considering the association rules coming from the objective data and data mining method, they are more reliable.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call