Link and Annotation Prediction Using Topology and Feature Structure in Large Scale Social Networks

Burak Isikli,Fatih Erdogan Sevilgen,Mustafa Kirac

doi:10.1007/978-3-319-06932-6_23

Abstract

Repeated patterns observed in graph and network structures can be utilized for predictive purposes in various domains including cheminformatics, bioinformatics, political sciences, and sociology. In large scale network structures like social networks, graph theoretical link and annotation prediction algorithms are usually not applicable due to graph isomorphism problem, unless some form of approximation is applied. We propose a non-graph theoretical alternative to link and annotation prediction in large networks by flattening network structures into feature vectors. We extract repeated sub-network pattern vectors for the nodes of a network, and utilize traditional machine learning algorithms for estimating missing or unknown annotations and links in the network. Our main contribution is a novel method for extracting features from large scale networks, and evaluation of the benefit each extraction method provides. We applied our methodology for suggesting new Twitter friends. In our experiments, we observed 11-27% improvement in prediction accuracy when compared to the simple methodology of suggesting friends of friends.Keywordssocial networksdata mining and knowledge discoverybig databusiness intelligencelink predictiongraph processinggraph mining

Full Text