Experimental Evaluation of Train and Test Split Strategies in Link Prediction

Gerrit Jan De Bruin,Frank W Takes,H Jaap Van Den Herik,Cor J Veenman

doi:10.1007/978-3-030-65351-4_7

Abstract

In link prediction, the goal is to predict which links will appear in the future of an evolving network. To estimate the performance of these models in a supervised machine learning model, disjoint and independent train and test sets are needed. However, objects in a real-world network are inherently related to each other. Therefore, it is far from trivial to separate candidate links into these disjoint sets.Here we characterize and empirically investigate the two dominant approaches from the literature for creating separate train and test sets in link prediction, referred to as random and temporal splits. Comparing the performance of these two approaches on several large temporal network datasets, we find evidence that random splits may result in too optimistic results, whereas a temporal split may give a more fair and realistic indication of performance. Results appear robust to the selection of temporal intervals. These findings will be of interest to researchers that employ link prediction or other machine learning tasks in networks.KeywordsLink predictionPerformance estimationMachine learning

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Experimental Evaluation of Train and Test Split Strategies in Link Prediction

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Classification of High‐Activity Tiagabine Analogs by Binary QSAR Modeling
Andreas Jurik ... Gerhard F Ecker
Molecular Informatics | VOL. 32
Andreas Jurik, et. al.Andreas Jurik ... Gerhard F Ecker
15 May 2013
Molecular Informatics | VOL. 32

Machine Learning Applications in Orthopaedic Imaging.
Vincent M Wang ... Albert J Kozar
The Journal of the American Academy of Orthopaedic Surgeons | VOL. 28
Vincent M Wang, et. al.Vincent M Wang ... Albert J Kozar
15 May 2020
The Journal of the American Academy of Orthopaedic Surgeons | VOL. 28

Generalizability of deep learning models for predicting outdoor irregular walking surfaces
Vaibhav Shah ... Philippe C Dixon
Journal of Biomechanics | VOL. 139
Vaibhav Shah, et. al.Vaibhav Shah ... Philippe C Dixon
26 May 2022
Journal of Biomechanics | VOL. 139

Pushing the limits of solubility prediction via quality-oriented data selection.
Murat Cihan Sorkun ... Süleyman Er
iScience | VOL. 24
Murat Cihan Sorkun, et. al.Murat Cihan Sorkun ... Süleyman Er
17 Dec 2020
iScience | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Experimental Evaluation of Train and Test Split Strategies in Link Prediction

Abstract

Talk to us

Similar Papers