Improving the vector auto regression technique for time-series link prediction by using support vector machine

Jan Miles Co,Proceso Fernandez

doi:10.1051/matecconf/20165601008

Abstract

Predicting links between the nodes of a graph has become an important Data Mining task because of its direct applications to biology, social networking, communication surveillance, and other domains. Recent literature in time-series link prediction has shown that the Vector Auto Regression (VAR) technique is one of the most accurate for this problem. In this study, we apply Support Vector Machine (SVM) to improve the VAR technique that uses an unweighted adjacency matrix along with 5 matrices: Common Neighbor (CN), Adamic-Adar (AA), Jaccard’s Coefficient (JC), Preferential Attachment (PA), and Research Allocation Index (RA). A DBLP dataset covering the years from 2003 until 2013 was collected and transformed into time-sliced graph representations. The appropriate matrices were computed from these graphs, mapped to the feature space, and then used to build baseline VAR models with lag of 2 and some corresponding SVM classifiers. Using the Area Under the Receiver Operating Characteristic Curve (AUC-ROC) as the main fitness metric, the average result of 82.04% for the VAR was improved to 84.78% with SVM. Additional experiments to handle the highly imbalanced dataset by oversampling with SMOTE and undersampling with K-means clusters, however, did not improve the average AUC-ROC of the baseline SVM.

Highlights

One of the major problems in network analysis involves predicting the existence or emergence of links given a network
Because the Vector Auto Regression (VAR) model assumes a linear dependence of the temporal links on multiple time-series, we propose the use of Support Vector Machine (SVM) in order to more robustly handle a non-linear type of dependency even while retaining the assumption that the dependency is on multiple time-series
We were able to improve the performance of the VAR model by transforming its input multivariate time-series data as a feature set vector that was used as a training set to linear SVM

Summary

Introduction

One of the major problems in network analysis involves predicting the existence or emergence of links given a network. Most of the previous works on link prediction use a static network to predict hidden or future links. In the detection of hidden links, the network is based on a known partial snapshot, and the objective is to predict currently existing links [4]. In the prediction of future links, the network is based on a snapshot at time t, and the objective is to predict links at time t’ (t’ > t) [5]. In this framework, insight regarding the dynamics of the network is disregarded, and information on the occurrence and frequency of links across time is lost. Recent works on link prediction use a dynamic network where the network is characterized by a series of snapshots that represent the network across time [4, 5]

Objectives

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: MATEC Web of Conferences	Publication Date: Jan 1, 2016
Citations: 1	License type: cc-by

R Discovery Prime

R Discovery Prime

Improving the vector auto regression technique for time-series link prediction by using support vector machine

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: MATEC Web of Conferences

Lead the way for us

Similar Papers

Computer-Based Classification of Dermoscopy Images of Melanocytic Lesions on Acral Volar Skin
Hitoshi Iyatomi ... Masaru Tanaka
Journal of Investigative Dermatology | VOL. 128
Hitoshi Iyatomi, et. al.Hitoshi Iyatomi ... Masaru Tanaka
01 Aug 2008
Journal of Investigative Dermatology | VOL. 128

Impact of Intraoperative Data on Risk Prediction for Mortality After Intra-Abdominal Surgery.
Xinyu Yan ... Minjae Kim
Anesthesia and analgesia | VOL. 134
Xinyu Yan, et. al.Xinyu Yan ... Minjae Kim
02 Sep 2021
Anesthesia and analgesia | VOL. 134

CT texture analysis for the differentiation of papillary renal cell carcinoma subtypes.
Chongfeng Duan ... Lei Niu
Abdominal Radiology | VOL. 45
Chongfeng Duan, et. al.Chongfeng Duan ... Lei Niu
22 May 2020
Abdominal Radiology | VOL. 45

WE‐C‐BRA‐01: Best in Physics (Joint Imaging‐Therapy) ‐ Modeling Pathologic Response of Locally Advanced Esophageal Cancer to Chemoradiotherapy Using Spatial‐Temporal FDG‐PET Features, Clinical Parameters and Demographics
H Zhang ... M Suntharalingam
Medical Physics | VOL. 39
H Zhang, et. al.H Zhang ... M Suntharalingam
01 Jun 2012
Medical Physics | VOL. 39

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving the vector auto regression technique for time-series link prediction by using support vector machine

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: MATEC Web of Conferences