Least squares estimation of spatial autoregressive models for large-scale social networks

Danyang Huang, Hansheng Wang, Hao Helen Zhang, Wei Lan

doi:10.1214/19-ejs1549


Open Access

https://doi.org/10.1214/19-ejs1549


Journal: Electronic Journal of Statistics
Publication Date: Jan 1, 2019
Citations: 12
License type: cc-by

Affiliation: Peking University

Abstract

Due to the rapid development of various social networks, the spatial autoregressive (SAR) model is becoming an important tool in social network analysis. However, major bottlenecks remain in analyzing large-scale networks (e.g., Facebook has over 700 million active users), including computational scalability, estimation consistency, and proper network sampling. To address these challenges, we propose a novel least squares estimator (LSE) for analyzing large sparse networks based on the SAR model. Computationally, the LSE is linear in the network size, making it scalable to analysis of huge networks. In theory, the LSE is $\sqrt{n}$-consistent and asymptotically normal under certain regularity conditions. A new LSE-based network sampling technique is further developed, which can automatically adjust autocorrelation between sampled and unsampled units and hence guarantee valid statistical inferences. Moreover, we generalize the LSE approach for the classical SAR model to more complex networks associated with multiple sources of social interaction effect. Numerical results for simulated and real data are presented to illustrate performance of the LSE.
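The least squares idea can be illustrated with a small, self-contained sketch. It assumes the classical SAR model $Y = \rho W Y + \varepsilon$ with a row-normalized weight matrix $W$ and Gaussian noise, and it minimizes the squared gap between each $Y_i$ and its conditional mean given the other nodes. That objective is an assumption made for illustration and may differ in detail from the estimator defined in the paper; the function names (`row_normalize`, `simulate_sar`, `conditional_ls`, `demo`), the random graph, and the grid search are likewise hypothetical.

```python
import random


def row_normalize(adj):
    """Adjacency lists [[j, ...], ...] -> sparse rows [(j, w_ij), ...] summing to 1."""
    return [[(j, 1.0 / len(nbrs)) for j in nbrs] if nbrs else [] for nbrs in adj]


def matvec(W, y):
    """Sparse product W @ y, one pass over the edges."""
    return [sum(w * y[j] for j, w in row) for row in W]


def matvec_t(W, y):
    """Sparse product W.T @ y, one pass over the edges."""
    out = [0.0] * len(W)
    for i, row in enumerate(W):
        for j, w in row:
            out[j] += w * y[i]
    return out


def simulate_sar(W, rho, rng, sweeps=100):
    """Draw Y from Y = rho * W @ Y + eps by fixed-point iteration.

    The iteration is a contraction when |rho| < 1 and W is row-normalized.
    """
    eps = [rng.gauss(0.0, 1.0) for _ in W]
    y = eps[:]
    for _ in range(sweeps):
        wy = matvec(W, y)
        y = [rho * a + e for a, e in zip(wy, eps)]
    return y


def conditional_ls(W, y):
    """Grid-search minimizer of sum_i (Y_i - E[Y_i | Y_{-i}; rho])^2.

    Assumed objective (illustrative, not necessarily the paper's exact form).
    With precision matrix proportional to (I - rho W)^T (I - rho W):
      E[Y_i | Y_{-i}] = (rho * a_i - rho^2 * b_i) / (1 + rho^2 * d_i)
    where a = W y + W^T y, d_i = sum_k w_ki^2, b = W^T W y - d * y.
    """
    n = len(W)
    wy = matvec(W, y)
    wty = matvec_t(W, y)
    wtwy = matvec_t(W, wy)
    d = [0.0] * n
    for row in W:
        for j, w in row:
            d[j] += w * w
    a = [wy[i] + wty[i] for i in range(n)]
    b = [wtwy[i] - d[i] * y[i] for i in range(n)]

    def loss(rho):
        r2 = rho * rho
        return sum(
            (y[i] - (rho * a[i] - r2 * b[i]) / (1.0 + r2 * d[i])) ** 2
            for i in range(n)
        )

    candidates = [k / 100.0 for k in range(-95, 96)]  # rho in [-0.95, 0.95]
    return min(candidates, key=loss)


def demo(seed=0, n=2000, out_degree=3, rho=0.4):
    """Simulate a sparse directed network and return the estimated rho."""
    rng = random.Random(seed)
    # Each node follows `out_degree` distinct other nodes (no self-loops).
    adj = [
        [j if j < i else j + 1 for j in rng.sample(range(n - 1), out_degree)]
        for i in range(n)
    ]
    W = row_normalize(adj)
    y = simulate_sar(W, rho, rng)
    return conditional_ls(W, y)


if __name__ == "__main__":
    print("estimated rho:", demo())
```

Calling `demo()` simulates a 2000-node network with three out-links per node and returns the grid point that minimizes the objective. Everything that depends on the network is computed once with sparse passes over the edge list, so the cost grows linearly with the number of edges, which is the property the abstract highlights.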

Highlights

We consider a network with $n$ nodes.
We develop a novel sampling scheme tailored to the least squares estimator (LSE) approach, and further show that the sampled data lead to consistent estimation of the spatial autoregressive (SAR) model.
It would be intriguing to study the problem without the network sparsity assumption.

Summary

Introduction

We consider a network with $n$ nodes. An adjacency matrix $A = (a_{ij}) \in \mathbb{R}^{n \times n}$ can be defined to describe the network structure. Huang et al. (2018) proposed a pseudo-likelihood estimator for the SAR model with random effects. Because this is a likelihood-type method, complex matrix computation (e.g., of a log-determinant) is needed. More efficient algorithms have been proposed (Barry and Pace, 1999; Smirnov and Anselin, 2001; LeSage and Pace, 2007). These methods usually rely on stringent assumptions on $I_n - \rho W$, which can hardly hold for real social network data. Better techniques for network sampling are also needed to ensure consistent estimation of the social interaction effect. Motivated by these challenges, we propose a novel, fast and scalable estimation method for the SAR model.
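As background for the log-determinant mentioned above, the SAR model and its Gaussian log-likelihood are commonly written as follows. The notation is an assumption added here for illustration: $W$ denotes a row-normalized version of $A$, $\rho$ the network autocorrelation parameter, and $\varepsilon \sim N(0, \sigma^2 I_n)$ the noise.

$$
Y = \rho W Y + \varepsilon, \qquad
\ell(\rho, \sigma^2) = -\frac{n}{2}\log(2\pi\sigma^2) + \log\lvert I_n - \rho W\rvert - \frac{1}{2\sigma^2}\lVert (I_n - \rho W) Y \rVert^2 .
$$

The term $\log\lvert I_n - \rho W\rvert$ has to be re-evaluated for every candidate $\rho$, and a dense factorization of an $n \times n$ matrix costs on the order of $n^3$ operations, which is why likelihood-type methods are hard to scale to very large networks.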

Motivation

Least squares estimation

Asymptotic properties

New LSE-based scheme for sampling networks

Numerical studies

Performance of the LSE

Performance of the sample-LSE

Performance of the mLSE

Sina Weibo network analysis

Conclusion

Proof of Proposition 1 and Proposition 3

Proof of Proposition 2

Proof of Theorem 1

Proof of Theorem 2


Similar Papers

Sensitivity analysis of SAR estimators: a numerical approximation
Shuangzhe Liu ... Richard Sellner
Journal of Statistical Computation and Simulation | VOL. 82 | 01 Feb 2012

Randomized algorithms of maximum likelihood estimation with spatial autoregressive models for large-scale networks
Miaoqi Li ... Emily L Kang
Statistics and Computing | VOL. 29 | 14 Feb 2019

QML and Efficient GMM Estimation of Spatial Autoregressive Models with Dominant (Popular) Units
Lung-Fei Lee ... Jihai Yu
Journal of Business & Economic Statistics | VOL. 41 | 16 Mar 2022

Spatial autoregressive (SAR) model for average expenditure of Papua Province
Syarifah Diana Permai ... Andry Chowanda
Procedia Computer Science | VOL. 157 | 01 Jan 2019
