Consistency and asymptotic normality of stochastic block models estimators from sampled data

Mahendra Mariadassou, Timothée Tabouy

doi:10.1214/20-ejs1750

Abstract

Statistical analysis of networks is an active research area, and a large literature is devoted to network models and their statistical analysis. However, very few papers deal with missing data in network analysis, even though, in practice, networks are often observed with missing values. In this paper we focus on the Stochastic Block Model with valued edges and consider a missing completely at random (MCAR) setting by assuming that every dyad (pair of nodes) is sampled identically and independently of the others with probability $\rho > 0$. We prove that the maximum likelihood estimators and their variational approximations are consistent and asymptotically normal in the presence of missing data as soon as the sampling probability $\rho$ satisfies $\rho \gg \log(n)/n$.
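The sampling scheme described in the abstract can be illustrated with a short simulation. The sketch below is not the authors' code: it assumes Poisson-valued edges, two blocks, an undirected graph without self-loops, and the hypothetical helper name `sample_sbm_mcar`. Only the MCAR mechanism (each dyad kept independently with probability $\rho$) comes from the text.

```python
import numpy as np

def sample_sbm_mcar(n, alpha, lam, rho, seed=0):
    """Simulate an undirected valued SBM and observe each dyad with probability rho.

    alpha: block proportions (assumed to sum to 1).
    lam:   symmetric matrix of Poisson means per block pair (illustrative choice).
    Returns labels z, the observed matrix (NaN where unsampled), and the sampling mask r.
    """
    rng = np.random.default_rng(seed)
    alpha = np.asarray(alpha, dtype=float)
    lam = np.asarray(lam, dtype=float)

    # Latent block label of each node.
    z = rng.choice(len(alpha), size=n, p=alpha)

    # One value per dyad (i < j), drawn from the mean of its block pair.
    rows, cols = np.triu_indices(n, k=1)
    values = rng.poisson(lam[z[rows], z[cols]]).astype(float)

    # MCAR sampling: independent of the values and of the labels.
    sampled = rng.random(values.size) < rho

    y_obs = np.full((n, n), np.nan)
    y_obs[rows, cols] = np.where(sampled, values, np.nan)
    y_obs[cols, rows] = y_obs[rows, cols]

    r = np.zeros((n, n), dtype=bool)
    r[rows, cols] = sampled
    r[cols, rows] = sampled
    return z, y_obs, r

n, rho = 200, 0.3
z, y_obs, r = sample_sbm_mcar(n, [0.5, 0.5], [[3.0, 1.0], [1.0, 2.0]], rho)
print("fraction of sampled dyads:", r[np.triu_indices(n, k=1)].mean())
print("log(n)/n for comparison:", np.log(n) / n)
```

Comparing the sampled fraction with $\log(n)/n$ only gives a feel for the regime $\rho \gg \log(n)/n$; the condition itself is asymptotic and cannot be checked on a single finite graph.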

Highlights

For the last decade, statistical network analysis has been a very active research topic, and the statistical modeling of networks has found many applications in social sciences and biology; see, for example, Aicher et al (2014), Barbillon et al (2015), Mariadassou et al (2010), Wasserman and Faust (1994) and Zachary (1977). Many random graph models have been widely studied, either from a theoretical or an empirical point of view.
In Celisse et al (2012), consistency of the maximum likelihood estimator (MLE) and the variational estimator (VE) is proven, but asymptotic normality requires that the estimators converge at rate at least $n^{-1}$, which is not proven in that paper. Some results were available for particular cases.
According to Equation (2.2), if the sampling design is missing completely at random (MCAR), maximising $p_{\theta,\psi}(y^o, z, r)$ or $p_{\theta,\psi}(y^o, r)$ in $\theta$ is equivalent to maximising $p_\theta(y^o)$ in $\theta$. This corresponds to the notion of ignorability defined in Rubin (1976).
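The ignorability statement can be made concrete with the standard factorisation argument. The lines below are a sketch under stated assumptions, not a reproduction of the paper's Equation (2.2): $y^m$ denotes the unobserved dyads (notation introduced here), the sampling parameter $\psi$ is assumed distinct from $\theta$, and the integral should be read as a sum when the edge values are discrete.

```latex
\begin{align*}
p_{\theta,\psi}(y^o, r)
  &= \sum_{z} \int p_\theta(y^o, y^m, z)\, p_\psi(r \mid y^o, y^m, z)\, dy^m \\
  &= p_\psi(r) \sum_{z} \int p_\theta(y^o, y^m, z)\, dy^m
     && \text{(MCAR: $r$ is independent of $(y, z)$)} \\
  &= p_\psi(r)\, p_\theta(y^o),
\end{align*}
so that
\[
\arg\max_{\theta}\, p_{\theta,\psi}(y^o, r) = \arg\max_{\theta}\, p_\theta(y^o).
\]
```

The factor $p_\psi(r)$ does not involve $\theta$, which is why the sampling mechanism can be dropped when estimating $\theta$.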

Summary

Introduction

Statistical network analysis has been a very active research topic, and the statistical modeling of networks has found many applications in social sciences and biology; see, for example, Aicher et al (2014), Barbillon et al (2015), Mariadassou et al (2010), Wasserman and Faust (1994) and Zachary (1977). In Celisse et al (2012), consistency of the maximum likelihood estimator (MLE) and the variational estimator (VE) is proven, but asymptotic normality requires that the estimators converge at rate at least $n^{-1}$, which is not proven in that paper. Some results were available for particular cases (affiliation, for example). There is a strong asymmetry between the presence of an edge and its absence: the lack of proof that an edge exists is taken as proof that the edge does not exist, and edges with uncertain status are considered as non-existent in the graph. This is the strategy adopted in most sparse asymptotic settings, where the density of edges goes to 0 asymptotically (Bickel et al, 2013). Technical lemmas and details of the proofs are available in the appendices.
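The cost of treating unobserved dyads as absent edges can be seen in a small experiment. The sketch below is purely illustrative and makes simplifying assumptions that are not in the paper: edges are binary, the block labels are known, and the helper names `block_means` and `compare_estimators` are hypothetical. It compares block-wise edge frequencies computed with unsampled dyads coded as 0 against frequencies computed on sampled dyads only.

```python
import numpy as np

def block_means(values, keep, zi, zj, q):
    """Average `values` over the kept dyads of each (unordered) block pair."""
    est = np.full((q, q), np.nan)
    for a in range(q):
        for b in range(a, q):
            in_block = ((zi == a) & (zj == b)) | ((zi == b) & (zj == a))
            mask = in_block & keep
            if mask.any():
                est[a, b] = est[b, a] = values[mask].mean()
    return est

def compare_estimators(n=400, rho=0.5, seed=0):
    rng = np.random.default_rng(seed)
    pi = np.array([[0.6, 0.1], [0.1, 0.4]])  # illustrative connection probabilities

    # Labels are drawn at random but treated as known (simplifying assumption).
    z = rng.integers(0, 2, size=n)
    rows, cols = np.triu_indices(n, k=1)
    zi, zj = z[rows], z[cols]

    edges = (rng.random(rows.size) < pi[zi, zj]).astype(float)
    sampled = rng.random(rows.size) < rho  # MCAR dyad sampling

    # Naive: every dyad counts, unsampled ones are coded as "no edge".
    naive = block_means(np.where(sampled, edges, 0.0),
                        np.ones_like(sampled), zi, zj, 2)
    # Observed-only: unsampled dyads are left out of the averages.
    observed_only = block_means(edges, sampled, zi, zj, 2)
    return pi, naive, observed_only

pi, naive, observed_only = compare_estimators()
print("true probabilities:\n", pi)
print("unsampled dyads coded as 0:\n", naive.round(3))
print("sampled dyads only:\n", observed_only.round(3))
```

In this setup the naive frequencies concentrate around $\rho$ times the true probabilities, while the observed-only frequencies concentrate around the true probabilities, which is the behaviour one would expect when the sampling is ignorable.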

Stochastic Block Model

Missing data for SBM

Sampling design examples

Observed-likelihoods

Models and assumptions

Identifiability

Subexponential variables

Symmetry

Other definitions

Complete-observed model

Main result

Variational and Maximum Likelihood Estimates

ML estimator

Variational estimator

Log-likelihood ratios

High level view of the proof

Global control

Local control

Proof of the main result

Discussion


Journal: Electronic Journal of Statistics	Publication Date: Jan 1, 2020
Citations: 6	License type: cc-by
