Testing for Nodal Dependence in Relational Data Matrices

Alexander Volfovsky,Peter D Hoff

doi:10.1080/01621459.2014.965777

Alexander Volfovsky, Peter D Hoff

Open Access

https://doi.org/10.1080/01621459.2014.965777

Copy DOI

Abstract

Relational data are often represented as a square matrix, the entries of which record the relationships between pairs of objects. Many statistical methods for the analysis of such data assume some degree of similarity or dependence between objects in terms of the way they relate to each other. However, formal tests for such dependence have not been developed. We provide a test for such dependence using the framework of the matrix normal model, a type of multivariate normal distribution parameterized in terms of row- and column-specific covariance matrices. We develop a likelihood ratio test (LRT) for row and column dependence based on the observation of a single relational data matrix. We obtain a reference distribution for the LRT statistic, thereby providing an exact test for the presence of row or column correlations in a square relational data matrix. Additionally, we provide extensions of the test to accommodate common features of such data, such as undefined diagonal entries, a nonzero mean, multiple observations, and deviations from normality. Supplementary materials for this article are available online.

Highlights

Networks or relational data among m actors, nodes or objects are frequently presented in the form of an m × m matrix Y = {yij : 1 ≤ i, j ≤ m}, where the entry yij corresponds to a measure of the directed relationship from object i to object j
We show that T (Ỹ) is invariant under diagonal transformations and we can approximate the null distribution of the test statistic based on for data drawn from a matrix normal distribution where the diagonal entries are replaced with zeros
Unlike the previous testing literature for matrix normal models that required multiple observations, and concentrated on testing a null of separable covariances versus an unstructured alternative, we proposed testing a null of no row or column correlations versus an alternative of full row and column correlations using a single observation of a network

Summary

Introduction

Networks or relational data among m actors, nodes or objects are frequently presented in the form of an m × m matrix Y = {yij : 1 ≤ i, j ≤ m}, where the entry yij corresponds to a measure of the directed relationship from object i to object j. Such data are of interest in a variety of scientific disciplines: Sociologists and epidemiologists gather friendship network data to study social development and health outcomes among children (Fletcher et al, 2011; Potter et al, 2012), economists study markets by analyzing networks of business interactions among companies or countries (Westveld and Hoff, 2011a; Lazzarini et al, 2001), and biologists study gene-gene interaction networks to better understand biological pathways (Bergmann et al, 2003; Stuart et al, 2003).

Likelihood ratio test

Likelihood ratio test statistic and null distribution

Matrix variate elliptically contoured distributions

Power calculations

Maximally sparse Kronecker covariance structured correlation

Misspecified covariance structure

Extensions and Applications

Extensions and continuous data example

Application to binary protein-protein interaction network

Findings

Discussion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of the American Statistical Association	Publication Date: Jul 3, 2015
Citations: 42	License type: cc-by

R Discovery Prime

R Discovery Prime

Testing for Nodal Dependence in Relational Data Matrices

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of the American Statistical Association

Lead the way for us

Similar Papers

AN ANALYSIS OF PAIRED COMPARISON DESIGNS WITH INCOMPLETE REPETITIONS
John W Wilkinson
Biometrika | VOL. 44
John W WilkinsonJohn W Wilkinson
01 Jan 1957
Biometrika | VOL. 44

An enhanced fuzzy c-means clustering using relational information
Jian-Ping Mei ... Lihui Chen
-
Jian-Ping Mei, et. al.Jian-Ping Mei ... Lihui Chen
01 Jul 2011
01 Jul 2011

A unified approach to model selection using the likelihood ratio test
Fraser Lewis ... Lucy Gilbert
Methods in ecology and evolution | VOL. 2
Fraser Lewis, et. al.Fraser Lewis ... Lucy Gilbert
23 Aug 2010
Methods in ecology and evolution | VOL. 2

Noncentral Chi-Square Versus Normal Distributions in Describing the Likelihood Ratio Statistic: The Univariate Case and Its Multivariate Implication
Ke-Hai Yuan
Multivariate Behavioral Research | VOL. 43
Ke-Hai YuanKe-Hai Yuan
19 Mar 2008
Multivariate Behavioral Research | VOL. 43

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Testing for Nodal Dependence in Relational Data Matrices

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of the American Statistical Association