Abstract

Projections of bipartite or two-mode networks capture co-occurrences, and are used in diverse fields (e.g., ecology, economics, bibliometrics, politics) to represent unipartite networks. A key challenge in analyzing such networks is determining whether an observed number of co-occurrences between two nodes is significant, and therefore whether an edge exists between them. One approach, the fixed degree sequence model (FDSM), evaluates the significance of an edge’s weight by comparison to a null model in which the degree sequences of the original bipartite network are fixed. Although the FDSM is an intuitive null model, it is computationally expensive because it requires Monte Carlo simulation to estimate each edge’s p value, and therefore is impractical for large projections. In this paper, we explore four potential alternatives to FDSM: fixed fill model, fixed row model, fixed column model, and stochastic degree sequence model (SDSM). We compare these models to FDSM in terms of accuracy, speed, statistical power, similarity, and ability to recover known communities. We find that the computationally-fast SDSM offers a statistically conservative but close approximation of the computationally-impractical FDSM under a wide range of conditions, and that it correctly recovers a known community structure even when the signal is weak. Therefore, although each backbone model may have particular applications, we recommend SDSM for extracting the backbone of bipartite projections when FDSM is impractical.

Highlights

  • Projections of bipartite or two-mode networks capture co-occurrences, and are used in diverse fields to represent unipartite networks

  • We find that when agent degrees are constant and artifact degrees are constant or left-tailed, all backbone models yield the same backbone as fixed degree sequence model (FDSM) (Mean J = 1 )

  • Backbones extracted using FDSM and stochastic degree sequence model (SDSM) yield modularity values that are statistically significantly larger than those obtained from fixed fill model (FFM), fixed row model (FRM), or fixed column model (FCM) backbones, but that are not statistically significantly different from each other

Read more

Summary

Introduction

Projections of bipartite or two-mode networks capture co-occurrences, and are used in diverse fields (e.g., ecology, economics, bibliometrics, politics) to represent unipartite networks. We explore four potential alternatives to FDSM: fixed fill model, fixed row model, fixed column model, and stochastic degree sequence model (SDSM) We compare these models to FDSM in terms of accuracy, speed, statistical power, similarity, and ability to recover known communities. Methods designed for bipartite projections have recently been d­ eveloped[9,18,21,22] Among these methods, the fixed degree sequence model (FDSM) relies on an intuitive null model, but requires computationally expensive Monte Carlo simulations, making it impractical for extracting the backbone of large bipartite projections. To offer guidance to researchers wishing to extract an FDSM-like backbone from a large bipartite projection, in this paper we consider four potential alternatives to FDSM: fixed fill model (FFM) fixed row model (FRM), fixed column model (FCM), and stochastic degree sequence model (SDSM). We conclude with recommendations for backbone model selection and opportunities for future model development

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call