Abstract

Trait allocations are a class of combinatorial structures in which data may belong to multiple groups and may have different levels of belonging in each group. Often the data are also exchangeable, i.e., their joint distribution is invariant to reordering. In clustering—a special case of trait allocation—exchangeability implies the existence of both a de Finetti representation and an exchangeable partition probability function (EPPF), distributional representations useful for computational and theoretical purposes. In this work, we develop the analogous de Finetti representation and exchangeable trait probability function (ETPF) for trait allocations, along with a characterization of all trait allocations with an ETPF. Unlike previous feature allocation characterizations, our proofs fully capture single-occurrence “dust” groups. We further introduce a novel constrained version of the ETPF that we use to establish an intuitive connection between the probability functions for clustering, feature allocations, and trait allocations. As an application of our general theory, we characterize the distribution of all edge-exchangeable graphs, a class of recently-developed models that captures realistic sparse graph sequences.

Highlights

  • Representation theorems for exchangeable random variables are a ubiquitous and powerful tool in Bayesian modeling and inference

  • We show that the vertex popularity model, a standard example of an edge-exchangeable model, is a constrained frequency model per Definition 5.10, guaranteeing the existence of a constrained exchangeable trait probability function (CETPF) which we call the exchangeable vertex probability function (EVPF)

  • We use a similar technique to the proof of Corollary 5.13—we seek a constrained frequency model (a sequence and set C) that corresponds to the vertex popularity model with weights, and use Theorem 5.12 to obtain a correspondence with a CETPF

Read more

Summary

Introduction

Representation theorems for exchangeable random variables are a ubiquitous and powerful tool in Bayesian modeling and inference. A similar representation generalizing partitions and edge-exchangeable (hyper)graphs has been studied in concurrent work (Crane and Dempsey, 2016b) on relational exchangeability, first introduced by Ackerman (2015); Crane and Towsner (2015)—but here we explore the existence of a trait frequency model, the existence of a constrained trait frequency model and its connection to clustering and feature allocations, and the various connections between frequency models and probability functions. The symbol ×N SN for a sequence of sets (SN ) denotes S1 × S2 × . . . , their infinite product space

Trait allocations
Ordered trait allocations and lexicographic ordering
De Finetti representation of exchangeable trait allocations
Frequency models and probability functions
Application: vertex allocations and edge-exchangeable graphs
Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.