Sampling Zeros Research Articles

An agent-based model (ABM) simulates actions and interactions of the synthetic agents to understand the system-level behaviour. The synthetic population, the key input to ABM, mimics the distribution of the individual-level attributes in the actual population. Since individual-level attributes of the entire population are unavailable, small-scale samples are generally used for population synthesis. Synthesizing the population by directly sampling from the small-scale samples ignores the possible attribute combinations that are observed in the actual population but do not exist in the small-scale samples, called ‘sampling zeros’. A deep generative model (DGM) can potentially synthesize the sampling zeros but at the expense of falsely generating the infeasible attribute combinations that should be ‘zero’ in the generated data but exist, called ‘structural zeros’. This study proposes a novel method to ensure that the generation of structural zeros is minimal while recovering the ignored sampling zeros. Two loss functions for regularizing the DGMs are devised to customize the training and applied to a generative adversarial network (GAN) and a variational autoencoder (VAE). The adopted metrics for feasibility and diversity of the synthetic population indicate the capability of generating sampling and structural zeros – lower generation probability of structural zeros and lower generation probability of sampling zeros indicate the higher feasibility and the lower diversity, respectively. Results show that the proposed loss functions achieve considerable performance improvement in the feasibility and diversity of the synthesized population over traditional models. The proposed VAE additionally generated 23.5 % of the population ignored by the sample with 79.2 % precision (i.e., the generation ratio of structural zeros and the total samples is 20.8 %), while the proposed GAN generated 18.3 % of the ignored population with 89.0 % precision. The proposed improvement in DGM generates a more feasible and diverse synthetic population. Since synthesizing the population is the first stage of ABM, the proposed approach improves the overall accuracy of the ABM by circumventing the error propagation to later modelling stages.

Read full abstract

Population synthesis is concerned with the generation of agents for agent-based modelling in many fields, such as economics, transportation, ecology and epidemiology. When the number of attributes describing the agents and/or their level of detail becomes large, survey data cannot densely support the joint distribution of the attributes in the population due to the curse of dimensionality. It leads to a situation where many attribute combinations are missing from the sample data while such combinations exist in the real population. In this case, it becomes essential to consider methods that are able to impute such missing information effectively. In this paper, we propose to use deep generative latent models. These models are able to learn a compressed representation of the data space, which when projected back to the original space, leads to an effective way of imputing information in the observed data space. Specifically, we employ the Wasserstein Generative Adversarial Network (WGAN) and the Variational Autoencoder (VAE) for a large-scale population synthesis application. The models are applied to a Danish travel survey with a feature-space of more than 60 variables and trained and tested using cross-validation. A new metric that applies to the evaluation of generative models in an unsupervised setting is proposed. It is based on the ability to generate diverse yet valid synthetic attribute combinations by comparing if the models can recover missing combinations (sampling zeros) while keeping truly impossible combinations (structural zeros) models at a minimum. For a low-dimensional experiment, the VAE, the marginal sampler and the fully random sampler generate 5%, 21% and 26% more structural zeros per sampling zero when compared to the WGAN. For a high dimensional case, these figures increase to 44%, 2217% and 170440% respectively. This research directly supports the development of agent-based systems and in particular cases where detailed socio-economic or geographical representations are required.

Read full abstract

Sampling Zeros Research Articles

Related Topics

Articles published on Sampling Zeros

CARE: Large Precision Matrix Estimation for Compositional Data

Secure Nonlinear Sampled-Data Control System Against Stealthy Attack: Multirate Approach

Overview of selected reinforcement learning solutions to several game theory problems

The limiting zero dynamics of discrete-time system based on forward triangle sample-and-hold.

A deep generative model for feasible and diverse population synthesis

Modeling Method for Sampling Zeros of Sampled-data System with Time Delay in Generalized Sample Hold

HiCImpute: A Bayesian hierarchical model for identifying structural zeros and enhancing single cell Hi-C data.

Stability of Zeros for Sampled-Data Models with Triangle Sample and Hold Implemented by Zero-Order Hold

Assessing Bayesian Semi‐Parametric Log‐Linear Models: An Application to Disclosure Risk Estimation

A zero inflated log-normal model for inference of sparse microbial association networks.

On the implication of structural zeros as independent variables in regression analysis: applications to alcohol research

A Zero-Inflated Latent Dirichlet Allocation Model for Microbiome Studies.

Simplified Realization of Zero Phase Error Tracking

Prediction of rare feature combinations in population synthesis: Application of deep generative modelling

Angle Measurement of Fuse Using Linear Frequency Modulation System Based on C6678 MultiCore DSPs

Modelling the number of antenatal care visits in Bangladesh to determine the risk factors for reduced antenatal care attendance.

Towards a Simple Sampled-Data Control Law for Stably Invertible Linear Systems

Approximate Nonlinear Discrete-Time Models Based on B-Spline Functions

Neutralizing zero dynamics attack on sampled-data systems via generalized holds

Nonlinear Sampled-Data Systems with a Generalized Hold Polynomial-Function for Fast Sampling Rates

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Sampling Zeros Research Articles

Related Topics

Articles published on Sampling Zeros

CARE: Large Precision Matrix Estimation for Compositional Data

Secure Nonlinear Sampled-Data Control System Against Stealthy Attack: Multirate Approach

Overview of selected reinforcement learning solutions to several game theory problems

The limiting zero dynamics of discrete-time system based on forward triangle sample-and-hold.

A deep generative model for feasible and diverse population synthesis

Modeling Method for Sampling Zeros of Sampled-data System with Time Delay in Generalized Sample Hold

HiCImpute: A Bayesian hierarchical model for identifying structural zeros and enhancing single cell Hi-C data.

Stability of Zeros for Sampled-Data Models with Triangle Sample and Hold Implemented by Zero-Order Hold

Assessing Bayesian Semi‐Parametric Log‐Linear Models: An Application to Disclosure Risk Estimation

A zero inflated log-normal model for inference of sparse microbial association networks.

On the implication of structural zeros as independent variables in regression analysis: applications to alcohol research

A Zero-Inflated Latent Dirichlet Allocation Model for Microbiome Studies.

Simplified Realization of Zero Phase Error Tracking

Prediction of rare feature combinations in population synthesis: Application of deep generative modelling

Angle Measurement of Fuse Using Linear Frequency Modulation System Based on C6678 MultiCore DSPs

Modelling the number of antenatal care visits in Bangladesh to determine the risk factors for reduced antenatal care attendance.

Towards a Simple Sampled-Data Control Law for Stably Invertible Linear Systems

Approximate Nonlinear Discrete-Time Models Based on B-Spline Functions

Neutralizing zero dynamics attack on sampled-data systems via generalized holds

Nonlinear Sampled-Data Systems with a Generalized Hold Polynomial-Function for Fast Sampling Rates